Illustration Image

Town Hall Replay: Bad Partition Handling & Large Language Models

Melissa Logan on July 26, 2023

Town Hall Replay: Bad Partition Handling & Large Language Models

Apache Cassandra Town Halls are monthly opportunities to share use cases, tips, and learn about Cassandra project news. If you are an Apache Cassandra end user or part of the Cassandra engineering community, Town Halls are a great way to stay up-to-date on community activities. 

Town halls occur at 8am PT on the fourth Thursday of every month.

Improving Bad Partition Handling in Apache Cassandra
Presented by Jordan West and Cheng Wang, Netflix
Reading and compacting bad partitions have long been known to impact Cassandra performance. They have been the root cause of various production issues at Netflix. While there are several potential solutions for addressing them at an implementation level, we must also deal with them today when they arise. There are several forms of bad partitions, which include: a partition that gets large in size several GBs+; a partition with many (millions or more) small rows, potentially spread across many sstables; a partition with many small rows and many of them have been deleted or expired; and a partition with rows that themselves are very large (e.g. blobs of binary or text). This talk presents an approach used at Netflix to handle bad partitions when they arise. Specifically, how to identify, block, and mitigate bad partitions during production incidents. Jordan and Cheng also share ongoing efforts to improve some existing tools as well as new tools for the Cassandra community. 

Unleashing the Power of Large Language Models with Apache Cassandra and Vector Search
Presented by Jonathan Ellis, DataStax
In this talk, Jonathan explores the transformative role of Large Language Models (LLMs) like GPT-4 in developing AI-powered applications. Jonathan also covers how LLMs simplify traditional AI processes, reducing the need for complex data pipelines and bespoke model training to straightforward text-based inputs and queries. Through real-world application examples, Jonathan highlights the potential of LLMs in creating efficient AI systems, and the role of vector search. 

Apache Cassandra Project Updates
Presented by Josh McKenzie, Cassandra PMC Chair

Ways to Participate

To catch an upcoming Town Hall, check out our Planet Cassandra Global Meetup Group. To view previous Town Hall recordings, visit the Planet Cassandra YouTube Channel

If you’re interested in sharing a case study or use case with the community, let us know

For more information or to join the discussion, join us on these channels: 

Become part of our
growing community!
Welcome to Planet Cassandra, a community for Apache Cassandra®! We're a passionate and dedicated group of users, developers, and enthusiasts who are working together to make Cassandra the best it can be. Whether you're just getting started with Cassandra or you're an experienced user, there's a place for you in our community.
A dinosaur
Planet Cassandra is a service for the Apache Cassandra® user community to share with each other. From tutorials and guides, to discussions and updates, we're here to help you get the most out of Cassandra. Connect with us and become part of our growing community today.
© 2009-2023 The Apache Software Foundation under the terms of the Apache License 2.0. Apache, the Apache feather logo, Apache Cassandra, Cassandra, and the Cassandra logo, are either registered trademarks or trademarks of The Apache Software Foundation.

Get Involved with Planet Cassandra!

We believe that the power of the Planet Cassandra community lies in the contributions of its members. Do you have content, articles, videos, or use cases you want to share with the world?