Illustration Image

What exactly is cassandra partition?

I read a lot of cassandra docs, I understand that we have partition key, hash of this key used to split data between partitions to evenly distribute data between nodes.

But what exactly is partition ? Is it a table, or some subset in table, or just another calculated stuff used in order rows on node ? Is it a pure virtual thing, or some real entity that give some overhead ?

Is it better to limit amount of partitions ? For example, I can take remainder from uuid division and use it as partition key, that still equalise data between partitions, but keep partition count low, or I can just use whole uuid ?

Become part of our
growing community!
Welcome to Planet Cassandra, a community for Apache Cassandra®! We're a passionate and dedicated group of users, developers, and enthusiasts who are working together to make Cassandra the best it can be. Whether you're just getting started with Cassandra or you're an experienced user, there's a place for you in our community.
A dinosaur
Planet Cassandra is a service for the Apache Cassandra® user community to share with each other. From tutorials and guides, to discussions and updates, we're here to help you get the most out of Cassandra. Connect with us and become part of our growing community today.
© 2009-2023 The Apache Software Foundation under the terms of the Apache License 2.0. Apache, the Apache feather logo, Apache Cassandra, Cassandra, and the Cassandra logo, are either registered trademarks or trademarks of The Apache Software Foundation. Sponsored by Anant Corporation and Datastax, and Developed by Anant Corporation.

Get Involved with Planet Cassandra!

We believe that the power of the Planet Cassandra community lies in the contributions of its members. Do you have content, articles, videos, or use cases you want to share with the world?