Is High-Cardinality Partition Key a Problem in ScyllaDB?

process_clock_tick · June 3, 2025, 5:06pm

Installation details
#ScyllaDB version: 5.2
os (RHEL/CentOS/Ubuntu/AWS AMI): CentOS

I’m designing a ScyllaDB table where each partition is tied to a unique job_id, which is a UUID. Each job typically inserts a small number of rows — sometimes just one or two. Over time, the number of jobs may grow to several million, resulting in a large number of small partitions.

My current schema looks like this:

PRIMARY KEY ((job_id), item_id, ...other info)

Is it acceptable in ScyllaDB to have millions of small partitions with high-cardinality UUIDs? Or Am I misunderstanding it? Since I read it’s very adviced to focus on high cardinality for partition keys or secondary indexes

Gabriel · June 6, 2025, 9:27am

In ScyllaDB, it’s absolutely acceptable (and often desirable) to have millions of small, high-cardinality partitions, especially when:

Each job_id is unique.
Each partition contains few rows.
Queries are typically by partition key (job_id) — e.g., SELECT * FROM table WHERE job_id = ?.

Topic		Replies	Views
How Do Many Small Partitions Influence Memory Usage in ScyllaDB? ScyllaDB data-model , performance , bloom-filter	1	135	September 25, 2024
Data model for frequent deletes with partition key ScyllaDB data-model	7	80	August 8, 2024
Recommendations for partitioning imbalanced data ScyllaDB data-model , hot-partition	1	125	November 22, 2024
Datamodel for Scylla/Cassandra for table partition key is not known beforehand -> static field? ScyllaDB	1	296	February 9, 2023
What is the maximum number of records that a scylla table can carry? ScyllaDB	3	1488	June 8, 2023

Is High-Cardinality Partition Key a Problem in ScyllaDB?

Related topics