Major Compaction by Partition

GarrettPoore · October 30, 2023, 9:02pm

We’re considering a major compaction in the next few days, due to massive partitions that we are going to shrink and then want to clean up soon afterwards. We believe they are causing performance issues on the cluster.

When looking at the nodetool compact doc page, I see there is a --partition <partition_key> option. The help text isn’t very helpful here, but does that mean we can join the partition columns of the table to specify only that partition(s) to be compacted?

Our primary key is similar to this:

PRIMARY KEY ((foo, bar, baz), id1, id2)

So, based on the doc page we could somehow combine foo, bar, and baz to compact a specific partition?

We’re also confused as to how that even works at the SSTable level, since there should be multiple partitions per SSTable file.

felipemendes · October 30, 2023, 10:41pm

Hah! Good find. We actually removed the --partition option last week in docs: nodetool compact: remove unsupported partition option · scylladb/scylladb@70ba6b9 · GitHub

I guess that answers your question

FYI @Anna @Botond_Denes

GarrettPoore · October 31, 2023, 2:14am

Yep, that does indeed answer all of my questions, thank you Felipe.

Topic	Replies	Views
Memory issue - deleting big partitions in ScyllaDB with TimeWindowCompactionStrategy ScyllaDB data-model , troubleshooting , sizing , compaction	311	March 19, 2024
Last week in scylladb.git master (issue #192; 2023-08-13) ScyllaDB git-news	234	August 13, 2023
Last week in scylladb.git master (issue #235; 2024-06-23) ScyllaDB git-news	75	June 23, 2024
[RELEASE] ScyllaDB Enterprise 2022.1.10 Release Notes enterprise , enterprise-release , enterprise-2022-1	359	September 4, 2023
Last week in scylladb.git master (issue #161; 2023-01-01) ScyllaDB git-news	339	January 1, 2023

Major Compaction by Partition

Related topics