Get the approximate number of rows of a table?

Hartmut · December 25, 2022, 12:57pm

Hi, is there a (cheap + fast) way to get the approximate number of rows of a table?
I know there’s e.g. ‘Number of partitions (estimate)’ of tablestats.

I’d like to be able to be able to query via CQL, as an light-weight alternative to select count(*) from my_table.

avikivity · December 25, 2022, 5:23pm

In general there is no good way to do this in a way similar to compaction statistics. While we can count the rows in an sstables, those rows could overlap the rows in another sstable (so we’d count them twice), or could overlap a tombstone in another sstable (and so should not be counted at all).

Starting with ScyllaDB 5.1, SELECT COUNT(*) FROM tab is automatically parallelized across all nodes and shards. In conjunction with Consistency Level LOCAL_ONE, this is much faster that before, but still requires significant CPU and I/O resources.

Hartmut · December 25, 2022, 5:47pm

Understood.
Thanks for the reply anyway!

Topic		Replies	Views
Counter Table vs SELECT COUNT for partition row count ScyllaDB	2	298	July 8, 2024
How to solve row count in a table time out ScyllaDB	1	336	January 17, 2024
Best way to Fetch N rows in ScyllaDB: Count, Limit or Paging ScyllaDB data-model , ttl , drivers	2	3623	December 11, 2022
Latency issue and data retrieval, page size and number of threads (Java Driver) ScyllaDB data-model , java-driver , paging , latency , threads	0	43	December 23, 2024
How to find all of the database's partitions efficiently, full table scan ScyllaDB data-model , performance , cql	0	141	September 1, 2024

Get the approximate number of rows of a table?

Related topics