How Does ScyllaDB Find the Node Containing the Data I Want?

Guy · December 11, 2022, 7:53am

The driver can connect to any Scylla node and perform a query. That node will be designated as the coordinator node for the given query. The coordinator node can be the replica node (the one holding the data), but it doesn’t have to be.

In ScyllaDB (and Apache Cassandra) Each node in the cluster is responsible for a set of tokens.

The coordinator node hashes the Partition Key, using the Partition Hash Function to determine which nodes are responsible for that data.

Because the partition hash function is known to the client, token-aware drivers can optimize the performance by choosing the coordinator node as one of the replica nodes.

This is efficient and as a result the number of network hops is lower and the cluster internal load gets reduced.

Scylla shard-aware drivers further increase performance by routing the query not only to the right replica node but also to the right shard (or CPU core) within that node.

Additional Resources:

Using Scylla Drivers course on ScyllaDB University
Cluster - Node Ring on ScyllaDB University

Scylla Architecture - Fault Tolerance on ScyllaDB Docs

*The question was originally asked on the user slack channel

Topic		Replies	Views
How to get Node Sharding information using Scylla Java Driver? ScyllaDB drivers	2	468	July 14, 2023
Does a CQL query become token-aware depending on its size? ScyllaDB drivers , cql , architecture	1	152	April 15, 2024
Why does a token-unaware query involving a local secondary index require a round trip? ScyllaDB data-model , drivers , secondary-index	1	27	July 28, 2025
How to ensure a Batch query reaches the correct partition? ScyllaDB drivers , rust	7	359	January 30, 2024
Can I use NodeJS to interact with ScyllaDB? Any examples? ScyllaDB drivers	1	331	September 28, 2023

How Does ScyllaDB Find the Node Containing the Data I Want?

Related topics