Bootstrap repair of us-west nodes takes time in multi-dc cluster

swarooppatra · February 8, 2024, 7:40am

Hi,

We have a multi-DC ScyllaDB cluster with 10 nodes in AWS us-east-1 and 10 nodes in us-west-2. We use scylla-ansible-roles to bring up new clusters. We have observed that bootstrapping nodes in usw2 takes much longer than in use1. It looks like table repair during bootstrap is taking a long time. Any pointers on how to debug and fix this?

Thanks,
Swaroop

avikivity · February 8, 2024, 2:57pm

Please provide the relevant logs that show what is taking time.

Botond_Denes · February 14, 2024, 9:29am

Do the nodes in different DCs have differing shard count? Are you using RBNO based bootstrap?

swarooppatra · February 22, 2024, 2:33pm

Here are a few lines from the logs. It looks like the repair during bootstrap takes a lot of time in us-west-2: in us-east-1 it took 13 seconds, whereas in us-west-2 it took about 27 minutes.

in US-EAST-1:
Feb 22 13:06:07 x.y.z.ec2.internal scylla[2966]: [shard 0:stre] repair - bootstrap_with_repair: started with keyspaces={system_traces, system_distributed_everywhere, system_distributed, system_auth}, nr_ranges_total=9179
Feb 22 13:06:20 x.y.z.ec2.internal scylla[2966]: [shard 0:stre] repair - bootstrap_with_repair: finished with keyspaces={system_traces, system_distributed_everywhere, system_distributed, system_auth}

in US-WEST-2:
Feb 22 13:07:34 a.b.c.ec2.internal scylla[3055]: [shard 0:stre] repair - bootstrap_with_repair: started with keyspaces={system_traces, system_distributed_everywhere, system_distributed, system_auth}, nr_ranges_total=9421
Feb 22 13:34:28 a.b.c.ec2.internal scylla[3055]: [shard 0:stre] repair - bootstrap_with_repair: finished with keyspaces={system_traces, system_distributed_everywhere, system_distributed, system_auth}
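
To make the comparison between the two DCs explicit, here is a minimal Python sketch that computes the elapsed time between a "started" and a "finished" line. The helper name bootstrap_repair_seconds is hypothetical, the log lines are shortened copies of the ones above, and the year is an assumption because journald-style timestamps do not carry one. The "%b" month parsing also assumes an English locale.

```python
from datetime import datetime


def bootstrap_repair_seconds(started_line, finished_line, year=2024):
    """Seconds elapsed between two log lines that begin with 'Feb 22 13:06:07'."""

    def parse(line):
        # The first three whitespace-separated fields are month, day, time.
        stamp = " ".join(line.split()[:3])
        # The year is not in the log line, so it is supplied by the caller.
        return datetime.strptime(f"{year} {stamp}", "%Y %b %d %H:%M:%S")

    return (parse(finished_line) - parse(started_line)).total_seconds()


east = bootstrap_repair_seconds(
    "Feb 22 13:06:07 x.y.z.ec2.internal scylla[2966]: repair - bootstrap_with_repair: started",
    "Feb 22 13:06:20 x.y.z.ec2.internal scylla[2966]: repair - bootstrap_with_repair: finished",
)
west = bootstrap_repair_seconds(
    "Feb 22 13:07:34 a.b.c.ec2.internal scylla[3055]: repair - bootstrap_with_repair: started",
    "Feb 22 13:34:28 a.b.c.ec2.internal scylla[3055]: repair - bootstrap_with_repair: finished",
)
print(f"us-east-1: {east:.0f} s, us-west-2: {west:.0f} s")
```

With the timestamps above this gives 13 seconds for us-east-1 and 1614 seconds (26 min 54 s) for us-west-2.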

swarooppatra · February 29, 2024, 8:50am

@avikivity,
I have attached some logs for reference in this thread. Can you please provide some pointers?

Botond_Denes · February 29, 2024, 12:26pm

The attached logs do not contain any information about what might be causing the slowness.

swarooppatra · February 29, 2024, 12:48pm

I didn’t find any errors in the scylla-server logs. Let me know where else to check.

Botond_Denes · February 29, 2024, 12:52pm

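Can you please answer my earlier questions (do the nodes in different DCs have differing shard counts, and are you using RBNO-based bootstrap)? The answers might provide a lead.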

swarooppatra · February 29, 2024, 1:19pm

The nodes in both DCs are the same EC2 instance type and have the same number of shards.
I think it is using RBNO-based bootstrap. I am using the scylla-ansible-roles git repo to create the Scylla cluster. In that repo, the scylla.yaml template is at ansible-scylla-node/templates/scylla.yaml.j2 (master branch of scylladb/scylla-ansible-roles on GitHub).
I don’t see any RBNO config in this template, so I think it is using RBNO (the default approach).

swarooppatra · March 4, 2024, 10:10am

Could this be because there is only 1 seed and this seed is in us-east-1?

Botond_Denes · March 4, 2024, 2:04pm

Seeds are only used when joining the cluster, they are not used afterwards.

Botond_Denes · March 5, 2024, 5:20am

The fact that small tables take a long time to repair, much longer than one would expect, is a known problem, and we recently merged a pull request improving this.
That said, I don’t know why those small system tables take so much more time to stream in one DC compared with the other.
How did you configure the replication of system_auth? This is a keyspace that the user is expected to adjust as the cluster is expanded.

swarooppatra · March 5, 2024, 9:45am

I hope this new PR will reduce the time taken.

After all nodes in the cluster are up and running, I run a script which:

1. changes the RF of system_auth
2. adds new roles and deletes the cassandra role
3. runs a nodetool repair. BTW, this repair also takes a lot of time to complete in a multi-DC cluster, just like bootstrap.
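
For readers following along, the first step of such a script might be sketched as below. This is only an illustration, not the actual script from this post: the helper name alter_system_auth_rf, the DC names "us-east" and "us-west-2", and the RF of 3 per DC are all assumptions. Check the real DC names with nodetool status before using anything like this.

```python
def alter_system_auth_rf(rf_per_dc):
    """Build a CQL statement setting a per-DC replication factor for system_auth."""
    # rf_per_dc maps a datacenter name (as reported by the snitch) to its RF.
    opts = ", ".join(f"'{dc}': {rf}" for dc, rf in rf_per_dc.items())
    return (
        "ALTER KEYSPACE system_auth WITH replication = "
        f"{{'class': 'NetworkTopologyStrategy', {opts}}};"
    )


# Assumed DC names and RF values, for illustration only.
statement = alter_system_auth_rf({"us-east": 3, "us-west-2": 3})
print(statement)
```

The printed statement can be run through cqlsh; the repair in the last step would then typically be limited to that keyspace, e.g. nodetool repair system_auth.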

Botond_Denes · March 5, 2024, 11:56am

Yes, it is repair that takes a long time for tiny tables. We recently moved node operations to use repair behind the scenes (hence the name RBNO, repair-based node operations), so node operations are now affected too.

For reference, this is the PR: "repair: Introduce small table optimization" by asias (Pull Request #15974, scylladb/scylladb on GitHub).
