Extracting data from Scylla for data mining

Bill_Dunmire · July 12, 2023, 9:46pm

I would like to extract data in batches from ScyllaDB and dump them into files or an OLAP database for data mining. Do you have a suggestion?

Guy · July 13, 2023, 9:00am

There are a number of ways to achieve this. One is using Presto (see this tech talk).
Another is by using Spark with ScyllaDB. By doing so, you deploy analytics workloads on information stored in ScyllaDB. You can learn more about this and see a hands-on lab in the Using Spark with ScyllaDB lesson on ScyllaDB University.

Topic		Replies	Views
How to bulk fetch data from ScyllaDB? ScyllaDB cdc , kafka , elasticsearch	2	529	December 13, 2022
How to integrate Apache Solr and Apache Spark with ScyllaDB? ScyllaDB migration	3	374	October 12, 2023
Integration with PrestoDB and Metbase Hands-on Blog Posts integration	0	188	September 12, 2023
Where can I find a docker-compose for ScyllaDB with Spark - Analytical workloads? ScyllaDB docker , spark , analytics , olap	0	23	September 25, 2024
Copying ScyllaDB data to S3, using Spark, performance optimization ScyllaDB performance , sstable , backup-restore	0	62	November 17, 2024

Extracting data from Scylla for data mining

Related topics