Request for Guidance: Using ScyllaDB for API Response Caching (POC in Progress)

Dadasaheb · September 21, 2025, 6:16pm

Dear ScyllaDB Team,

I am currently working on a POC (Proof of Concept) using ScyllaDB on-prem trial version as a caching layer for some of our APIs, such as transaction history , account summary etc. The goal is to reduce load on our primary database and serve frequently accessed responses with low latency.

As part of this POC, I want to validate the best approach and architecture to utilize ScyllaDB effectively for this scenario. Below are some design options I am considering, and I would appreciate your input or alternative recommendations. Please feel free to suggest best alternatives, your guidance would be very valuable:

Request Hash–based Caching

Request Hash-based Caching
1. Store API responses against a unique request hash.
2. Example:
```
CREATE TABLE api_cache.responses (
  request_hash TEXT PRIMARY KEY,
  endpoint TEXT,
  response TEXT,
  created_at TIMESTAMP
);
```
3. Concern: If requests grow into millions, ScyllaDB may accumulate old unused data.
4. Question: Should we rely on TTL for automatic cleanup, or are there better strategies for eviction and cache invalidation?
Parameter–based Caching (Storing input params as columns)
1. Save request parameters directly as columns with the API response.
2. Example:
```
CREATE TABLE api_cache.transaction_history (
  account_id TEXT,
  from_date DATE,
  to_date DATE,
  response TEXT,
  PRIMARY KEY ((account_id), from_date, to_date)
);
```
3. Concern: If new request parameters are introduced later, schema evolution may be required.
4. Question: Is this a recommended approach for caching APIs where input parameters may evolve over time?
Cluster Sizing & Replication Factor
1. For a read-heavy caching workload with medium writes, how many nodes would you recommend starting with?
2. What replication factor is best suited for caching scenarios where fast retrieval is critical but cost efficiency also matters?
Integration with .NET Core Middleware
1. 1. Our APIs are developed in .NET, and we are planning a middleware that decides whether to fetch data from ScyllaDB or the primary DB.
  2. Question: Are there recommended .NET client libraries or best practices for this integration pattern?

We would highly value your guidance on:

The most suitable caching strategy (hash-based vs. param-based or hybrid).
Best practices for TTL, cache invalidation, and handling schema evolution.
Recommended cluster sizing for POC vs. production workloads.

As this is part of our ongoing POC, your suggestions will be critical in shaping our final architecture and ensuring that we leverage ScyllaDB’s strengths effectively.

Looking forward to your recommendations.

Best regards,
Dadasaheb

Guy · December 1, 2025, 7:50am

What is the primary database you’re using?
Some users started out using ScyllaDB as a caching layer, but later switched the entire database to ScyllaDB to get better results. This webinar is a useful resource.

A replication factor of three works for many use cases.

Regarding sizing, it really depends, what is your payload size?

This database and cache internals blog post might also be relevant, and maybe can add more info.

Topic		Replies	Views
Do I ever need to disable the ScyllaDB cache to use less memory? ScyllaDB cache	2	395	February 8, 2024
I want to know Best Practices for Scaling ScyllaDB in a High-Traffic Environment ScyllaDB	0	68	January 21, 2025
How to Optimize ScyllaDB Performance for High Throughput Applications? ScyllaDB data-model , performance , scylladb-monitoring , architecture	1	418	October 28, 2024
95% memory usage scyllaDB even when idle ScyllaDB testing	3	607	April 13, 2024
Is ScyllaDB Right for My Application? ScyllaDB's Sweet Spot ScyllaDB performance , high-availability , latency , throughput	4	2409	January 17, 2024

Request for Guidance: Using ScyllaDB for API Response Caching (POC in Progress)

Related topics