Probabilistic Data: HyperLogLog

Does it exist HyperLogLog data structure for distinct counting like in Redis or Aerospike?

Did you find a solution for this?
It seems that Spark supports this, you can try to look at connecting Spark with ScyllaDB.

No, but it’s an interesting idea!
I suggest opening an issue for it, or commenting on Implement custom merging algorithms · Issue #1321 · scylladb/scylladb · GitHub