Requirement is simple: we have to create a table which will have only 24 hours of data.
We have two options
- Defile TTL with each insert
- Make table property default_time_to_live for 24 hours.
I have general idea about both the things but internally which one will be helpful to deal with tombstones? or both will generate same amount of tombstones? Which one will be better and why any reference link will be appreciated.
If a table has default_time_to_live
on it then rows that exceed this time limit are deleted immediately without tombstones being written. This will not affect rows / columns that have an explicit TTL set on them. These will be tombstoned.
If you go down the TTL route then you should consider setting the gc_grace_seconds
property on the table to something less than the default (10 days). Particularly if you are looking at a 24 hour TTL.
References:
How data is deleted <-- Good background
CREATE TABLE properties <-- Table property reference
About Deletes and Tombstones in Cassandra <-- Everything you ever wanted to know about deletes and tombstones
If you use Cassandra 3.0 you can also define materialized view, see details: https://docs.datastax.com/en/cql/3.3/cql/cql_using/useCreateMV.html
Using TTL is not that effective as you will generate lots of tombstones, which depending on the amount of data might effect your read performance.
Also I think your question regarding the TTL is answered here:
cassandra TTL for table behaviour