the mathematics of compression in database systems
understanding LSM trees via read, write, and space amplification
sorted string tables (SST) from first principles