How LSM trees work and why they can be optimized for almost any workload
Read amplification and write amplification must
also be measured in number of IO requests and
not just in relative data volumes.
Thanks for the comment! I agree, but since those are "typically" correlated I papered over that to make the core concept simpler. I'll add a footnote to clarify.
Relative data volumes and relative numbers of IO
requests are not correlated at all. Take a close look
at how file systems work. Extending writes in sparse
allocation file systems may result in multiple small
IOs to space maps and to inodes for even one byte
of data written.
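
A rough sketch of the point above, in Python. This is a toy model, not a measurement of any real file system: the 4096-byte block size, the two metadata IOs (standing in for one space map update and one inode update), and the names `IoStats` and `extending_write` are all assumptions made for illustration. It also assumes the data itself goes out as one contiguous request.

```python
from dataclasses import dataclass


@dataclass
class IoStats:
    user_bytes: int       # bytes the application asked to write
    device_bytes: int     # bytes actually sent to the device
    device_requests: int  # IO requests issued to the device

    def volume_amplification(self) -> float:
        """Device bytes per user byte."""
        return self.device_bytes / self.user_bytes

    def request_amplification(self, user_requests: int = 1) -> float:
        """Device IO requests per user request."""
        return self.device_requests / user_requests


def extending_write(user_bytes: int, block_size: int = 4096,
                    metadata_ios: int = 2) -> IoStats:
    """Hypothetical model of one appending write.

    Assumes the data is rounded up to whole blocks and written in a single
    request, and that each metadata update costs one block-sized IO.
    """
    data_blocks = -(-user_bytes // block_size)  # ceiling division
    device_bytes = (data_blocks + metadata_ios) * block_size
    device_requests = 1 + metadata_ios
    return IoStats(user_bytes, device_bytes, device_requests)


if __name__ == "__main__":
    for size in (1, 1 << 20):
        s = extending_write(size)
        print(size, s.volume_amplification(), s.request_amplification())
```

Under these assumptions a 1-byte write and a 1 MiB write both cost three IO requests, while their volume amplification differs by roughly four orders of magnitude, which is why the two metrics are worth reporting separately.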