On 6 Jan 2020, Eric Wheeler spake thusly:
On Sat, 4 Jan 2020, Coly Li wrote:
In year 2007 high performance SSD was still expensive, in order to save more space for real workload or meta data, the readahead I/Os for non-meta data was bypassed and not cached on SSD.
It's also because readahead data is more likely to be useless.
In now days, SSD price drops a lot and people can find larger size SSD with more comfortable price. It is unncessary to bypass normal readahead I/Os to save SSD space for now.
Doesn't this reduce the utility of the cache by polluting it with unnecessary content? It seems to me that we need at least a *litle* evidence that this change is beneficial. (I mean, it might be beneficial if on average the data that was read ahead is actually used.)
What happens to the cache hit rates when this change has been running for a while?