Re: [GIT PULL] Ext3 latency fixes



On Sat, Apr 04 2009, Linus Torvalds wrote:


On Sat, 4 Apr 2009, Jens Axboe wrote:

I'm sorry, but that fsync thing _is_ a real-world case, and it's the one
that a hell of a lot more people care about than some idiotic sqlite
throughput issue.

sqlite is just one case, I'm sure there are others. My point is that we
should make sure that we don't regress on the throughput side. It's a
trade off, we don't want throughput to fall through the floor either.

Jens, we _have_ regressed on the latency side. Everybody agrees.

It appears so, yes.

Also, I may be odd, but I really do think latency is more important than
throughput. When my disk has latencies in the sub-milliseconds, I simply
do not think it is _acceptable_ to have hickups that affect my workload in
human-visible terms.

Not everyone has an Intel SSD. But yes, latency is definitely more
important than throughput. That's not the same as saying that throughput
doesn't matter, because it definitely does.

You say sqlite might regress by 4-5x. But Ted's numbers improve latencies
by mor than that. I haven't re-created them yet myself (still reading
email), but the point is, 4-5x may sound bad to you, but turn it around:
the current latency situation is _really_ bad. If we can fix it, we
definitely should.

I haven't either. On monday I'll throw some testing and patches on the
boxes here. We can get the latency right, I want that as much as the
next guy. I just want to make sure it doesn't become too one-sided.

Quite frankly, the fact that I can see _seconds_ of latencies with a
really good SSD is not acceptable. The fact that it is by design is even
less so.

Agree, multi-second latencies is not acceptable.

I can literally send you strace output from my MUA, where it pauses for
ten seconds after it has written about 5kB (that's _kilobytes_) of data
and does a 'fsync'.

Unless you make all journal writes sync, ext3 fsync will always suck big
time. But I get your point.

--
Jens Axboe

--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/



Relevant Pages

  • Re: IO scheduler based IO controller V10
    ... On Fri, 2 Oct 2009, Jens Axboe wrote: ... Mostly they care about throughput, and when they come running because ... So yes, latency is not ignored, but mostly they ...
    (Linux-Kernel)
  • Re: [GIT PULL] Ext3 latency fixes
    ... On Sat, 4 Apr 2009, Jens Axboe wrote: ... Big nack on this patch. ... should make sure that we don't regress on the throughput side. ... It's pretty easy to get good latency if you disregard any ...
    (Linux-Kernel)
  • Re: [GIT PULL] Ext3 latency fixes
    ... On Sat, 4 Apr 2009, Jens Axboe wrote: ... should make sure that we don't regress on the throughput side. ... we _have_ regressed on the latency side. ... You say sqlite might regress by 4-5x. ...
    (Linux-Kernel)
  • Re: 2.6.12 Performance problems
    ... > at the cost of decreased throughput - can't ... "Low latency comes at the cost of decreased ... Threading the kernel ...
    (Linux-Kernel)
  • Re: 2.6.12 Performance problems
    ... > at the cost of decreased throughput - can't ... "Low latency comes at the cost of decreased ... Threading the kernel ...
    (Linux-Kernel)