Re: Higer latency with dynamic tick (need for an io-ondemand govenor?)
- From: Arjan van de Ven <arjan@xxxxxxxxxxxxx>
- Date: Sat, 19 Apr 2008 23:19:48 -0700
On Fri, 18 Apr 2008 10:43:32 -0500
"Woodruff, Richard" <r-woodruff2@xxxxxx> wrote:
Hi,
When capturing some traces with dynamic tick we were noticing the
interrupt latency seems to go up a good amount. If you look at the
trace the gpio IRQ is now offset a good amount. Good news I guess is
its pretty predictable.
* If we couple this with progressively higher latency C-States we see
that IO speed can fall by a good amount, especially for PIO mixes.
Now if QOS is maintained you may or may-not care.
I was wondering what thoughts of optimizing this might be.
One thought was if an io-ondemand of some sort was used. It could
track interrupt statistics and be feed back into cpu-idle. When
there is a high interrupt load period it could shrink the acceptable
latency and thus help choose a good a C-State which favors
throughput. Some moving average window could be used to track it.
Perhaps a new interrupt attribute could be attached at irq request
time to allow the tracking of bandwidth important devices.
The attached is captured on a .22 kernel. The same should be
available in a bit on a .24 kernel.
So right now we have the pmqos framework (and before that we had a simpler version of this);
so if your realtime (or realtime-like) system cannot deal with latency longer then X usec,
you can just tell the kernel,, and the deeper power states that have this latency, just won't get used.
What you're mentioning is sort-of-kinda different. It's the "most of the time go as deep as you can,
but when I do IO, it hurts throughput".
There's two approaches to that in principle
1) Work based on historic behavior, and go less deep when there's lots of activity in the (recent) past
A few folks at Intel are working on something like this
2) You have the IO layer tell the kernel "heads up, something coming down soon"
This is more involved, especially since it's harder to predict when the disk will be done.
(it could be a 10msec seek, but it could also be in the disks cache memory, or it could be an SSD or,
the disk may have to read the sector 5 times because of weak magnetics... it's all over the map)
Another complication is that we need to only do this for "synchronous" IO's.. which is known at higher layers
in the block stack, but I think gets lost towards the bottom.
There's another problem with 2); in a multicore world; all packages EXACPT for the one which will get the irq can go to a deeper state anyway....
but it might be hard to predict which CPU will get the completion irq.
--
If you want to reach me at my work email, use arjan@xxxxxxxxxxxxxxx
For development, discussion and tips for power savings,
visit http://www.lesswatts.org
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/
- Follow-Ups:
- RE: Higer latency with dynamic tick (need for an io-ondemand govenor?)
- From: Woodruff, Richard
- RE: Higer latency with dynamic tick (need for an io-ondemand govenor?)
- References:
- Higer latency with dynamic tick (need for an io-ondemand govenor?)
- From: Woodruff, Richard
- Higer latency with dynamic tick (need for an io-ondemand govenor?)
- Prev by Date: Re: x86: 4kstacks default
- Next by Date: Re: [PATCH] Check for breakpoint in text_poke to eliminate bug_on
- Previous by thread: Re: [linux-pm] Higer latency with dynamic tick (need for an io-ondemand govenor?)
- Next by thread: RE: Higer latency with dynamic tick (need for an io-ondemand govenor?)
- Index(es):
Relevant Pages
|