Re: Stolen and degraded time and schedulers



On Tue, 2007-03-13 at 14:59 -0700, Jeremy Fitzhardinge wrote:
Daniel Walker wrote:
The frequency tracking you mention is done to some extent inside the
timekeeping adjustment functions, but I'm not sure it's totally accurate
for non-timekeeping, and it also tracks things like interrupt latency.
Tracking frequency changes where it's important to get it right
shouldn't be done I think ..

If you want accurate time accounting, don't use the TSC .


I'm not sure I follow you here. Clocksources have the means to adjust
the rate of time progression, mostly to warp the time for things like
ntp. The stability or otherwise of the tsc is irrelevant.

The adjustments that I spoke of above are working regardless of ntp ..
The stability of the TSC directly effects the clock mult adjustments in
timekeeping, as does interrupt latency since the clock is essentially
validated against the timer interrupt.

If you had a clocksource which was explicitly using the rate at which a
CPU does work as a timebase, then using the same warping mechanism would
allow you to model CPU speed changes.

like I said there are other factors so that's not going to exactly model
cpu speed changes. You could come up with another method, but that would
likely require another known constant clock.

The sched_clock interface is basically a stripped down clocksource..
I've implemented sched_clock as a clocksource in the past ..


Yes, that works. But a clocksource is strictly about measuring the
progression of real time, and so doesn't generally measure how much work
a CPU has done.

sched_clock doesn't measure amounts of cpu work either, it's all about
timing.

We currently have a sched_clock interface in paravirt_ops to deal with
the hypervisor aspect. It only occurred to me this morning that cpufreq
presents exactly the same problem to the rest of the kernel, and so
there's room for a more general solution.


Are there other architecture which have this per-cpu clock frequency
changing issue? I worked with several other architectures beyond just
x86 and haven't seen this issue ..

Well, lots of cpus have dynamic frequencies. Any scheduler which
maintains history will suffer the same problem, even on UP. If
processes A and B are supposed to have the same priority and they both
execute for 1ms of real time, did they make the same amount of
progress? Not if the cpu changed speed in between.

That's true, but given a constant clock (like what sched_clock should
have) then the accounting is similarly inaccurate. Any connection
between the scheduler and the TSC frequency changes aren't part of the
design AFAIK ..

And any system which commonly runs virtualized (s390, power, etc) will
need to deal with the notion of stolen time.

I haven't followed the "stolen time" discussion, but just a brief look
at your first email I'd say don't mess with the clocks .. The clocks
should always reflect the time accurately .. That's the point of the
clocks, and when the TSC, or any other clock, changes frequency it
sucks..

I haven't thought it through completely, but you might be able to solve
the issue by adding a value to each jiffie in the scheduler or altering
the scheduler to extend the number of jiffies a task gets pending on the
virtual speed of the cpu..

Daniel

-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/



Relevant Pages

  • Re: System clock speed too high - 2.6.3 kernel
    ... There are three clock sources in the x86 kernel, ... The PIT has been in the hardware since, well, day one. ... The TSC is a newer thing so is not available on all ... direct indicator of the cpu speed. ...
    (Linux-Kernel)
  • Re: [PATCH] [15/58] i386: Rewrite sched_clock
    ... Do everything per CPU. ... around cpu frequency changing, so fall back during a backing ... physical TSC. ... +/* A fast clock for the scheduler. ...
    (Linux-Kernel)
  • [PATCH] [4/45] i386: Rewrite sched_clock
    ... Do everything per CPU. ... physical TSC. ... +/* A fast clock for the scheduler. ... extern int unsynchronized_tsc; ...
    (Linux-Kernel)
  • [PATCH] [15/58] i386: Rewrite sched_clock
    ... Do everything per CPU. ... This avoids problems with TSCs that ... physical TSC. ... +/* A fast clock for the scheduler. ...
    (Linux-Kernel)
  • Re: [PATCH] [15/58] i386: Rewrite sched_clock
    ... Do everything per CPU. ... around cpu frequency changing, so fall back during a backing ... physical TSC. ... + * Scheduler clock - returns current time in nanosec units. ...
    (Linux-Kernel)