Re: [PATCH RFC] pm_qos_requirement might sleep



On Wed, Aug 13, 2008 at 10:24:54AM +0200, John Kacur wrote:
On Wed, Aug 13, 2008 at 12:49 AM, mark gross <mgross@xxxxxxxxxxxxxxx> wrote:
On Wed, Aug 06, 2008 at 12:18:08AM +0200, John Kacur wrote:
On Tue, Aug 5, 2008 at 11:09 PM, Peter Zijlstra <peterz@xxxxxxxxxxxxx> wrote:
On Tue, 2008-08-05 at 13:49 -0700, mark gross wrote:
On Tue, Aug 05, 2008 at 09:25:01AM +0200, Peter Zijlstra wrote:
On Mon, 2008-08-04 at 22:52 +0200, John Kacur wrote:
Even after applying some fixes posted by Chirag and Peter Z, I'm still
getting some messages in my log like this

BUG: sleeping function called from invalid context swapper(0) at
kernel/rtmutex.c:743
in_atomic():1 [00000001], irqs_disabled():1
Pid: 0, comm: swapper Tainted: G W 2.6.26.1-rt1.jk #2

Call Trace:
[<ffffffff802305d3>] __might_sleep+0x12d/0x132
[<ffffffff8046cdbe>] __rt_spin_lock+0x34/0x7d
[<ffffffff8046ce15>] rt_spin_lock+0xe/0x10
[<ffffffff802532e5>] pm_qos_requirement+0x1f/0x3c
[<ffffffff803e1b7f>] menu_select+0x7b/0x9c
[<ffffffff8020b1be>] ? default_idle+0x0/0x5a
[<ffffffff8020b1be>] ? default_idle+0x0/0x5a
[<ffffffff803e0b4b>] cpuidle_idle_call+0x68/0xd8
[<ffffffff803e0ae3>] ? cpuidle_idle_call+0x0/0xd8
[<ffffffff8020b1be>] ? default_idle+0x0/0x5a
[<ffffffff8020b333>] cpu_idle+0xb2/0x12d
[<ffffffff80466af0>] start_secondary+0x186/0x18b

---------------------------
| preempt count: 00000001 ]

snip

+ spin_unlock_irqrestore(&pm_qos_rawlock, flags);

return ret_val;
}

As long as RAW_SPINLOCK compiles to a normal spinlock for non-RT premept
kernels I'm don't see a problem, as the change is almost a no-op for
non-RT kernels.

Correct, kernels with the rt patch that are configured to be non-rt
change the raw_spinlock to a normal spinlock. This patch still applies
to rt kernels only.

I was confused about this point, as I thought I saw raw_spinlock defined
in the mainline tree.



Signed-off-by: mark gross <mgross@xxxxxxxxxxxxxxx>

Should I send an updated patch that includes a change to the comment
block regarding the locking design after this patch or instead of it?


I've updated my patch to include changes to the comment block about
the locking design. I've also added your Signed-off-by line . (Maybe
Acked-by: would be more appropriate?)

thanks,

Ok, see below for Acked-by: line.


Now that I've separated locking of the target value from the other
locks, Peter's question still remains. Could the lock protecting
target be dropped from mainline which would allow us to drop this
patch altogether from rt? For now the safe thing to do is keep it
protected, but could you explain why it is needed? (it may very well
be)


This code is doing list deletions, insertions and walks / element
updates in a multi threaded environment where many processes on many
CPU's can be adding removing and updating PM_QOS request, a lot (tm).

So I think I still need to locking for the list walking and list element
updating code on face value. I'm not supper good with lists, perhaps
there is a trick to protecting the lists better than the way I'm doing
it.

Keeping a lock around the different "target_value"s may not be so
important. Its just a 32bit scaler value, and perhaps we can make it an
atomic type? That way we loose the raw_spinlock.

Acked-by: mark gross <mgross@xxxxxxxxxxxxxxx>

Thank You.
(updated patch attached)

pm_qos_requirement-fix
Add a raw_spinlock_t for target. target is modified in pm_qos_requirement
called by idle so it cannot be allowed to sleep.

Signed-off-by: John Kacur <jkacur at gmail dot com>
Signed-off-by: mark gross <mgross@xxxxxxxxxxxxxxx>

Index: linux-2.6.26.1-rt1.jk/kernel/pm_qos_params.c
===================================================================
--- linux-2.6.26.1-rt1.jk.orig/kernel/pm_qos_params.c
+++ linux-2.6.26.1-rt1.jk/kernel/pm_qos_params.c
@@ -42,9 +42,10 @@
#include <linux/uaccess.h>

/*
- * locking rule: all changes to target_value or requirements or notifiers lists
+ * locking rule: all changes to requirements or notifiers lists
* or pm_qos_object list and pm_qos_objects need to happen with pm_qos_lock
- * held, taken with _irqsave. One lock to rule them all
+ * held, taken with _irqsave. target is locked by pm_qos_rawlock because it
+ * is modified in pm_qos_requirement called from idle and cannot sleep.

Actually, the target is only getting read by CPUIDLE from idle. It
shouldn't get changed from the idle context.

--mgross

*/
struct requirement_list {
struct list_head list;
@@ -111,6 +112,7 @@ static struct pm_qos_object *pm_qos_arra
};

static DEFINE_SPINLOCK(pm_qos_lock);
+static DEFINE_RAW_SPINLOCK(pm_qos_rawlock);

static ssize_t pm_qos_power_write(struct file *filp, const char __user *buf,
size_t count, loff_t *f_pos);
@@ -149,13 +151,15 @@ static void update_target(int target)
extreme_value = pm_qos_array[target]->comparitor(
extreme_value, node->value);
}
+ spin_unlock_irqrestore(&pm_qos_lock, flags);
+ spin_lock_irqsave(&pm_qos_rawlock, flags);
if (pm_qos_array[target]->target_value != extreme_value) {
call_notifier = 1;
pm_qos_array[target]->target_value = extreme_value;
pr_debug(KERN_ERR "new target for qos %d is %d\n", target,
pm_qos_array[target]->target_value);
}
- spin_unlock_irqrestore(&pm_qos_lock, flags);
+ spin_unlock_irqrestore(&pm_qos_rawlock, flags);

if (call_notifier)
blocking_notifier_call_chain(pm_qos_array[target]->notifiers,
@@ -195,9 +199,12 @@ int pm_qos_requirement(int pm_qos_class)
int ret_val;
unsigned long flags;

- spin_lock_irqsave(&pm_qos_lock, flags);
+ /*
+ * pm_qos_requirement is called from idle, so it cannot sleep
+ */
+ spin_lock_irqsave(&pm_qos_rawlock, flags);
ret_val = pm_qos_array[pm_qos_class]->target_value;
- spin_unlock_irqrestore(&pm_qos_lock, flags);
+ spin_unlock_irqrestore(&pm_qos_rawlock, flags);

return ret_val;
}

--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/



Relevant Pages

  • Re: CMultiLock example
    ... Seriously, though, locking is *mandatory* if two or more threads are accessing non-scalar ... lists ever be modified while the threads are running. ... without a lock. ... You must prevent that one piece of data is modified from multiple ...
    (microsoft.public.vc.mfc)
  • Re: [PATCH] bttv: fix mutex use before init
    ... the process fix uninitialized deref introduced in c37db91fd0d. ... While your patch fixes the issue, it has some other troubles, like to ... just one lock, instead of 3, there's no risk of dead locks. ... Could you take this patch to remove all of the comments about locking ...
    (Linux-Kernel)
  • Re: [PATCH 2/2] module: fix bne2 "gave up waiting for init of module libcrc32c"
    ... "I hate this patch. ... It makes already-ugly locking in module.c awful, ... Take the very first mail you sent: ... will no longer be valid if the lock was dropped. ...
    (Linux-Kernel)
  • Re: CMultiLock example
    ... Then, of course, there is the issue of "why lock at all?" ... For me, this means designing a ... avoids unnecessary locking. ... There are multiple link lists modified within the thread. ...
    (microsoft.public.vc.mfc)
  • Re: [Fwd: [PATCH 2/7] lock_list: a fine grain locked double linked list]
    ... It is build upon the regular double linked list primitives, spinlocks and RCU. ... Locking is peculiar in that edges are locked, ... dependancy created by the fact that the regular linked lists are circular. ...
    (Linux-Kernel)