Re: kernel 2.4.37 usb acm hard lockup



Hello,

On Thu, Jan 08, 2009 at 02:19:21PM -0700, duane@xxxxxxxxxxxxxx wrote:
Hello, I have a linux-2.4.37 - kernel *HANG* hard lockup that I am
trying to debug, ctl-alt-del does not fix the problem. The problem does
not occur on current 2.6 kernels.

Specifically the device I am talking to is an USB-ACM type 'scanner'
type device, it shows up as a USB com port /dev/input/ttyACM0 - and I
read/write commands to & from the device.

You make me realize that I have never encountered an usb-acm device yet.
I did not even know what they were supposed to be :-/

Some fixes have already been back ported from 2.6 (trying to get it to
run on 2.4.20-8 [rh9]) and I'm not getting far and asking for some help
or some pointers where to go next.

The bug is quite reproducible - I have a small C program, and a small
shell script loop. It sends a few commands (ie: initialize, scan and
transmit image) then saves the image to a file. Somewhere between 70 to
150 scans BANG a hard hang occurs, takes between 5 to 15 minutes.

I get *NO* output on the console, like an oops or something like that.
Only debug messages I have added in my program.

I have already turned the USB KERNEL debug messages on in the .config
file, and I have "dmesg -n 9".

And - if I do add too many of my own debug messages, depending on the
volume of messages the problem takes longer to appear or goes away (I
give up after several hours) - which makes me think it is a timing/race
condition :-(

If this is the wrong list, or there is a better list for this specific
2.4/usb/acm problem please let me know, I don't really know where else
to ask.

Since it's probably the first time I read a report about usb-acm, I think
that this driver is very rarely used. So it may simply be broken or full
of race conditions or improper locking.

What you could do :
- try without SMP if you're on an SMP machine.
- try with an old gcc (2.95.3 is perfect for 2.4, 3.3 is good too).

Adding support for gcc 3.4 a few years ago was not easy, and it is
possible that some drivers are still not properly fixed.

Also, do you remember the latest 2.4 kernel where this driver worked
correctly (if any at all) ?

Regards,
Willy

--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/



Relevant Pages

  • Suspend/hibernate wireless regression with 2.6.33 kernel
    ... in kernel 2.6.33/2.6.33.1 using the in-kernel b43 driver, ... wireless becomes permanently ... b43-phy0 debug: b2062: Using crystal tab entry 19200 kHz. ...
    (Linux-Kernel)
  • Re: driver probe error reporting
    ... errors that are due to configuration such as missing device configuration ... data error is a bloat of code as a properly debugged kernel should never ... only available if the the driver code itself defines DEBUG. ... IMO, there's a problem with that, because it turns on _all_ of the debug info ...
    (Linux-Kernel)
  • Re: A CAN Driver Port from CE 4.0 to CE 6.0
    ... The driver is a stream interface driver, which was used on WinCE 4.0 ... the part of debug messages concerning the CAN driver. ... So there are some debug messages I cannot see. ...
    (microsoft.public.windowsce.platbuilder)
  • Re: A CAN Driver Port from CE 4.0 to CE 6.0
    ... The driver is a stream interface driver, which was used on WinCE 4.0 ... the part of debug messages concerning the CAN driver. ... So there are some debug messages I cannot see. ...
    (microsoft.public.windowsce.platbuilder)
  • Re: Unable to get debug message which is in kernel
    ... kernel image on to the target and I have ensured that ARMInit function ... I don't get the debug message " ARMInit done" printed on the ... and I get the debug messages which are in bootloader code but I don't ... I take it that you have no JTAG debugger? ...
    (microsoft.public.windowsce.platbuilder)