RE: e1000e regression (interface hang) with latest -git
- From: "Allan, Bruce W" <bruce.w.allan@xxxxxxxxx>
- Date: Wed, 21 Jan 2009 09:35:16 -0800
-----Original Message-----
From: Ingo Molnar [mailto:mingo@xxxxxxx]
Sent: Wednesday, January 21, 2009 2:29 AM
To: linux-kernel@xxxxxxxxxxxxxxx; Kirsher, Jeffrey T; Brandeburg, Jesse;
Allan, Bruce W; Waskiewicz Jr, Peter P; e1000-devel@xxxxxxxxxxxxxxxxxxxxx;
netdev@xxxxxxxxxxxxxxx
Cc: Rafael J. Wysocki
Subject: e1000e regression (interface hang) with latest -git
I've got a Nehalem testbox that developed a new e1000e problem in this
merge window: after a few minutes of uptime the network interface goes
dead - no rx and no tx. If i ifdown/ifup the interface it comes back. If i
wait too long then even ifdown/ifup does not help anymore - only a reboot.
Other e1000e using testboxes i have are working just fine - so the problem
is specific to this hw.
Is this a known problem?
I have this hw:
01:00.0 Ethernet controller: Intel Corporation 82575EB Gigabit Network
Connection (rev 02)
01:00.1 Ethernet controller: Intel Corporation 82575EB Gigabit Network
Connection (rev 02)
If this is a new problem, what kind of other info do you need from me to
debug and fix this?
I started seeing this very early in the merge window, so candidates would
be one of these early commits:
eb14f01: Merge branch 'master' of
master.kernel.org:/pub/scm/linux/kernel/git/davem/net-2.6
cb7b48f: igb/e1000e: Naming interrupt vectors
5b9ab2e: Merge branch 'master' of
master.kernel.org:/pub/scm/linux/kernel/git/davem/net-2.6
e243455: e1000e: check return code from NVM accesses and fix bank
detection
a20e4cf: e1000e: fix incorrect link status when switch module pulled
8452759: e1000e: store EEPROM version number to prevent unnecessary NVM
reads
0285c8d: e1000e: cosmetic newline in debug message
5c48ef3: e1000e: sync change flow control variables with ixgbe
8f12fe8: e1000e: link up/down messages must follow a specific format
75eb0fa: e1000e: ESB2 config after link up
438b365: e1000e: check return of pci_save_state
1605927: e1000e: update comments listing supported parts for each MAC
family
63dcf3d: e1000e: 82571 check for link fix on 82571 serdes
5aa49c8: e1000e: commit speed/duplex changes for m88 PHY
005cbdf: e1000e: disable correctable errors for quad ports while going to
D3
0082982: netdev: add more functions to netdevice ops
651c246: e1000e: convert to net_device_ops
198d6ba: Merge branch 'master' of
master.kernel.org:/pub/scm/linux/kernel/git/davem/net-2.6
6ea7ae1: e1000e: enable ECC correction on 82571 silicon
4cf1653: netdevice: safe convert to netdev_priv() #part-2
babcda7: drivers/net: Kill now superfluous ->last_rx stores.
7c510e4: net: convert more to %pM
If you suspect a specific list of commits i can test their revert. (But
the box is a slow booter and the problem can take up to 15 minutes to
trigger so i'd rather not spend half a day bisecting it, if it can be
avoided.)
Thanks,
Ingo
82575EB is not supported by e1000e, it is supported by igb. Are you sure that is the correct device? Please send the system log and output of:
# for dev in `lspci | grep Ethernet | awk ' { print $1 } '`; do lspci -s $dev -vvv -n; done
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/
- Follow-Ups:
- Re: e1000e regression (interface hang) with latest -git
- From: Ingo Molnar
- Re: e1000e regression (interface hang) with latest -git
- References:
- e1000e regression (interface hang) with latest -git
- From: Ingo Molnar
- e1000e regression (interface hang) with latest -git
- Prev by Date: System clock runs too fast after 2.6.27 -> 2.6.28.1 upgrade
- Next by Date: Re: [patch] SLQB slab allocator
- Previous by thread: e1000e regression (interface hang) with latest -git
- Next by thread: Re: e1000e regression (interface hang) with latest -git
- Index(es):
Relevant Pages
|