GBit speed problem - TG3 on HP DL380 G3



Hi,

First I will summarise the environment, then I will describe the
problem. I apologize if this is the wrong place to ask; if so,
please direct me to the correct one.

== The environment ==

I have an HP DL380 G3 server with dual Gbit ethernet. Here is the
output of lspci:

root@koyoko:~# lspci
00:00.0 Host bridge: Broadcom CMIC-WS Host Bridge (GC-LE chipset) (rev
13)
00:00.1 Host bridge: Broadcom CMIC-WS Host Bridge (GC-LE chipset)
00:00.2 Host bridge: Broadcom CMIC-LE
00:03.0 VGA compatible controller: ATI Technologies Inc Rage XL (rev
27)
00:04.0 System peripheral: Compaq Computer Corporation Integrated
Lights Out Controller (rev 01)
00:04.2 System peripheral: Compaq Computer Corporation Integrated
Lights Out Processor (rev 01)
00:0f.0 ISA bridge: Broadcom CSB5 South Bridge (rev 93)
00:0f.1 IDE interface: Broadcom CSB5 IDE Controller (rev 93)
00:0f.2 USB Controller: Broadcom OSB4/CSB5 OHCI USB Controller (rev
05)
00:0f.3 Host bridge: Broadcom CSB5 LPC bridge
00:10.0 Host bridge: Broadcom CIOB-X2 PCI-X I/O Bridge (rev 05)
00:10.2 Host bridge: Broadcom CIOB-X2 PCI-X I/O Bridge (rev 05)
00:11.0 Host bridge: Broadcom CIOB-X2 PCI-X I/O Bridge (rev 03)
00:11.2 Host bridge: Broadcom CIOB-X2 PCI-X I/O Bridge (rev 03)
01:03.0 RAID bus controller: Compaq Computer Corporation Smart Array
5i/532 (rev 01)
02:01.0 Ethernet controller: Broadcom Corporation NetXtreme BCM5703X
Gigabit Ethernet (rev 02)
02:02.0 Ethernet controller: Broadcom Corporation NetXtreme BCM5703X
Gigabit Ethernet (rev 02)
03:01.0 RAID bus controller: Silicon Image, Inc. SiI 3124 PCI-X Serial
ATA Controller (rev 01)
06:02.0 SCSI storage controller: Adaptec AHA-2940U2/U2W
06:1e.0 PCI Hot-plug controller: Compaq Computer Corporation PCI
Hotplug Controller (rev 14)

It has two "Broadcom Corporation NetXtreme BCM5703X" controllers, as
you can see. The two ports are set up identically in the "interfaces"
configuration file:

root@koyoko:~# cat /etc/network/interfaces
# This file describes the network interfaces available on your system
# and how to activate them. For more information, see interfaces(5).

# The loopback network interface
auto lo
iface lo inet loopback

# The primary network interface
auto eth0
iface eth0 inet dhcp

auto eth1
iface eth1 inet dhcp

We are connecting this to a NetGear GS108 8-port Gbit switch:

http://www.netgear.com/Products/Switches/DesktopSwitches/GS108.aspx

We have used a variety of cables.

Versions:

root@koyoko:~# uname -a
Linux koyoko 2.6.22-14-server #1 SMP Sun Oct 14 23:34:23 GMT 2007 i686
GNU/Linux
root@koyoko:~# ethtool -i eth0
driver: tg3
version: 3.77
firmware-version: 5703-v2.21a
bus-info: 0000:02:01.0


== The problem ==

At first, both ports would connect at Gbit speed almost
instantaneously, and I was using only eth0. However, one day I was
transferring some files and it was going very slowly. I noticed the
port was running at 100 Mbit/s. I tried a lot of things but nothing
worked. ethtool showed that the port was set to autonegotiate and was
advertising 10, 100 and 1000 Mbit/s, but it would settle at 100 Mbit/s.
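
For reference, the negotiated state can be pulled out of the ethtool
output with something like the sketch below. The function name
link_summary is made up for this example, and it assumes the usual
"Speed:", "Duplex:" and "Auto-negotiation:" lines that ethtool prints
for an interface.

```shell
# Hypothetical helper: summarise the negotiated link state from the
# text that "ethtool <iface>" prints (read on stdin).
link_summary() {
    awk -F': *' '
        { key = $1; sub(/^[ \t]+/, "", key) }      # strip leading indentation
        key == "Speed"            { speed = $2 }
        key == "Duplex"           { duplex = $2 }
        key == "Auto-negotiation" { autoneg = $2 }
        END { printf "speed=%s duplex=%s autoneg=%s\n", speed, duplex, autoneg }
    '
}

# Usage (needs root and the real hardware, so it is left commented out):
#   ethtool eth0 | link_summary
```

Running it against both ports gives a one-line comparison of what each
one ended up negotiating.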

After a couple of hours of mucking about, I reset the machine and
pulled out all the cables - power, ethernet, KVM, SCSI. After that, the
machine negotiated correctly at 1000 Mbit/s. I couldn't understand why,
but put it down to something erratic.

Recently it has occurred again, and this time cold-starting the
machine does not fix the problem.

Throughout all these incidents, when eth0 is plugged in, it takes
about 20-30 seconds to negotiate 100 Mbit/s, whereas eth1 takes only
a second at most to negotiate a 1 Gbit connection. This output from
dmesg should show the time difference between eth0 and eth1 - I just
disconnected the cable and plugged it back in. This timing is
consistent.

[ 5001.617724] tg3: eth0: Link is down.
[ 5019.558241] tg3: eth0: Link is up at 100 Mbps, full duplex.
[ 5019.558249] tg3: eth0: Flow control is on for TX and on for RX.
[ 5038.595038] tg3: eth1: Link is down.
[ 5041.375736] tg3: eth1: Link is up at 1000 Mbps, full duplex.
[ 5041.375744] tg3: eth1: Flow control is on for TX and on for RX.
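
To put numbers on that difference, the gap between each "Link is down"
and the following "Link is up" can be computed from the log. This is
only a sketch: link_delay is a made-up name, and it assumes the tg3
message format shown above. Note that the figure includes however long
the cable was left unplugged, so it overstates the negotiation time
itself.

```shell
# Hypothetical helper: read kernel log lines on stdin and print, for each
# "Link is down" / "Link is up" pair, the interface and the seconds between.
link_delay() {
    awk '
        {
            # Find the interface name, e.g. "eth0"; skip lines without one.
            if (!match($0, /eth[0-9]+/)) next
            iface = substr($0, RSTART, RLENGTH)
            # Pull the timestamp out of the leading "[ 5001.617724]".
            ts = $0
            sub(/^\[ */, "", ts)
            sub(/\].*$/, "", ts)
        }
        /Link is down/ { down[iface] = ts + 0 }
        /Link is up/ && (iface in down) {
            printf "%s %.1f\n", iface, ts - down[iface]
            delete down[iface]
        }
    '
}

# Usage:
#   dmesg | link_delay
```

On the six lines above this prints 17.9 for eth0 and 2.8 for eth1.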

Using ethtool, I dumped the EEPROM data. The data is different for
eth0 and eth1. Is this something that may be an issue? Could the data
for eth0 be corrupt or configured incorrectly? The register dumps also
differ. I am not exactly sure what these values correspond to, so if
someone could elaborate, that would be much appreciated.
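
For anyone who wants to repeat the EEPROM comparison, a rough sketch
follows. count_diff_bytes and the file names are made up for this
example; "ethtool -e ... raw on" writes the dump as raw bytes. Some
differences are probably expected (each port has its own MAC address,
which I assume lives in the EEPROM), so a non-zero count does not by
itself mean corruption.

```shell
# Hypothetical helper: count how many byte positions differ between two
# raw EEPROM dumps (files of equal length are assumed).
count_diff_bytes() {
    # "cmp -l" prints one line per differing byte and exits non-zero when
    # the files differ, so its exit status is deliberately ignored here.
    { cmp -l "$1" "$2" 2>/dev/null || true; } | wc -l | tr -d ' '
}

# Usage (needs root and the real NICs, so it is left commented out):
#   ethtool -e eth0 raw on > eth0.bin
#   ethtool -e eth1 raw on > eth1.bin
#   count_diff_bytes eth0.bin eth1.bin
#   cmp -l eth0.bin eth1.bin    # offset and octal values of each differing byte
```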

Is there some way to reset this device? (Has anyone had experience
with HP DL380 servers?) I have also emailed the guy I bought it from,
asking for advice.

I have changed the network settings to use eth1 for now, which is
currently running at Gbit speed and appears to be working fine. I'll
keep an eye on it to see if any problems develop.

I wonder whether this is a hardware problem, or if there are some
options I can change. Is it possible (or desirable) to load the eeprom
data from eth1 to eth0?

(Also, I must note, I am going away on holiday for about 1.5 weeks, so
I might not reply immediately, although I will still try to get online
during that time.)

I'm really stuck at this point. Is there any way I can debug this
issue? Increase log level or something like that?

Thanks to any who can help,
Samuel
