Weird NFS problem

From: Stuart Bailey (stuart_at_linusoft.co.uk)
Date: 06/21/05

  • Next message: Adam Antoniak: "sound issue"
    To: fedora-list@redhat.com
    Date: Tue, 21 Jun 2005 09:56:07 +0100
    
    

    I'm having trouble with NFS on a Fedora 3 server. The client machines are a
    mixture of HP Unix and Solaris (also samba is used for Windows clients).
    Most of the time the NFS export point runs perfectly fine, a client have it
    mounted all the time, and files and directories can be listed as expected.
    All users have write access.

    However, occasionally (about once a day) the files in various directories
    disappear from the client machines. If I go to the server and cd to the
    troublesome directories, they immediately become available to the clients
    again. As a quick fix, I have created a cron job that runs ls -R for the root
    directory of the nfs export.

    The only error I get in the logs is:
    Jun 13 07:39:58 kelvin rpc.statd[2248]: Received erroneous SM_UNMON request
    from kelvin.test for 172.16.44.23
    but this indicates a write access problem with /var/lib/nfs/statd/sm and sm.d.
    But I have checked that the rpc users has r/w access to these.

    I tried a tcpdump and got the following:

    13:27:36.751186 IP (tos 0x0, ttl 64, id 52539, offset 0, flags [DF], proto 6,
    length: 148) tescat2.680050 > kelvin.test.nfs:108 lookup [|nfs]
    13:27:36.751223 IP (tos 0x0, ttl 64, id 14271, offset 0, flags [DF], proto 6,
    length: 160) kelvin.test.nfs > tescat2.680050:reply ok 120 lookup ERROR: No
    such file or directory post dattr:
    13:27:36.751646 IP (tos 0x0, ttl 64, id 52540, offset 0, flags [DF], proto 6,
    length: 132) tescat2.680051 > kelvin.test.nfs:92 getattr [|nfs]

    Based on this I have tried turning off client side attribute caching, to no
    effect.

    I have exported the directories as such:

    /data 172.16.44.0/24 (rw,async,insecure,insecure_locks,no_subtree_check)

    I have also tried sync instead of async, and removing the no_subtree_check
    flag, also to no effect.
    Other things I have tried:
            reducing the MTU on the network to 1000 from 1500
            

    Any ideas would be appreciated.

    Many thanks,

    Stuart.

    -- 
    ---------------------------------------
    Stuart Bailey BSc (hons) CEng CITP MBCS
             LinuSoft (Proprietor)
               Linux Specialist
                (01953) 601294
                (07778) 383739
           http://www.linusoft.co.uk
    ----------------------------------------
    **********************************************************************
    This email and any files transmitted with it are confidential. If you
    are not the intended recipient, please email postmaster@linusoft.co.uk 
    immediately.
    You should not copy or use this email or attachments for any purpose
    nor disclose their contents to any other person.
    NO BINDING CONTRACT WILL RESULT FROM THIS E-MAIL UNTIL SUCH TIME AS A 
    WRITTEN DOCUMENT IS SIGNED ON BEHALF OF LinuSoft.
    LinuSoft cannot accept any responsibility for the completeness or 
    accuracy of this message as it has been transmitted over public networks.
    *************************************************************************
    -- 
    fedora-list mailing list
    fedora-list@redhat.com
    To unsubscribe: http://www.redhat.com/mailman/listinfo/fedora-list
    

  • Next message: Adam Antoniak: "sound issue"

    Relevant Pages

    • V210 BGE0@1000FDX
      ... When connecting a server to a Gig interface you need to enable autoneg ... Blocked port after process kill ... NFS oddity ... where hostname is the name of the NFS client which will automount the ...
      (SunManagers)
    • 2.6.9: NFS (+XFS) Problem - Clients getting Stale filehandles.
      ... I'm having a rather vierdNFS Problem. ... We have a disk-backup server ... running an NFS server exporting an XFS filesystem to a number of clients ... Client and server are on the same LAN - no firewall. ...
      (Linux-Kernel)
    • Exchange 5.5
      ... Open relay is closed on the server. ... I have Norton client antivirus install on all the client machines ...
      (microsoft.public.exchange.admin)
    • Exhange 5.5
      ... Open relay is closed on the server. ... I have Norton client antivirus install on all the ... client machines and exchange email scanning option is turned on. ...
      (microsoft.public.exchange.admin)
    • Re: Unable to add a Client Computer to Windows Small Business Serv
      ... IPS and it's running Windows small business server 2003. ... I'm definitely no expert when it comes to network setup so any advice would ... Where is the client computer? ... client machines would have private addresses and join the domain by using ...
      (microsoft.public.windows.server.networking)