Re: A problem about DIRECT IO on ext3

From: Badari Pulavarty (pbadari_at_gmail.com)
Date: 10/17/05

  • Next message: Roman Zippel: "Re: [PATCH] ktimers subsystem 2.6.14-rc2-kt5"
    To: Jens Axboe <axboe@suse.de>
    Date:	Mon, 17 Oct 2005 09:36:10 -0700
    
    

    On Mon, 2005-10-17 at 11:51 +0200, Jens Axboe wrote:
    > On Mon, Oct 17 2005, li nux wrote:
    > >
    > >
    > > --- Jens Axboe <axboe@suse.de> wrote:
    > >
    > > > On Mon, Oct 17 2005, Grzegorz Kulewski wrote:
    > > > > On Mon, 17 Oct 2005, Jens Axboe wrote:
    > > > > >>how to correct this problem ?
    > > > > >
    > > > > >See your buffer address, it's not aligned. You
    > > > need to align that as
    > > > > >well. This is needed because the hardware will
    > > > dma directly to the user
    > > > > >buffer, and to be on the safe side we require the
    > > > same alignment as the
    > > > > >block layer will normally generate for file
    > > > system io.
    > > > > >
    > > > > >So in short, just align your read buffer to the
    > > > same as your block size
    > > > > >and you will be fine. Example:
    > > > > >
    > > > > >#define BS (4096)
    > > > > >#define MASK (BS - 1)
    > > > > >#define ALIGN(buf) (((unsigned long) (buf) +
    > > > MASK) & ~(MASK))
    > > > > >
    > > > > >char *ptr = malloc(BS + MASK);
    > > > > >char *buf = (char *) ALIGN(ptr);
    > > > > >
    > > > > >read(fd, buf, BS);
    > > > >
    > > > > Shouldn't one use posix_memalign(3) for that?
    > > >
    > > > Dunno if one 'should', one 'can' if one wants to. I
    > > > prefer to do it
    > > > manually so I don't have to jump through #define
    > > > hoops to get at it
    > > > (which, btw, still doesn't expose it on this
    > > > machine).
    > > >
    > > > --
    > > > Jens Axboe
    > >
    > > Thanx a lot Jens :-)
    > > Its working now.
    > > I did not have to make these adjustments on 2.6
    > > Is looks to be having more relaxation.
    >
    > 2.6 does have the option of checking the hardware dma requirement
    > seperately, but for this path you should run into the same restrictions.
    > Perhaps you just got lucky when testing 2.6?

    2.6 also has the same restriction. But, if the "filesystem
    blocksize alignment" (soft block size) fails, we try to see
    if its aligned with hard sector size (512). If so, we can do the IO.
     
    2.4 fails if the offset or buffer is NOT filesystem blocksize
    aligned. Period.

    So, its possible that your buffer is atleast 512byte aligned,
    there by succeeding on 2.6

    BTW, posix_memalign() or valloc() should be safe.
     
    >
    > > Can somebody please throw some light on how to find
    > > your system's hard/soft block size ?
    >
    > It's a per-device (or even per-partition, in case of mounted partitions)
    > setting, you can use the BLKBSZGET and BLKSSZGET ioctls to query for
    > soft/hard sector sizes.
    >

    Thanks,
    Badari

    -
    To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
    the body of a message to majordomo@vger.kernel.org
    More majordomo info at http://vger.kernel.org/majordomo-info.html
    Please read the FAQ at http://www.tux.org/lkml/


  • Next message: Roman Zippel: "Re: [PATCH] ktimers subsystem 2.6.14-rc2-kt5"

    Relevant Pages

    • Re: [PATCH] speed up SATA
      ... write-cache-off queueing will be slightly slower than ... since most drives have slightly stricter ... but an 8MB buffer doing 4K random-ops could ... To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/ ...
      (Linux-Kernel)
    • Re: BugCheck FE, {1, 0, 0, 0} , XP, WinDDK 6001.18002
      ... try cutting down the size of the buffer and see if that works, let's say 1 MB chunks and if that still fails, try going down further... ... > USB Driver bugcheck, first parameter is USB bugcheck code. ... > Pipe, Request, RequestOptions, MemoryDescriptor, BytesWritten); ...
      (microsoft.public.development.device.drivers)
    • Re: cast-as-lvalue (Thank You)
      ... other type -- pointing into the buffer, ... Of course, this fails to compile cleanly in any correct C compiler, ... the resulting machine code (on the SPARC or MIPS ... Of course, even this shorter version still generally fails at runtime, ...
      (comp.lang.c)
    • Re: offtopic: how to break huge patch into smaller independent patches?
      ... if all else fails, ... but the option parsing ... espdiff: invalid option -- h ... send the line "unsubscribe linux-kernel" in ...
      (Linux-Kernel)
    • Re: Lockless file reading
      ... > a 3-byte sequence to not be written when both ... > a buffer of such data is controlled so a write ... writes aren't always in processor order (see ... send the line "unsubscribe linux-kernel" in ...
      (Linux-Kernel)