Re: utf8 or utf-8

From: Adam Sampson (azz_at_us-lot.org)
Date: 01/23/04

  • Next message: Evaldo Gardenali: "buggy raid checksumming selection?"
    To: law <lkml@tlinx.org>
    Date:	Fri, 23 Jan 2004 13:35:06 +0000
    
    

    law <lkml@tlinx.org> writes:

    > Should all references be UTF8 or utf-8?

    From Markus Kuhn's excellent Unicode FAQ:

    "The official name and spelling of this encoding is UTF-8, where UTF
    stands for UCS Transformation Format. Please do not write UTF-8 in any
    documentation text in other ways (such as utf8 or UTF_8), unless of
    course you refer to a variable name and not the encoding itself."
    <http://www.cl.cam.ac.uk/~mgk25/unicode.html>

    -- 
    Adam Sampson <azz@us-lot.org>                        <http://offog.org/>
    -
    To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
    the body of a message to majordomo@vger.kernel.org
    More majordomo info at  http://vger.kernel.org/majordomo-info.html
    Please read the FAQ at  http://www.tux.org/lkml/
    

  • Next message: Evaldo Gardenali: "buggy raid checksumming selection?"

    Relevant Pages

    • Re: A Great Idea (tm) about reimplementing NLS.
      ... would translate filenames form utf-8 stored on the media to e.g. latin2 ... one encoding for a particular language ... The filesystem is already populated with UFT-8 names. ... send the line "unsubscribe linux-kernel" in ...
      (Linux-Kernel)
    • Re: UTF-8 and case-insensitivity
      ... > the Linux kernel will need to efficiently support a userspace policy ... and say: UTF-8 is *the* encoding. ... send the line "unsubscribe linux-kernel" in ...
      (Linux-Kernel)
    • Re: A Great Idea (tm) about reimplementing NLS.
      ... Ext2/3's encoding has always been utf-8. ... send the line "unsubscribe linux-kernel" in ... Please read the FAQ at http://www.tux.org/lkml/ ...
      (Linux-Kernel)
    • Re: A Great Idea (tm) about reimplementing NLS.
      ... doesn't magically convert!utf-8 to utf-8. ... Unless the original encoding is 'wrong' and has two mapped characters ... Thats a fault of the file system design, ... send the line "unsubscribe linux-kernel" in ...
      (Linux-Kernel)
    • Re: How to detect text file encoding in Perl
      ... The BOM is the relevant encoding of the Unicode character U+FEFF. ... and in utf-8 it's *three* bytes. ... the Unicode FAQ points out that utf-8 can usefully ...
      (comp.lang.perl.misc)