Re: Unicode Text on Linux and WindowsXP



Update:
After reading IFS's Troubleshooting page, it seems that the current
version doesn't support utf-8.

Quoted from their site:
" When I access to an Ext2/Ext3 volume on Windows, file names with
language-specific characters appear distorted.

It is probably caused by an activated UTF-8 encoding in your Linux
installation. You may check it by the locale tool (on Linux):
locale
When it outputs something, which ends with "utf8" in the line
"LC_CTYPE=", UTF-8 encoding is enabled. Unfortunately, the current
version of the Ext2 IFS software does not support UTF-8 encoded file
names. (The driver always uses the current code page of Windows.) "

On 1/26/06, Panos Laganakos <panos.laganakos@xxxxxxxxx> wrote:
> > First, how did you get windows to read/write data to an ext3 filesystem?
>
> I installed IFS[1] in order to be able to view the ext3 partition, and
> be able to mount it in a Directory Drive.
>
> > Surely it is due to the two systems not using the same encoding for
> > the characters.
> >
> > Background:
> > The basic uint of data computers process is a 'byte'. The problem is
> > that a byte is not able to uniquely represent every character of every
> > language in the world. Unicode specifies a unique value for each
> > character of each language, but does so using 2 bytes per character.
> > Thus some mechanism must be used to encode these sequences of 2-byte
> > values as sequences of individual bytes. UTF-8 is one such encoding
> > (the most commonly used)
> >
> > Hopefully this will help point you in the right direction. I have
> > never used non-ASCII characters in file names. Furthermore, I have
> > not been able to find any way of changing Windows to use UTF-8 instead
> > of CP1252.
>
> Thanks for the info, that helped me fix some issues I had with NTFS
> mounted partitions. I had to pass iocharset=utf8 in the mounting
> (/etc/fstab) and now, I'm able to view greek spelled directories.
>
> Still I haven't been able to make windows or linux be able to read
> each other's chars right. I suppose since its ext3, it should use the
> same codepage, so it might be windows/IFS fault. Still not sure how to
> overcome this, I might have to contact IFS project.
>
> [1]IFS = http://www.fs-driver.org/
>
-- ubuntu-users mailing list
ubuntu-users@xxxxxxxxxxxxxxxx
https://lists.ubuntu.com/mailman/listinfo/ubuntu-users


Relevant Pages

  • Re: Tk 8.4.11 / Windows XP / Encoding problem
    ... Some of our clients are experiencing a weird problem on their Windows XP PCs. ... suddenly they start showing other characters instead of the correct utf-8 ... but it reported "utf-8" as it should. ...
    (comp.lang.tcl)
  • Re: get wide character and multibyte character value
    ... How do you think the characters which requires more than 16-bit in UTF-16, ... it is why I think Windows has limitations to ... later added support for surrogate pairs, so what you see could depend ... Most of my code is handling text as UTF-8. ...
    (microsoft.public.vc.language)
  • Re: Filename Encoding Help
    ... I suggest UTF-8, it's the most efficient for regular text, and it's the default for all methods reading and writing text files in .NET. ... UTF-16 but I am not sure what Windows Vista does. ... UTF-8 can represent the full Unicode spectrum, but many characters wind up encoded in just one or two bytes. ...
    (microsoft.public.dotnet.framework)
  • Re: Im sure glad I didnt buy a Mac Mini!
    ... somebody using Outlook. ... replace them with chinese characters. ... Explain how you having a problem sending UTF-8 to someone besides me makes ... It's a problem for Windows users who try to deal with UTF-8 charsets. ...
    (comp.sys.mac.advocacy)
  • =?utf-8?B?UmU6IFN0cmluZyAiw6LigqzihKIiIHRyYW5zbGF0ZWQgdG8gYXBvc3Ryb3BoZS4gV2h5Pw==?=
    ... it works), though it seems to use mostly just Ascii characters, representing ... but the author is not making the best possible use of UTF-8. ... They don't map it to ASCII apostrophe, ... Latin 1 encoding. ...
    (alt.html)