Re: [opensuse] uncompessing zip files and accented characters



* Dave Howorth (dhoworth@xxxxxxxxxxxxxxxxx) [20100713 10:50]:

They already are, I think.

No they aren't. The zip format neither specifies an encoding to use nor does
it offer a field that identifies the encoding. Thus unzip in its original
form can't handle different encodings and you also can't specify the
encoding. And as stated, upstream has rejected all patches up till now,
stating that utf8 should be used. Right, as if any Win* user would be able
to do so.

To remedy the situation a bit I've accepted a patch to openSUSE's unzip that
will decode russian and czech encodings. As librcc is extensible, maybe it
could be extended to also handle hungarian file names sokmetime in the
future.

Philipp

--
To unsubscribe, e-mail: opensuse+unsubscribe@xxxxxxxxxxxx
For additional commands, e-mail: opensuse+help@xxxxxxxxxxxx



Relevant Pages

  • Re: A taxonomy of types
    ... specifies its meaning, one would need to ask Alan Kay. ... It is needed whenever data, whose encoding, layout or language might ... type can extend the verb »toString«. ...
    (comp.lang.misc)
  • Re: Enforce Specific Encoding To Messages Arrived From Specific Ad
    ... US-ASCII encoding. ... I know how to encode each message manually when I ... I don't mind whether this process must be done using VBS ... ...
    (microsoft.public.outlook.program_vba)
  • Re: Xml parser and character encoding
    ... The original code specifies the *OUTPUT* encoding, ... I understand it) the SAX implementation is /supposed/ to take it from the XML ... code I commented on uses the Java system default decoder (whatever that happens ...
    (comp.lang.java.programmer)
  • Encodings for newsgroup messages (was: How to read this NG)
    ... Newsgroup messages should have a 'Content-Type' field in the header ... that specifies what character encoding is used for that message. ... If the encoding is incorrectly specified, ... not the software reading it. ...
    (sci.lang.japan)
  • Re: Missing XML header that specifies the encoding
    ... Joe Fawcett (MVP - XML)http://joe.fawcett.name ... specifies the encoding. ...
    (microsoft.public.sqlserver.xml)