Re: National-character conversion



On 31/01/07, Ulrich Drepper <drepper@xxxxxxxxxx> wrote:
Tim wrote:
> Though, in this day and age, we've
> probably got to the stage where you're better off using UTF-8, rather
> than entities.

This is certainly the best advise. If for some reason it's not
acceptable, there is recode:

$ echo Ä | recode -d ..HTML
&Auml;


Wow, recode is _nice_. I didn't know about this package before:

recode.i386 3.6-22.fc6 extras
Matched from:
recode
The `recode' converts files between character sets and usages.
It recognises or produces nearly 150 different character sets
and is able to transliterate files between almost any pair. When exact
transliteration are not possible, it may get rid of the offending
characters or fall back on approximations. Most RFC 1345 character sets
are supported.
http://recode.progiciels-bpi.ca/


Dotan Cohen

http://lyricslist.com/lyrics/lyrics/137/12/aaliyah/age_ain_t_nothing_but_a_number.html
http://what-is-what.com/what_is/html_email.html

--
fedora-list mailing list
fedora-list@xxxxxxxxxx
To unsubscribe: https://www.redhat.com/mailman/listinfo/fedora-list



Relevant Pages

  • Re: Cyrillic and UTF-8
    ... that the current installed base of receivers supports a) UTF8, ... Another thing to consider is the support of either in Modality ... UTF-8 can essentially support all older, ... character sets, not just Asian multi-byte character sets, with a ...
    (comp.protocols.dicom)
  • Re: stand or sit?
    ... >> Character sets are driving me crazy this week. ... do you have more information about this PHP problem? ... utf-8 with PHP/MySQL can be messy at ... My main problem is that I want to store HTML so that bibliographic ...
    (soc.motss)
  • Re: DOCTYPE
    ... Is there any overriding principle that determines whether they use UTF-8 or not? ... Also, utf-8 allows you to mix character sets, thus rendering Korean and Danish in one sentence if you wish. ... A string is just a sequence of bytes, whereas a text is something that you can read. ... The difference, off course, is the encoding it is rendered in, and the problem is that texts are stored as strings. ...
    (comp.lang.php)
  • Re: Cyrillic and UTF-8
    ... UTF-8 can essentially support all older, ... character sets, not just Asian multi-byte character sets, with a ...
    (comp.protocols.dicom)
  • Re: special characters
    ... >> I switched Netscape to UTF-8. ... > if you see something else in Netscape, ... When I switched character sets in ... In Preferences, it had also popped back to Western. ...
    (comp.sys.mac.system)