Gedit and Chinese text

From: Tony Nelson (tonynelson_at_georgeanelson.com)
Date: 06/30/05

  • Next message: Tony Nelson: "Set initial size of gnome-terminal?"
    Date: Wed, 29 Jun 2005 21:46:49 -0400
    To: <fedora-list@redhat.com>
    
    

    I know nothing about Chinese, but some of you do. I'm writing a Gedit
    plugin using Unicode. My plugin wraps text for block comments in programs,
    and it assumes a fixed-width font, such as Monospace. Reading about
    Unicode, I see that in a "fixed-width" font that some characters are
    double-wide, and I'm wondering if I need to handle that in Gedit.

    I installed ttfonts-zh_CN in an attempt to see what happens with Chinese
    characters in Gedit, but I'm not sure what I learned. I think that AR PL
    Kait6iM GB is monospaced, but 7 random (double-width) chinese characters
    take up 13 western characters, not 14. They also do so in other fonts I
    tried (most seemed to substitute AR PL SungtiL GB for the Chinese
    characters).

    Is this double-width thing real, or am I confused?

    Does Gedit do it, say, if a real Chinese language installation is done?

    Does it make any sense to try to handle Chinese characters? Do
    word-wrapping and Chinse go together in, say, C programming source code
    comments?

    If they do go together, uhh, what constitutes word breaks? Those odd
    Unicode spaces? Any double-wide character boundary? (Not followed by
    punctuation?)
    ____________________________________________________________________
    TonyN.:' <mailto:tonynelson@georgeanelson.com>
          ' <http://www.georgeanelson.com/>

    -- 
    fedora-list mailing list
    fedora-list@redhat.com
    To unsubscribe: http://www.redhat.com/mailman/listinfo/fedora-list
    

  • Next message: Tony Nelson: "Set initial size of gnome-terminal?"

    Relevant Pages

    • Re: UNICODE to MBCS
      ... a conversion internally from Unicode encoding to ANSI encoding. ... INI = UNICODE containing code points for English and Chinese ... Chinese characters where all unresolved and appearing as '?'. ...
      (microsoft.public.vc.language)
    • Re: Japanese Chinese tea web sites
      ... >>> character pairs are used for Japanese font sets. ... >>> see are from the Japanese fonts and not Chinese. ... >>> languages take two characters for representation and a corresponding ... But UTF-8 *is* Unicode. ...
      (rec.food.drink.tea)
    • Re: Unicode is driving me nuts!
      ... using unicode, as you also mentioned you used to have ... hopefully not to have a lot of unreadable characters. ... > sample Chinese document. ... > Anthony> But when I attempted to run the script ...
      (comp.lang.python)
    • Higher Unicode characters
      ... Can someone explain something regarding unicode? ... I have to work with documents written in Simplified/Traditional Chinese (by ... read the characters and it pops up the Unicode conversion dialogue box. ... Also, once the characters have been written, why is it that applying a font ...
      (microsoft.public.word.printingfonts)
    • Re: MFC(VC6) Application Localization from French to Chinese(RPC)
      ... If you are using VC6 you must be sure to have the Chinese code page loaded for the characters to work correctly. ... If you are using 2005 you can also open the RC file with notepad and resave it as Unicode and the VC resource editor will maintain it in Unicode for you. ...
      (microsoft.public.vc.mfc)