aterm/rxvt man page issues

D. Hugh Redelmeier hugh-pmF8o41NoarQT0dZR+AlfA at public.gmane.org
Thu Jun 1 23:54:33 UTC 2006


| From: Giles Orr <gilesorr-Re5JQEeQqe8AvxtiuMwx3w at public.gmane.org>

| When I use "man" (actually a special mode of "less" on most Linux
| systems), it does weird things when displaying "`", "'", and "-"
| (back-tick, single quote, and hyphen).  I get the letter "a" with a
| caret over it replacing any of them, and some extra spaces.

These programs are outputting UTF-8 (multicharacter encoding of
UNICODE).  In UTF-8, characters within the ASCII set have the same encoding
as in ASCII.

Other characters (eg. those you mentioned) take multiple bytes.
ISO-8859-1 characters that are not within ASCII take two bytes.

The best fix would be to convince your terminal program that it is
expecting UTF-8 from the programs running beneath it.

Alternatively, you could try to convince the programs to generate what
your terminal program is expecting.  But there are a lot of programs
to convince (including gcc!).

On Fedora Core 5, changing the definition of LANG in
/etc/sysconfig/i18n should do the trick (but I'm not sure that it
does).

I have a heck of a time with this, even when I start xterm like this:
  LANG=en_CA xterm -lc
I lose xterms when I display some SPAM subject lines with Chinese in
them.  I'm generally doing this through ssh from a FC5 system to a
RHL7.0 system, so there may be some kind of impedance mismatch in
their handling of $LANG.  The newest devolutions of xterm confuse
things with a helper program called "luit".
--
The Toronto Linux Users Group.      Meetings: http://tlug.ss.org
TLUG requests: Linux topics, No HTML, wrap text below 80 columns
How to UNSUBSCRIBE: http://tlug.ss.org/subscribe.shtml





More information about the Legacy mailing list