[Top][All Lists]
[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Re: utf-8 cut/paste
From: |
Sam Steingold |
Subject: |
Re: utf-8 cut/paste |
Date: |
Tue, 25 May 2004 11:41:09 -0400 |
User-agent: |
Gnus/5.1003 (Gnus v5.10.3) Emacs/21.3.50 (windows-nt) |
> * Eli Zaretskii <address@hidden> [2004-05-25 14:36:07 +0200]:
>
> No, it doesn't. My comment was a minor one, to help Sam avoid
> possible confusion in the future.
I am sorry, you lost me long ago (when MULE was merged into Emacs).
I understand what a CHARACTER is (a type in CL).
E.g., #\C is a "LATIN CAPITAL LETTER C", or
#\С is a "CYRILLIC CAPITAL LETTER ES" (even through they might look
similar in your font).
I understand that there are many (partial) function between (subsets of)
(INTEGER 0) and CHARACTER, called "encodings".
I don't know what a "charset" is, but I would guess that it is a subset
of CHARACTERs on which a particular encoding is defined.
I seem to recall that MULE considers characters as elements of these
charsets, not as elements of the class CHARACTER, i.e., each character
comes equipped with its integer encoding, and 2 characters which are
identical elements of CHARACTER, but appear in two different encodings
(e.g., #\Ц encoded in koi8 and in alt) are different characters in MULE.
This is so absurd that I can hardly believe that anyone could ever
conceive of this, let alone implement it.
This reminds me of a story (<http://v2.anekdot.ru/an/an0303/o030321.html#10>):
The Soviet space capsule of the 60-ies, Soyuz, was supposed to have
been made from titanium, but the titanium turned out to be too hard to
process, so it was made of a heavier aluminum alloy. This violated
the mass properties and thus aerodynamic stability of the craft. There
was no time to re-design everything (Moon race!), so the stability was
restored by adding a 150 kilogram lead dead-weight to the construction.
(only Soyuz-TMA in the early 2000 got rid of this thing!)
I hope it will take Emacs less than 30 years to get rid of the MULE
dead-weight.
--
Sam Steingold (http://www.podval.org/~sds) running w2k
<http://www.camera.org> <http://www.iris.org.il> <http://www.memri.org/>
<http://www.mideasttruth.com/> <http://www.honestreporting.com>
Save your burned out bulbs for me, I'm building my own dark room.
- utf-8 cut/paste, Sam Steingold, 2004/05/23
- Re: utf-8 cut/paste, Benjamin Riefenstahl, 2004/05/24
- Re: utf-8 cut/paste, Eli Zaretskii, 2004/05/25
- Re: utf-8 cut/paste, Benjamin Riefenstahl, 2004/05/25
- Re: utf-8 cut/paste, Eli Zaretskii, 2004/05/25
- Re: utf-8 cut/paste,
Sam Steingold <=
- Re: utf-8 cut/paste, Kenichi Handa, 2004/05/26
- Re: utf-8 cut/paste, Sam Steingold, 2004/05/28
- Re: utf-8 cut/paste, Jason Rumney, 2004/05/29
- Re: utf-8 cut/paste, Miles Bader, 2004/05/26
- Re: utf-8 cut/paste, Sam Steingold, 2004/05/26
- Re: utf-8 cut/paste, David Kastrup, 2004/05/26
- Re: utf-8 cut/paste, Benjamin Riefenstahl, 2004/05/26
- Re: utf-8 cut/paste, Eli Zaretskii, 2004/05/26
- Re: utf-8 cut/paste, Sam Steingold, 2004/05/27
- Re: utf-8 cut/paste, Stefan Monnier, 2004/05/26