texmacs-dev
[Top][All Lists]
Advanced

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

[Texmacs-dev] Cyrillic encoding problem


From: Павел Пантелеев
Subject: [Texmacs-dev] Cyrillic encoding problem
Date: Thu, 7 Feb 2008 00:39:54 +0300

Hello, all.

  I often use TeXmacs to write my papers.
  I really like it! Many thanks to all developers.
  But I have some troubles with Cyrillic encoding.
  As far as I understand TeXmacs represents
  cyrillic letters as unicode symbols.
  For example, if I write "Hello!" (in russian "Privet!")
  TeXmacs saves in tm-file the string:
  "\<#41F\>\<#440\>\<#438\>\<#432\>\<#435\>\<#442\>!".
  Maybe Unicode for internal representation is good but
  it's very inconveniently for many reasons:

  1) TeXmacs Hyphenation tables for russian language use TeX T2
     encoding (very close to standard cp1251 windows encoding).
     So it doesn't work.
     
  2) I can't use external editor to edit and view tm-files.

  3) I can't use spell-checker in TeXmacs.

  4) Import to Latex doesn't work. It writes empty string instead of
     "Hello!" (in russian cp1251) but it produces valid header:
     \usepackage[cp1251]{inputenc}
     \usepackage[russian]{babel}
     I also can read latex files in cp1251 encoding very well. But if I
     save them in tm-format they store russian text in TeX T2A
     encoding instead of unicode!
     

  But if replace the sequence:
  "\<#41F\>\<#440\>\<#438\>\<#432\>\<#435\>\<#442\>!" in tm-file
  in external editor with "Hello!" (in russian in T2A
  encoding) and read it by TeXmacs then all is OK. Hyphenation
  works pretty good. So I deduce that TeXmacs can handle T2A encoding
  pretty good.

  Is there a way to store russian texts in TeXmacs in cp1251 or
  in T2A encoding? Maybe it's appropriate to use current
  representation for kernel but there should be a simple way
  to handle another encodings such as T2A, cp1251, koi8r and etc.

  I tried to resolve the problem by kbd-map but failed.
  I wonted to convert on the fly symbols like \<#438\>
  to corresponding T2A analogs.
  But I don't know a way to represent unicode symbols
  like \<#440\> in this command.
  Is there a way to use this kind of symbols in scheme
  programs?

  I use TeXmacs (ver. 1.0.6.12) under Windows XP SP2.
  I compiled it under Cygwin and use xkb for language
  switching.

-- 
Regards,
 Pavel                          mailto:address@hidden





reply via email to

[Prev in Thread] Current Thread [Next in Thread]