emacs-devel

Re: [ELPA] New package: pyim


From: Zhu Zihao
Subject: Re: [ELPA] New package: pyim
Date: Tue, 23 Feb 2021 00:29:42 +0800
User-agent: mu4e 1.4.15; emacs 27.1

Hi, tumashu.

I'm a big fan of the pyim input method, and I would be very happy to see
pyim become part of GNU ELPA or even Emacs!

I did some research on the issue we are facing. We can extract the
bopomofo data from the Unihan database.

For example, the pinyin of the CJK character "爱" (meaning "love") is
"ai". It can be found at
https://www.unicode.org/cgi-bin/GetUnihanData.pl?codepoint=7231.

The full database can be downloaded from
https://www.unicode.org/Public/UCD/latest/ucd/Unihan.zip. The Unihan
database is licensed under the Unicode license, a free license approved
by the FSF (https://www.gnu.org/licenses/license-list.html#Unicode).

We can write a simple program to extract the pinyin of the characters we
need from Unihan_Readings.txt. The Unihan database contains readings for
almost all CJK characters, so we have to limit it to a subset, perhaps
by referring to 《通用汉字规范表》 (Table of General Standard Chinese
Characters):
http://www.gov.cn/gzdt/att/att/site1/20130819/tygfhzb.pdf.
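
A rough sketch of what such an extraction program might look like, in
Python. It assumes the tab-separated "codepoint, field, value" layout of
Unihan_Readings.txt and uses the kMandarin field as the pinyin source;
which field is best suited is an open choice. The function name
parse_unihan_readings and the `wanted` parameter are made up for
illustration, and the sample lines stand in for the real file.

```python
import io


def parse_unihan_readings(lines, field="kMandarin", wanted=None):
    """Collect readings from lines in Unihan_Readings.txt format.

    lines  -- any iterable of text lines (an open file works too)
    field  -- Unihan field to extract (assumption: kMandarin)
    wanted -- optional set of characters to keep, e.g. a subset taken
              from a standard character table; None keeps everything
    """
    readings = {}
    for line in lines:
        line = line.rstrip("\n")
        # Skip blank lines and comment lines.
        if not line or line.startswith("#"):
            continue
        parts = line.split("\t")
        if len(parts) != 3 or parts[1] != field:
            continue
        codepoint, _, value = parts
        # "U+7231" -> the character with code point 0x7231.
        char = chr(int(codepoint[2:], 16))
        if wanted is not None and char not in wanted:
            continue
        # A value may hold several space-separated readings.
        readings[char] = value.split()
    return readings


# Stand-in for the real file; in practice, pass
# open("Unihan_Readings.txt", encoding="utf-8") instead.
sample = io.StringIO(
    "# sample data\n"
    "U+7231\tkDefinition\tlove, be fond of, like\n"
    "U+7231\tkMandarin\tài\n"
    "U+4E2D\tkMandarin\tzhōng\n"
)
result = parse_unihan_readings(sample, wanted={"爱"})
print(result)
```

The readings keep their tone marks here; converting them to the plain
ASCII form an input method needs would be a separate step.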
-- 
Retrieve my PGP public key:

  gpg --recv-keys D47A9C8B2AE3905B563D9135BE42B352A9F6821F

Zihao



