silpa-discuss
[Top][All Lists]
Advanced

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

[silpa-discuss] Improving Transliteration System


From: Yash Sinha
Subject: [silpa-discuss] Improving Transliteration System
Date: Thu, 18 Apr 2013 13:17:17 +0530

Hello!,
I wish to improve the transliteration system through the following ways:

Add Hindi as an intermediate language for transliteration.

The second way can be using Compressed Word Format Mapping algorithm using Modified Levenshtein algorithm.

1. CWF will convert a word to its basic phonetic alphabets.
For example:  musharraf  ->  musaraf,  chidhambaram -> cidambaram

2. Levenshtein’s Edit Distance algorithm is popularly used to calculate the edit distance between any two strings in the same language. So we will try to incorporate acoustically equivalent characters of two languages and modify it.

3. We will then rank the words according to CWF+Mlev distance
The second idea is not mine. I got it from http://web2py.iiit.ac.in/publications/default/download/inproceedings.pdf.108c19fc-3602-4d7d-b089-1e903960f881.pdf

Yash Sinha


reply via email to

[Prev in Thread] Current Thread [Next in Thread]