Hello!,
I wish to improve the transliteration system through the following ways:
• Add Hindi as an intermediate language for transliteration.
• The second way can be using Compressed Word Format Mapping algorithm using Modified Levenshtein algorithm.
1. CWF will convert a word to its basic phonetic alphabets.
For example: musharraf -> musaraf, chidhambaram -> cidambaram
2. Levenshtein’s Edit Distance algorithm is popularly used to calculate the edit distance between any two strings in the same language. So we will try to incorporate acoustically equivalent characters of two languages and modify it.
3. We will then rank the words according to CWF+Mlev distance
Yash Sinha