This is an FST converter that converts modified Avar Cyrillic alphabet (used in Zilo Andi, zilo1238) to IPA and vice versa. It includes the following additions:
- <ӏ> (U+04CF, Cyrillic small letter Palochka) and <Ӏ> (U+04C0, Cyrillic letter Palochka) can be used interchangeably. It's also possible to use (U+0049, Latin Capital letter I) and <1> (U+0031, Digit One) for the same purpose.
- Nasalisation is absent in Avar, so to denote nasalisation in Zilo Andi Cyrillic, several options are possible: <ᴴ> (U+1D34, Modifier Letter Capital H, e.g., <аᴴ>), <2> (U+0031, Digit Two, e.g., <а2>), or a tilde over the vowel (e.g., <а̃>).
- We expect [je] following a vowel or in word-initial position to be spelled as <йе>, which contradicts Russian spelling.
- The sequence <лъ> is ambiguous. We decided to use <лъ> to denote a lateral fricative [ɬ] and <л.ъ> to denote a combination of [l] and [ʔ], so you need to preprocess your text before using the transliterator.
Please refer to the transliterator as follows:
Moroz G. (2026) An FST transliterator from Zilo Andi Cyrillic to IPA, https://github.com/LingConLab/zilo_andi_transliterator
$ make
$ echo "Микъи битIилъи\!" | hfst-proc -C zilo_andi_cyrillic2ipa.hfstol | cg3 -g zilo_andi_cyrillic2ipa.cg3
"<Микъи>"
"miqχ'i"
"<битIилъи>"
"bit'iɬi"
Transliterator is based on hfst and cg3, so make sure you have hfst installed. Here are the installation instructions.
$ curl -s https://apertium.projectjj.com/apt/install-nightly.sh | sudo bash
$ sudo apt-get install hfst cg3
All metadata could be seen using the following command:
$ hfst-edit-metadata zilo_andi_cyrillic2ipa.hfstol
address: Moscow
author: George Moroz
cg3: true
citation: Moroz G. (2026) An FST transliterator from Zilo Andi Cyrillic to IPA, Linguistic Convergence Laboratory, HSE University, Moscow, https://github.com/LingConLab/zilo_andi_transliterator
family: East Caucasian
formulaic-definition: Id _1^∞ .
fst_type: transliterator
language: Zilo Andi (zilo1238)
licence: GPL-3.0
name: Zilo Andi Cyrillic to IPA transliterator
publisher: Linguistic Convergence Laboratory, HSE University
type: HFST_OL
usage_example: echo "Микъи битIилъи\!" | hfst-proc -C zilo_andi_cyrillic2ipa.hfstol | cg3 -g zilo_andi_cyrillic2ipa.cg3
version: 3.3
year: 2026
I decided not to do the reverse IPA to cyrillic side as a simple reverse after I gathered all possible capitals:
$ echo "anʒilo" | hfst-proc zilo_andi_ipa2cyrillic.hfstol
^anʒilo/АНЖИЛО/АНЖИЛо/АНЖИлО/АНЖИло/АНЖиЛО/АНЖиЛо/АНЖилО/АНЖило/АНжИЛО/АНжИЛо/АНжИлО/АНжИло/АНжиЛО/АНжиЛо/АНжилО/АНжило/АнЖИЛО/АнЖИЛо/АнЖИлО/АнЖИло/АнЖиЛО/АнЖиЛо/АнЖилО/АнЖило/АнжИЛО/АнжИЛо/АнжИлО/АнжИло/АнжиЛО/АнжиЛо/АнжилО/Анжило/аНЖИЛО/аНЖИЛо/аНЖИлО/аНЖИло/аНЖиЛО/аНЖиЛо/аНЖилО/аНЖило/аНжИЛО/аНжИЛо/аНжИлО/аНжИло/аНжиЛО/аНжиЛо/аНжилО/аНжило/анЖИЛО/анЖИЛо/анЖИлО/анЖИло/анЖиЛО/анЖиЛо/анЖилО/анЖило/анжИЛО/анжИЛо/анжИлО/анжИло/анжиЛО/анжиЛо/анжилО/анжило$