Skip to content

LingConLab/zilo_andi_transliterator

Repository files navigation

Zilo Andi Transliterator

This is an FST converter that converts modified Avar Cyrillic alphabet (used in Zilo Andi, zilo1238) to IPA and vice versa. It includes the following additions:

  • <ӏ> (U+04CF, Cyrillic small letter Palochka) and <Ӏ> (U+04C0, Cyrillic letter Palochka) can be used interchangeably. It's also possible to use (U+0049, Latin Capital letter I) and <1> (U+0031, Digit One) for the same purpose.
  • Nasalisation is absent in Avar, so to denote nasalisation in Zilo Andi Cyrillic, several options are possible: <ᴴ> (U+1D34, Modifier Letter Capital H, e.g., <аᴴ>), <2> (U+0031, Digit Two, e.g., <а2>), or a tilde over the vowel (e.g., <а̃>).
  • We expect [je] following a vowel or in word-initial position to be spelled as <йе>, which contradicts Russian spelling.
  • The sequence <лъ> is ambiguous. We decided to use <лъ> to denote a lateral fricative [ɬ] and <л.ъ> to denote a combination of [l] and [ʔ], so you need to preprocess your text before using the transliterator.

Please refer to the transliterator as follows:

Moroz G. (2026) An FST transliterator from Zilo Andi Cyrillic to IPA, https://github.com/LingConLab/zilo_andi_transliterator

Usage

$ make
$ echo "Микъи битIилъи\!" | hfst-proc -C zilo_andi_cyrillic2ipa.hfstol | cg3 -g zilo_andi_cyrillic2ipa.cg3

"<Микъи>"
	"miqχ'i"
"<битIилъи>"
	"bit'iɬi"

Transliterator is based on hfst and cg3, so make sure you have hfst installed. Here are the installation instructions.

$ curl -s https://apertium.projectjj.com/apt/install-nightly.sh | sudo bash
$ sudo apt-get install hfst cg3

All metadata could be seen using the following command:

$ hfst-edit-metadata zilo_andi_cyrillic2ipa.hfstol
address: Moscow
author: George Moroz
cg3: true
citation: Moroz G. (2026) An FST transliterator from Zilo Andi Cyrillic to IPA, Linguistic Convergence Laboratory, HSE University, Moscow, https://github.com/LingConLab/zilo_andi_transliterator
family: East Caucasian
formulaic-definition: Id _1^∞ .
fst_type: transliterator
language: Zilo Andi (zilo1238)
licence: GPL-3.0
name: Zilo Andi Cyrillic to IPA transliterator
publisher: Linguistic Convergence Laboratory, HSE University
type: HFST_OL
usage_example: echo "Микъи битIилъи\!" | hfst-proc -C zilo_andi_cyrillic2ipa.hfstol | cg3 -g zilo_andi_cyrillic2ipa.cg3
version: 3.3
year: 2026

I decided not to do the reverse IPA to cyrillic side as a simple reverse after I gathered all possible capitals:

$ echo "anʒilo" | hfst-proc zilo_andi_ipa2cyrillic.hfstol 

^anʒilo/АНЖИЛО/АНЖИЛо/АНЖИлО/АНЖИло/АНЖиЛО/АНЖиЛо/АНЖилО/АНЖило/АНжИЛО/АНжИЛо/АНжИлО/АНжИло/АНжиЛО/АНжиЛо/АНжилО/АНжило/АнЖИЛО/АнЖИЛо/АнЖИлО/АнЖИло/АнЖиЛО/АнЖиЛо/АнЖилО/АнЖило/АнжИЛО/АнжИЛо/АнжИлО/АнжИло/АнжиЛО/АнжиЛо/АнжилО/Анжило/аНЖИЛО/аНЖИЛо/аНЖИлО/аНЖИло/аНЖиЛО/аНЖиЛо/аНЖилО/аНЖило/аНжИЛО/аНжИЛо/аНжИлО/аНжИло/аНжиЛО/аНжиЛо/аНжилО/аНжило/анЖИЛО/анЖИЛо/анЖИлО/анЖИло/анЖиЛО/анЖиЛо/анЖилО/анЖило/анжИЛО/анжИЛо/анжИлО/анжИло/анжиЛО/анжиЛо/анжилО/анжило$

About

No description, website, or topics provided.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Contributors