Comments (6)
Hi
First of all, it took me quite some time to realize what did I do to this project in past that I'm being tagged here. But that's fine :)
I did a quick refresh on romanization of languages. According to a comparison table on Wikipedia, it seems there are multiple options to choose from, either is correct according to one standard or another.
https://en.wikipedia.org/wiki/Romanization_of_Russian#Transliteration_table
To make things completely absurd, there's even "ั" -> "ia" variant for passports!
So, I think, the question is malformed, and can not be answered without specifying particular standard as well ๐
from cyrillic-transliteration.
@georgeslabreche Just in case: iuliia
offers only Cyrillic -> Latin, but not the other way around (which was important to me because I've tried to create TTS for mixed text in Cyrillic and Latin based on Russian voice), this is why I search for other libraries, found your library, wasn't fully satisfied by it, and did my contribution. So, your library is not fully comparable to iuliia
, they have different use cases.
from cyrillic-transliteration.
Thank you for flagging this! Which one is the correct one? cc @ratijas and @rominf.
from cyrillic-transliteration.
Thank you @ratijas for your input! Sorry for the unsolicited tagging๐ฌ.
from cyrillic-transliteration.
Hi, @georgeslabreche.
I agree with what @ratijas said: standards matter. My variant is based on the standard of Russian government (and it's mentioned in the PR), while first version of Russian transliteration is apparently not based on any standard (at least, I'm not aware of it and the PR doesn't mention it). Speaking of this particular example: "ั" -> "ja" or "ya", the latter one is more popular in Russia, take https://en.wikipedia.org/wiki/Yandex for example.
It was me who broke the compatibility, but I believe I did the right thing (standards matter more than compatibility). It was clear from tests that the compatibility is broken, but probably I should have emphasize this in the PR/commit messages and recommend you to create version 2.0 to follow semantic versioning. Sorry about that.
If you want to provide full coverage (hard, especially Latin -> Cyrillic), you probably want to add scheme
argument as it's done here: https://github.com/nalgeon/iuliia-py.
from cyrillic-transliteration.
I agree with both of you, thank you for your inputs and clarification. @rominf: no need to apologize, it's an excellent contribution.
I like the schema approach. However, I'm not sure if I'll go ahead with implementing something similar in the near feature since iuliia already offers that elegant alternative for Russian transliteration.
from cyrillic-transliteration.
Related Issues (14)
- File and command line? HOT 11
- Cyrillic letter ะช transliterate it to #. HOT 2
- Latin letter "q" is not transliterated to Russian HOT 2
- Why not combine all *_CYR_TO_LAT_DICTs? HOT 2
- Reference of mappings HOT 2
- Handle vowels with accent marks HOT 1
- Greek support HOT 1
- H is not getting transliterated to russian HOT 3
- Ukrainian support: doesn't process ะ (yi) letter HOT 4
- Support casing variations for 2 letters in latin from Mongolian HOT 2
- Non existing examples HOT 1
- ImportError: No module named 'mapping' HOT 9
- Python 3: 'str' object has no attribute 'decode' HOT 4
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
๐ Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. ๐๐๐
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google โค๏ธ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from cyrillic-transliteration.