Eldamo To Anki

Takes the marvellous wordlist from eldamo.org and converts it into input digestable by Anki.

Usage

The decks for Neo-Quenya and Neo-Sindarin based on the lists in this repository can be found on Anki (unless they get deleted because they do not receive enough downloads).

Some lists can be found in the output folder of this repository. They are ready to be imported. They do not include any names or phrases. The Neo-Quenya and Neo-Sindarin lists do not include deprecated words.

The lists are:

Adunaic (ca. 180 cards)
Black Speech (ca. 40 cards)
Early Noldorin (ca. 700 cards)
Early Quenya (ca. 3100 cards)
Gnomish (ca. 2700 cards)
Khuzdul (ca. 40 cards)
Middle Quenya (ca. 1700 cards)
Noldorin (ca. 1300 cards)
Primitive Elvish (ca. 800 cards)
Neo-Primitive Elvish (ca. 800 cards)
Quenya (ca. 2200 cards)
Neo-Quenya (ca. 5300 cards)
Sindarin (ca. 1200 cards)
Neo-Sindarin (ca. 3100 cards)
Telerin (ca. 200 words)

Thanks to the very structured input data curated by Paul Strack, it is extremely easy to add more languages to that list. Just drop me an issue and I'll do that for you.

If you want to curate your own version of a list you can use the generate.py script to do that. It is called from the command line via:

python3 generate.py <language>

Depending on your Python install, the first command may be py or python instead.

For the <language> argument, type the name of the language, or its id (usually its first letter).

You can add optional arguments:

--neo: Assemble Neo-Eldarin lists, drawing from words invented by Tolkien from the 1930s onwards, as well as fan-invented words.
--individual-names: Include names of individuals and places.
--collective-names: Include names for collective people.
--proper-names: Include proper names.
--phrases: Include phrases.
--include-origin: Include the linguistic origin of the word in the card.
--include-deprecated: Include words that Paul Strack has marked as deprecated in neo lists.
--check-for-updates: Forces a re-download of the Eldamo database.
--verbose: Print more output.

You can check out the generate_all.sh script for example usages.

Neo-Quenya draws from words from (Late) Quenya, Middle Quenya, and fan inventions.

Neo-Sindarin draws from words from Sindarin, Noldorin, and fan inventions.

Design Decisions

In the simplest case, the generated words are given without any further adjustments for the Tolkienian language. The English translation lists the part of speech:

corto|circle (n)

costaima|debatable (adj)

If a word is listed with a dedicated word stem, that stem is appended in parentheses:

oron (oront-)|mountain (n)

Some words have several translation. The script tries to make the Tolkienian side unique, first by checking if the part of speech will do that:

cuiva (adj)|awake (adj)

cuiva (n)|animal (n)

If this does not suffice to make the words unique, and if a category is provided for that word, then the latter is used instead:

au (Mind and Thought)|if only (adv)

au (Spatial Relations)|away, off, not here (of position) (adv)

Finally, if this doesn't help as well, the translations are merged into one:

hyarna|compact, compressed; southern (adj)

This last step is also true for English words with several Tolkienian translations:

artatúrë; ohérë|government (n)

Some Tolkienian words are listed with variant versions. The script recognises this and treats them as a single word, so the inputs lá and (a)lá are listed as one:

(a)lá|yes (interj)

Some English translations are prepended with the marker * or ?, denoting some uncertainty. These markers are retained, unless the word is listed more than once, and at least one translation does not have this marker:

canya-|?to command (vb)

Several words are provided with additional information on the spelling in Tengwar, if it deviates from the default. This information is appended in brackets:

isilmë [þ]|moonlight (n)

nairë [ñ-]|space (as a physical dimension) (n)

Special Treatment for Quenya

The list also contains some archaïc words which still incorporate the old spelling. To reduce duplicated information, the script recognises these and derives the Tengwar annotations. Since this treatment needs to happen on a per language and per sound basis, it is currently implemented only for my personal use-case (Neo-)Quenya. The relevant linguistic information is taken from the Eldamo Quenya course.

Any þ is replaced with s, so minaþurië becomes:

minasurië [þ]|enquiry (n)

Initial ñ- is replaced with n-, turning ñwalmë into:

nwalmë [ñ-]|torment (n)

The rules for w are more complicated. Any w following a consonant or the diphthongs ai or oi is retained, any other w is replaced with v.

Because the archaïc w-origin of v is not represented in Tengwar, it is also not included in the output:

artanwa|award (n)

maiwë|gull (n)

oiwa|glossy (adj)

lassevinta|leaf fall, autumn, *(lit.) leaf blowing (n)

vilya|air, sky (n)

The latin transcription of Quenya words changed throughout Tolkiens life. The script makes several replacements to normalise the spelling:

kw and standalone q become qu.
ks becomes x.
k in other positions becomes c.
the non-diphthong vowel combinations are spelled ëa, ëo, ië and öa.
a trailing e becomes ë.

Acknowledgments

Almost all the credit here goes to Paul Strack, maintainer of the Eldamo website and database. They gathered all canonical Tolkienian words in one place, collected thousands of fan-made extensions, and organise it all in the structured xml format. Finding this database made writing this script pure bliss.

thecomamba / eldamotoanki Goto Github PK

eldamotoanki's Introduction

Eldamo To Anki

Usage

Design Decisions

Special Treatment for Quenya

Acknowledgments

eldamotoanki's People

Contributors

Watchers

eldamotoanki's Issues

Recommend Projects

Recommend Topics

Recommend Org