View Code? Open in Web Editor
NEW
This project forked from klingtnet/dh-project-ws14
project for the digital humanities course at university of leipzig in ws 2014
License: MIT License
dh-project-ws14's Introduction
project for the digital humanities course at university of leipzig in ws 2014
- requires
python3
and pip
- @github [private]
- on wikipedia
- function word
- only meaning in combination with other words or phrases
- seperate part of speech
- position in text is relevant, before a noun and after an adjective for example
- typically words that encode grammatical categories, such as negation (, mood, tense, or case not in greek!)
- sentences are everything between
.!?
(note that in greek the punctuation is different):
- period:
.
- comma:
,
- question mark:
;
- semicolon:
·
,
commas will be ignored (temporarily)
- authorship attribution from stylistic analysis based on particle distribution
- frequency
- position in text
- relative position to other words
- one word before and one word after
- relative position to the beginning of the sentence, or subsentence
,
- 5 (or 6) semantical categories of particles
- make it possible to search for particles that are in a specific semantical category
- maybe: development of particle usage over time and in different genres
- understanding the meaning of abbreviations from the part of speech
- getting the list of particles and their variations
- accent
- combinations with other particles
- combinations with other words
- run tests with
py.test
, make sure that you have installed all the dependencies with pip install -r /path/to/requirements.txt
dh-project-ws14's People
Contributors
Watchers