Comments (8)
@ftyers we can use secondary tags to propagate the case till the post generator and then apply it there if needed.
from apertium.
At what stage exactly and on the basis of which information? I'm thinking about how to deal with the difference between French nouns like "allemand" (the language) and "Allemand" (a person). Currently, I do this in transfer.
This is related: #75
@hectoralos I would do it in posttransfer using the LU and perhaps a 1-2 word context window.
@ftyers basically only using dictionary case and "is this a sentence end"-context and ignoring input case? We'd lose the ability to keep UPPER CASE and Titles with Titlecase but maybe that's worth the code simplification …
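To make the trade-off concrete, here is a minimal sketch of the "dictionary case plus sentence-end context" idea, assuming the target-language forms arrive as a plain list of tokens. The function name `apply_dictionary_case` and the `SENTENCE_END` set are hypothetical, chosen for illustration only; nothing here is Apertium code.

```python
# Hypothetical illustration: capitalization decided only by the dictionary
# form and by whether the token starts a sentence. Input case is ignored.
SENTENCE_END = {".", "!", "?"}  # assumed set of sentence-final tokens


def apply_dictionary_case(forms):
    """Return forms with the first word of each sentence capitalized."""
    out = []
    at_start = True
    for form in forms:
        if form in SENTENCE_END:
            # The next alphabetic token starts a new sentence.
            at_start = True
        elif form[:1].isalpha():
            if at_start:
                form = form[0].upper() + form[1:]
            at_start = False
        out.append(form)
    return out


tokens = ["il", "parle", "allemand", ".", "un", "Allemand", "arrive", "."]
print(apply_dictionary_case(tokens))
```

Because the function never sees the source text, an all-caps or title-cased input would come out in dictionary case, which is exactly the loss mentioned above.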
`lt-proc` could record the original capitalization and put that in word-bound blanks, which could then be used to determine that.
@mr-martian `lt-proc` outputs the original word form anyway, so a separate step can do the job. I actually have a branch of nno-nob that just adds tags aa/Aa/AA that way to all words (`capstag.rlx` runs after morph ana/dis), removed again in transfer. I'm considering switching to this system so we can get dictionary-based correction but keep input caps (for start of sentence or where there are several upper-cased words in a row), but have to make sure it doesn't lead to regressions first.
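As a rough illustration of the aa/Aa/AA idea, the sketch below classifies a surface form into one of the three tags and re-applies a tag to a dictionary form. It is written in Python purely for readability; the real branch uses a rule file (`capstag.rlx`) whose contents are not shown here, and the helper names `caps_tag` and `apply_caps` are hypothetical.

```python
# Hypothetical sketch of the aa/Aa/AA tagging described above; this is not
# the content of capstag.rlx, only an illustration of the classification.


def caps_tag(surface):
    """Classify a surface form as 'aa', 'Aa' or 'AA' from its letters."""
    letters = [c for c in surface if c.isalpha()]
    if not letters:
        return "aa"
    if len(letters) > 1 and all(c.isupper() for c in letters):
        return "AA"
    if letters[0].isupper():
        return "Aa"  # a single capital letter is treated as title case
    return "aa"


def apply_caps(tag, form):
    """Re-apply a caps tag to a dictionary form, keeping dictionary capitals."""
    if tag == "AA":
        return form.upper()
    if tag == "Aa":
        return form[:1].upper() + form[1:]
    return form  # 'aa': leave the dictionary case untouched


for word in ["hus", "Hus", "HUS"]:
    print(word, caps_tag(word), apply_caps(caps_tag(word), "huset"))
```

With the tag `aa`, a dictionary form such as "Oslo" keeps its capital, which is the "dictionary-based correction" part; `Aa` and `AA` carry the input capitalization through.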
Processor added in 7e7004d