Comments (1)
-
One possible point of confusion here is that the
DATA
directories are actually parsing models (the numbers are counts, probabilities, etc.) not the actual training data (treebanks). For the includedDATA
directories, the actual training data is the Penn Treebank (EN
andLM
) and Chinese Treebank (CH
). If you have these treebanks, you can add other treebanks to them and then train a combined model. -
The training script (
trainParser
) helps construct the parsing model directories (converts the real training trees to the various files inside the model directory). See the READMEs in thefirst-stage
andTRAIN
directories for more information. See my answer in #27 for where you can download or license some treebanks.
Hope this helps -- please let me know if I can clarify anything.
from bllip-parser.
Related Issues (20)
- Facing WindowsError: [Error 32] error while running sd.sd.convert_tree HOT 4
- Can't pickle RerankingParser: attribute lookup RerankerModel on importlib._bootstrap failed
- MacOS: Compiled successfully, but module is missing HOT 3
- Biomedical named entities being treated as Cardinals HOT 2
- Will not compile on windows HOT 1
- Head node not a direct child HOT 2
- bllip parser in Server HOT 3
- Handling quotes HOT 4
- python import failing with undefined symbol HOT 2
- Compilation in OSX HOT 1
- Error downloading model (501 Not Implemented) HOT 1
- Error downloading model (101 Network is unreachable) HOT 4
- slow results in docker HOT 2
- Segmentation fault in `InputTree::printproper` when using a comprehension or loop to collect the heads of parse trees HOT 1
- Compilation errors on windows: /usr/bin/sh: -c: line 0: syntax error near unexpected token `(' HOT 2
- pip install errors on linux (requires installing gcc & gxx) HOT 1
- Python3 bindings (pip) not working HOT 3
- Compilation error when installing from pip HOT 2
- Session crashes when loading RerankingParser.from_unified_model_dir() HOT 1
- AttributeError: type object 'CharniakParser' has no attribute 'loadModel' HOT 3
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from bllip-parser.