Comments (9)
What do you exactly mean by "meaningless"? can you quote a couple here? on which data did you train your model?
from dl4mt-tutorial.
I trained the europarl as is in the session 1 script files. Meaningless in the sense that almost no sentence has been translated correctly, a bunch of UNK too.
below first 10 lines of each newstest2011.en.tok and newstest2011.trans.fr.tok
What will they do ? CSSD lacks knowledge of both Voldemort and candy bars in Prague
New Councilors of CSSD will most probably have to overcome certain language barriers to understand their old-new colleagues from ODS in Prague Council and municipal council .
Aktuálně.cz " tested " the Social Democrat members of the new Council in terms of the well-established slang that originated in the town hall during the few last years , when Prague was ruled by the current coalition partners .
Coded vocabulary that was established by Prague political elite during the previous era of the mayor Pavel Bem , describes some of the most famous persons , situations and affairs in the city .
Surprisingly , it turned out that the new council members do not understand the well-known concepts .
At least they say so .
" Who is Voldemort ? "
" I really do not know . "
" I 'm rather a novice in Prague politics , " responded Lukas Kaucky , the Councilor for culture , to the test of " Godfather " vocabulary .
And even though he is a political veteran , the Councilor Karel Brezina responded similarly .
Qu' est-ce qu' ils UNK ?
Peut-être membres .
UNK " .
La plupart des affaires étrangères .
comprendre que les membres ont pas comprendre que les membres ont pas comprendre que les membres ont pas comprendre que les membres ont pas comprendre que les membres ont pas comprendre que les membres ont pas comprendre que les membres ont pas comprendre que les membres ont pas comprendre que les membres ont pas comprendre que les membres ont pas comprendre que les membres ont pas comprendre que les membres ont pas comprendre que les membres ont pas comprendre que les membres ont pas comprendre que les membres ont pas comprendre que les membres ont pas comprendre que les membres ont pas comprendre que les membres ont pas comprendre que les membres ont pas comprendre que les membres ont pas comprendre que les membres ont pas comprendre que les membres ont pas comprendre que les membres ont pas comprendre que les membres ont pas comprendre que les membres ont pas comprendre que les membres ont pas comprendre que les membres ont pas comprendre que les membres ont pas comprendre que les membres ont pas comprendre que les membres ont pas comprendre que les membres ont pas comprendre que les membres ont pas comprendre que les membres ont pas comprendre que
moins .
UNK " ?
"
Selon moi , je cite , je considère que la culture " UNK .
Et pourtant une réponse .
from dl4mt-tutorial.
re ran it with GPU GTX 980 ti .... ah ah amazingly fast.
but still stopping after epoch 2 updates 140000 and bad translations.....
what is wrong ?
from dl4mt-tutorial.
@vince62s, from my experience, it is not very beneficial to early stop according to validation set log probabilities. I may suggest using BLEU for that, but anyways, may be you can increase patience
to let it train a little bit more.
Also, session1 is just a warm-up for session2, is there a particular reason that you're not using session2 for nmt?
from dl4mt-tutorial.
OK I will run session2.
now you trigger my curiosity.
How do I stop learning based on BLEU, is this embedded already in the code ?
if not and using "patience" what kind of good value do we need to use ?
I see 10 in one place and 1000 in another place. What does this mean ?
from dl4mt-tutorial.
Early stopping based on BLEU is straightforward if you do it in the training loop, which means that you effectively pause the training, and call translate.py
, then call a script (or a function) to compute BLEU given translation and reference files. Keep track of the BLEU scores as it is already done for validation log-probabilities, and finally early stop according to patience.
If you don't want to pause the training, since translating a validation set takes time, you can save your model parameters without overwriting each other (giving different names for each params.npz
). Then another process can be employed to compute BLEU scores using the saved model parameters.
Actually adding this feature makes the code complicated and this codebase provides a starter/reference implementation of NMT. If you want to have it in your fork, you can check these two reference implementations, in blocks and groundhog
This is also what @amirj asked in #33 .
from dl4mt-tutorial.
thanks.
otherwise regarding the "patience" setting, what 10 or 1000 relates to ?
from dl4mt-tutorial.
@orhanf can we make the code so that we can optionally save all the intermediate models to be evaluated later manually for early-stopping based on BLEU?
from dl4mt-tutorial.
@kyunghyuncho i've added an option overwrite
to save the model parameters according to the iteration number without overwriting each other, also synched session 2 and 3 fixing #29
from dl4mt-tutorial.
Related Issues (20)
- Why the grads have to be shared? HOT 2
- Cost is Nan after one epoch if maxlen > 50 HOT 3
- Where is dataset='/ichec/work/dl4mt_data/nec_files/wiki.tok.txt.gz'? HOT 2
- discrepancy between paper and code HOT 1
- NaN detected HOT 1
- L2 regularization on bias terms?
- Unnecessary bias term? HOT 1
- Asymmetry in read gate application
- dim == dim_nonlin and nin == nin_nonlin must be always true?
- How to build the dataset 'all.en.concat.gz.pkl' in session2/train_nmt_all.py? HOT 1
- Random Translations? HOT 5
- ValueError: unsupported pickle protocol: 3 HOT 2
- why convert the value of matrix to the type with astype('float32')? HOT 2
- a detailed description about param_init_gru and gru_layer HOT 2
- the value of TensorType(float32, 3D) HOT 2
- compute word probabilities HOT 2
- Maybe it's time to upgrade to Python 3 and ditch Python 2 support? HOT 1
- By condition what does it mean in this tutorial?
- Can anyone provide GPU version of translate.py? HOT 1
- TypeError
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from dl4mt-tutorial.