Dev / Test set? about 1-billion-word-language-modeling-benchmark HOT 4 CLOSED

ciprian-chelba commented on June 6, 2024

Dev / Test set?

from 1-billion-word-language-modeling-benchmark.

Comments (4)

ciprian-chelba commented on June 6, 2024

Yes, news.en.heldout-00000-of-00050 is the test set on which we report results. As a sanity check you should get the same number of predicted tokens. The rest can be used for any other purposes: e.g. validation set(s) or to estimate variance of your estimates, etc.

…

On Fri, Jun 15, 2018 at 5:30 PM Haibin Lin ***@***.***> wrote: I noticed that there're ~50 files under heldout-monolingual.tokenized.shuffled folder. Which ones of them is meant for test data? Is heldout-monolingual.tokenized.shuffled/news.en.heldout-00000-of-00050 for testing while the rest of heldout-monolingual.tokenized.shuffled/news.en.heldout-00* can be used as validation set? — You are receiving this because you are subscribed to this thread. Reply to this email directly, view it on GitHub <#4>, or mute the thread <https://github.com/notifications/unsubscribe-auth/AK_u-_nax3joRTtl5jrGvX_AZMRCzUCjks5t9FGfgaJpZM4UqSWB> .

-- -Ciprian

from 1-billion-word-language-modeling-benchmark.

eric-haibin-lin commented on June 6, 2024

Thanks! For the test set, are we expected to read it in sequential order for evaluation? I see in https://github.com/tensorflow/models/tree/master/research/lm_1b it reads a random file each time, leading to slightly different test ppl each time.

from 1-billion-word-language-modeling-benchmark.

ciprian-chelba commented on June 6, 2024

The test set is only one file. Sentences are randomized across all the corpus, so the sentence order should not matter either, there should be no information across sentence boundaries.

…

-Ciprian Nexus 5 On Jun 28, 2018 08:56, "Haibin Lin" <[email protected]> wrote: Thanks! For the test set, are we expected to read it in sequential order for evaluation? I see in https://github.com/tensorflow/models/tree/master/research/lm_1b it reads a random file each time, leading to slightly different test ppl each time. — You are receiving this because you commented. Reply to this email directly, view it on GitHub <#4 (comment)>, or mute the thread <https://github.com/notifications/unsubscribe-auth/AK_u--gUO_x9nadZbWGl13MOY2LuED6Oks5uBG_2gaJpZM4UqSWB> .

from 1-billion-word-language-modeling-benchmark.

eric-haibin-lin commented on June 6, 2024

Thanks!

from 1-billion-word-language-modeling-benchmark.

Dev / Test set? about 1-billion-word-language-modeling-benchmark HOT 4 CLOSED

Comments (4)

Related Issues (7)

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent