Giter VIP home page Giter VIP logo

Comments (5)

gojomo avatar gojomo commented on August 27, 2024

I also couldn't find any equivalent to the 'Subword Information' paper's P in fastText.

But rather than modifying computeNgrams source, it should be enough to just provide an extreme -minn or -maxn parameter.

A minn larger than maxn should mean line

ngrams.push_back(nwords_ + h);
is never reached, so no word ever decomposed into char n-grams. That's make the fastText code work equivalently to classic word-only word2vec. That is, it looks like running fastText with -maxn 0 would be sufficient to disable char n-grams.

from fasttext.

Developerayo avatar Developerayo commented on August 27, 2024

Am I allowed to give a comment
On Aug 8, 2016 12:01 AM, "Gordon Mohr" [email protected] wrote:

I also couldn't find any equivalent to the 'Subword Information' paper's P
in fastText.

But rather than modifying computeNgrams source, it should be enough to
just provide an extreme -minn or -maxn parameter.

A minn larger than 'maxn' should mean line https://github.com/
facebookresearch/fastText/blob/55ba6b505004539d3621d8eb5a10d5
259faa4bb2/src/dictionary.cc#L128 is never reached, so no word ever
decomposed into char n-grams, so the fastText code equivalent to classic
word-only word2vec. So running fastText with -maxn 0 seems sufficient to
disable char n-grams.


You are receiving this because you are subscribed to this thread.
Reply to this email directly, view it on GitHub
#16 (comment),
or mute the thread
https://github.com/notifications/unsubscribe-auth/ATll0HRFVKycnJQYRPBkhM8A35lmynUhks5qdmPKgaJpZM4JelKy
.

from fasttext.

piotr-bojanowski avatar piotr-bojanowski commented on August 27, 2024

Hi,

You are right, the parameter was removed from the final implementation. Given that the code is much more efficient not than then, this parameter was not critical. We may consider re-implementing it later if useful applications are found.

Best,

Piotr

from fasttext.

Developerayo avatar Developerayo commented on August 27, 2024

I think it should be done now it's not about how many codes are there it is
ABOUT the site security because if anyone else finds out you have a hole in
your site it will bring hackers to test their skills and that is nither
good for the server on the site so I suggest it should be fixed latest
this week
On Aug 8, 2016 10:23 AM, "Piotr Bojanowski" [email protected]
wrote:

Hi,

You are right, the parameter was removed from the final implementation.
Given that the code is much more efficient not than then, this parameter
was not critical. We may consider re-implementing it later if useful
applications are found.

Best,

Piotr


You are receiving this because you commented.
Reply to this email directly, view it on GitHub
#16 (comment),
or mute the thread
https://github.com/notifications/unsubscribe-auth/ATll0BOjrm2kV25ruk4ksSThsT6PB1OKks5qdvWVgaJpZM4JelKy
.

from fasttext.

jayantj avatar jayantj commented on August 27, 2024

Thanks a lot for the helpful responses, @gojomo and @piotr-bojanowski

from fasttext.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.