Giter VIP home page Giter VIP logo

nltk-based-word-checking-dic-method's People

Contributors

anik101 avatar joel-hanson avatar mahikanthnagy avatar rra94 avatar samsomyajit avatar sayanmondal2098 avatar soumakofc avatar

Watchers

 avatar

nltk-based-word-checking-dic-method's Issues

Problem Statement we would like to analyze the queries that the consumers of an FMCG company type and respond accordingly with relevant information.

  1. find Various keywords that are primary concern of the consumers or seeking for information.
    These can be put under various categories such as brand, product line and sub-product line,
    usage, etc;
  2. Examples of inputs (searched phrases) that the consumers may type in
  3. Examples of formation of composite keywords (phrases) out of keywords and thus
    enhancing the list of keywords (‘shoe polish’ out of ‘shoe’ and ‘polish’)
  4. Examples of Synonyms
    We need to create an optimal set of categories (buckets) in the range of 10 to 20 (e.g. brand,
    product line, price, region, etc.) and an optimal set of sub-categories under each category (e.g.
    wellness, skin care, makeup, etc under ‘product’ category; Lux, Hamam, Liril, Lakme, Lipton,
    etc sub-categories under ‘brand’ category; cheap, expensive under ‘price’ category; etc.) out of
    the Keywords; examples of the final attributes /keywords under a sub-category such as ‘loofah’,
    ‘soap’, ‘shower gel’ etc under ‘Lux’ sub-category. And you need to put all the key words in one
    or many categories and/ or sub-categories. The keywords may belong to different categories or
    sub-categories (e.g. polish can be under makeup ‘nail polish’ and also under accessories ‘shoe
    polish’).
    You may require applying more than one supervised and / or unsupervised learning
    techniques for the same.

Check and segregation with synonyms feature requested

If a word doesn't match with the dictionary , then is produce it's synonymy and check the words with positive and negative dictionary . And then separate them into two different txt file . In the "naturalprocessingwordnet.py" file you will get necessary function for synonyms .

Add a word picker feature

we have positive word dictionary,negative word dictionary and a text file(with paragraphs). we have to write a program to tokenize key words from the text file . Then match those key words with positive and negative file. if the key word is matched with any of these file the store it in destinationposfile or destinationneg file. If the key word is not found then collect synonyms of that key word and again try to match those words with positive or negative file , if match found then store that key word in a destination file and if not found then store that key word in destinationnutral file.

Remove duplicate word from log file

With help of the "finder.py" script we can create log file . But in the files there are few duplicate words . or a same word few many time . Need help to remove the duplicates .

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.