neural-semantic-relatedness

Analysis on Semantic Relatedness with Attentive Tree LSTM over BiLSTM

In this project we explored the state-of-the-art BiLSTM model to understand what information it captures and which applications it suits. We also explored current TreeLSTM models and their applications. We then built our own model, which combines a BiLSTM with a TreeLSTM into a new hierarchical model that uses an attention mechanism in the TreeLSTM to emphasize certain words. Using both attention and the BiLSTM, our model improved on the current TreeLSTM and BiLSTM accuracies by 1.02% on the semantic relatedness task. We observed no significant improvement on the contextual similarity task with the Quora dataset. Overall, the results show that the BiLSTM trained on outputs from the attentive TreeLSTM outperforms or is on par with all the other models.
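The attention formulation is not spelled out here, so the following is only a minimal sketch of how attention can emphasize some node states over others. It assumes plain dot-product attention followed by a softmax; the function names `softmax` and `attend`, the `query` vector, and the toy values are all hypothetical and are not taken from the project's code.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def attend(child_states, query):
    """Weight hidden states by dot-product similarity to a query vector.

    child_states: list of equal-length vectors (hypothetical TreeLSTM node outputs)
    query: vector of the same length (hypothetical sentence-level summary)
    Returns (weights, context), where context is the weighted sum of the states.
    """
    # One scalar score per state: the dot product with the query.
    scores = [sum(h_d * q_d for h_d, q_d in zip(h, query)) for h in child_states]
    weights = softmax(scores)
    dim = len(query)
    # Weighted sum of the states, computed one dimension at a time.
    context = [
        sum(w * h[d] for w, h in zip(weights, child_states)) for d in range(dim)
    ]
    return weights, context


# Toy example: three 2-dimensional node states and one query vector.
children = [[1.0, 0.0], [0.0, 1.0], [0.5, 0.5]]
query = [2.0, 0.0]
weights, context = attend(children, query)
```

In this toy example the first state aligns best with the query, so it receives the largest weight and dominates the context vector. In a model like the one described above, a sequence of such attended states could then be fed to the BiLSTM.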

Usage

Running baselineBiLSTM

python3 baselineBiLSTM.py

Running baselineTreeLSTM

sh fetch_and_preprocess.sh

The script downloads the following:

  • Stanford parser
  • Stanford POS tagger
  • GloVe vectors
  • SICK dataset (semantic relatedness task)

python3 main.py

Running treelstm_similarity

SICK dataset

sh fetch_and_preprocess.sh

The script downloads the following:

  • Stanford parser
  • Stanford POS tagger
  • GloVe vectors
  • SICK dataset (semantic relatedness task)

python3 main.py

Quora dataset

The Quora dataset is already included in the repository.

sh quora_preprocess.sh

python3 quora_main.py
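For convenience, the two treelstm_similarity pipelines above can be collected into a small helper that returns the commands in the order they should run. This is only a sketch: the `PIPELINES` table and the `commands_for` function are hypothetical names, and the script names are copied from the usage notes rather than checked against the repository.

```python
import shlex

# Commands copied from the usage notes; grouping them under the
# dataset keys "sick" and "quora" is an assumption for illustration.
PIPELINES = {
    "sick": ["sh fetch_and_preprocess.sh", "python3 main.py"],
    "quora": ["sh quora_preprocess.sh", "python3 quora_main.py"],
}


def commands_for(dataset):
    """Return the argv lists to run, in order, for the given dataset key."""
    try:
        steps = PIPELINES[dataset.lower()]
    except KeyError:
        raise ValueError(f"unknown dataset: {dataset!r}") from None
    return [shlex.split(step) for step in steps]


# Dry run: print the Quora steps instead of executing them.
for argv in commands_for("quora"):
    print(" ".join(argv))
```

To actually run a pipeline, each returned list could be passed to `subprocess.run(argv, check=True)` from the directory that contains the scripts, so that a failing preprocessing step stops the run before training starts.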

References:

  1. The baseline BiLSTM code is adapted from https://github.com/eliorc/Medium/blob/master/MaLSTM.ipynb and modified as needed.
  2. The baseline TreeLSTM code is adapted from https://github.com/dasguptar/treelstm.pytorch

License

Apache
