Giter VIP home page Giter VIP logo

rntn-spark's Introduction

rntn-spark

Description

Repository for MTA final Msc. project: Distributed RNTN.
The purpose of this project is to implement the Recurssive Neural Tensor Network (RNTN) for sentiment analysis as described in the paper by R. Socher in a distributed manner using Apache Spark.
We are following the Downpour paradigm described by Jeffrey Dean from google and implemented by Dirk Neumann's DeepDist project.

Please bare in mind: This is a work in progress! This is, by no means, a download-and-run project.

pre-requites and setup instructions

  1. RNTN

  2. Download/clone the forked semantic-rntn project to every node on your cluster. This is based on the original semantic-rntn project. The only difference is that I have taken the existing project and turned it into a module, thus enabling it to be installed and managed on all nodes of the cluster.

  3. Install by running:
    python setup.py install

  4. DeepDist

  5. At the moment, some updates are needed in order to run RNTN using DeepDist. Those updates are available from my forked Deepdist project. Until my pull requests are approved, Download/clone the forked DeepDist project to every node on your cluster.

  6. Install by running:
    python setup.py install

  7. Spark

  8. Follow the instructions on Downloading and installing Spark from the documentation. Make sure you know the paths to pyspark and py4j.

  9. rntn-spark

  10. Download/clone the rntn-spark project (this).

  11. In the configuration file: update the paths to Spark's python and py4j paths and set the app name.

  12. Update the sparkrunner.sh script with your master address and port.

  13. Run:
    ```sh sparkrunner.sh``

Support

Please use github's issues to report troubles.

rntn-spark's People

Contributors

urirosenberg avatar

Stargazers

 avatar  avatar

Watchers

 avatar  avatar  avatar

Forkers

iron-fe uripab

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.