This project forked from asmehra95/wiseowl

License: MIT License

WiseOwl

WiseOwl is a fact-based question answering system. It uses Apache Solr as the backend search engine, Wikipedia dumps as the information source, and Apache Velocity, HTML, and CSS for the web interface. The project also uses Linux bash scripts for tasks such as starting, stopping, training, and indexing.

Question Answering

Features:

  • Fast and reliable searching using the open source Apache Solr 6.3.0 and Apache Lucene 6.3.0 projects. Apache Solr serves as the search engine and relies on Apache Lucene to provide searching.
  • Custom query parser based on Apache Lucene 6.3.0, optimized for question answering.
  • Named entity recognition and time normalization during indexing using Stanford CoreNLP.
  • Automatic cleaning and parsing of raw Wikipedia text from the Wikipedia dumps, using the Lucene 6.3 benchmark classes and the WikiClean project.
  • Answer type classification of a given question using Apache OpenNLP's Maxent models. The models are trained on data taken from the thesis by Tom Morton, in which around 1800 hand-picked questions are tagged.
  • The project is currently better optimized for description-type answers.
  • Sleek user interface combining CSS, HTML, and Apache Velocity.
  • Bash script that uses the underlying Solr scripts to start, stop, index, and train.

![Date Normalization](screenshots/date_norm .jpg)

Name Detection

Installation

Throughout this document, WiseOwl_HOME is the base directory for all the WiseOwl code.

Required Downloads


WiseOwl uses Maven for building and running the code. To get started, you will need:

  1. JDK 1.7+

  2. Maven 3.0 or higher

  3. The OpenNLP English models, available at http://opennlp.sourceforge.net/models-1.5/

    Download all the models and place them in WiseOwl_HOME/opennlp-models. Then copy the ans.bin model from the repository to WiseOwl_HOME/opennlp-models/qa

  4. WordNet 3.0, placed in the WiseOwl_HOME root directory. It usually comes as a compressed archive, so unpack it with gunzip or another utility. It is available at http://wordnetcode.princeton.edu/3.0/WordNet-3.0.tar.gz, or can be found with a simple web search.

  5. Apache Solr 6.3.0, available at http://lucene.apache.org/solr/. Copy the contrib, server, and dist folders from the Solr 6.3.0 package to the WiseOwl_HOME root directory.
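
The list above can be checked with a small script. The following is a minimal sketch, not part of the project: the function name check_wiseowl_layout and the WISEOWL_HOME environment variable are hypothetical, and WordNet-3.0 is only the directory name the WordNet archive is assumed to unpack to. Adjust the paths if your layout differs.

```shell
#!/bin/sh
# Sketch: report which required downloads are not yet in place.
# WISEOWL_HOME and the WordNet-3.0 directory name are assumptions.

check_wiseowl_layout() {
  wo_home="$1"
  wo_missing=0
  for wo_path in \
    opennlp-models \
    opennlp-models/qa/ans.bin \
    WordNet-3.0 \
    contrib \
    server \
    dist
  do
    # Print one line per path that does not exist yet.
    if [ ! -e "$wo_home/$wo_path" ]; then
      echo "missing: $wo_path"
      wo_missing=1
    fi
  done
  # Exit status 0 means everything was found, 1 means something is missing.
  return "$wo_missing"
}

# Check the directory named by WISEOWL_HOME (defaults to the current directory).
if check_wiseowl_layout "${WISEOWL_HOME:-.}"; then
  echo "All prerequisites found."
fi
```

Running it from the WiseOwl_HOME root directory before building gives a quick list of anything still to be downloaded or copied.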

Building the Source


If you are unfamiliar with Maven, it may be wise to read this guide before building the source to avoid future hassles: http://maven.apache.org/guides/getting-started/maven-in-five-minutes.html

To build the source, run the following in WiseOwl_HOME:

mvn clean package
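
Before building, it can help to confirm that the JDK and Maven are actually on the PATH. The following is a minimal sketch under that assumption. The helper name require_cmd is hypothetical and not part of the project. It only checks that the commands exist and prints their versions, so comparing them with the requirements (JDK 1.7+, Maven 3.0 or higher) is left to the reader.

```shell
#!/bin/sh
# Sketch: confirm the build tools are on PATH before running Maven.
# require_cmd is a hypothetical helper, not part of WiseOwl.

require_cmd() {
  if command -v "$1" >/dev/null 2>&1; then
    echo "found: $1"
  else
    echo "not found: $1" >&2
    return 1
  fi
}

# Print the versions so they can be compared with the requirements
# (JDK 1.7+ and Maven 3.0 or higher).
if require_cmd java && require_cmd mvn; then
  java -version
  mvn --version
fi
```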

Running the System


  1. Go to the WiseOwl_HOME/bin directory and open a terminal there. Type:
  • ./WiseOwl : to see the available options
  • ./WiseOwl start : to start the Apache Solr server
  • ./WiseOwl stop : to stop the server
  2. Point your web browser to http://localhost:8983/solr/WiseOwl/answer

    You will see a screen similar to the one below.

Start Screen
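
The server may take a moment to come up after ./WiseOwl start. The following is a minimal sketch of waiting for it from the command line, assuming curl is installed. The helper wait_for_server and its default attempt count and delay are hypothetical choices for illustration, not part of the WiseOwl script.

```shell
#!/bin/sh
# Sketch: poll a URL until it responds, e.g. after "./WiseOwl start".
# wait_for_server is a hypothetical helper, not part of the WiseOwl script.

wait_for_server() {
  ws_url="$1"
  ws_attempts="${2:-30}"   # how many times to try
  ws_delay="${3:-2}"       # seconds to sleep between tries
  ws_i=0
  while [ "$ws_i" -lt "$ws_attempts" ]; do
    # -f makes curl fail on HTTP errors; --max-time bounds each try.
    if curl -s -f -o /dev/null --max-time 5 "$ws_url" 2>/dev/null; then
      echo "server is up: $ws_url"
      return 0
    fi
    ws_i=$((ws_i + 1))
    sleep "$ws_delay"
  done
  echo "no response from $ws_url after $ws_attempts attempts" >&2
  return 1
}

# Typical use from WiseOwl_HOME/bin:
#   ./WiseOwl start
#   wait_for_server "http://localhost:8983/solr/WiseOwl/answer"
```

Once the function reports that the server is up, the page can be opened in the browser as described above.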

Special Thanks:

We are greatly thankful to Grant Ingersoll for the open source project accompanying the book Taming Text. The work is really great, and we have built on its code. It was a great help and learning experience for us. The original project can be found at: https://github.com/tamingtext/book

Contributors

asmehra95, sumitbanger, chrismattmann