Giter VIP home page Giter VIP logo

project1_ir's Introduction

Information Retrieval Project 1 Group 11

Michael Merki, Julien Romero, Markus Greiner

November 14, 2016

Installation

Provided is the zip file project1_group11.zip. Unzip it:

$ unzip project1_group11.zip

This creates the directory project1_group11, which contains:

  • the sources under src
  • tinyir.jar as part of lib
  • build.sbt
  • the directory labelingtestdocs which contains the three resulting test results.
  • and this README.md file.

Note: We had continuing conflicts between the tinyir and the breeze libraries. This is why we decided to build tinyir for Scala 2.11.5 and provide it as jar.

Running

To run:

$ cd project1
$ sbt "run-main Main <path-to-data-folder>"

The <path-to-data-folder> must contain the directories train, test, and validation.

It is also possible to give options to influence the iterations and learning rates of the linear regression classifier. These options are called:

  • ITERATION=<nof-iterations-integer>
  • LEARNING=<learning-rate-double>
  • SKIP=(BAYES|LINREG|SVM)+

It can be run like this:

$ sbt "run-main <path-to-data-folder> ITERATION=10000 LEARNING=0.001 SKIP=BAYES,SVM"

Results

Upon running the program, it will

  • Naive Bayes Classifier: train and generate list of tested documents and their codes
  • Logistic Regression Classifier: train and generate the list
  • SVM: train and generate the list.

The result files are called ir-project-2016-1-11-[nb|lr|lsvm].txt and they are located under labelingtestdocs

The project report is under ir-2016-1-report-11.pdf

project1_ir's People

Contributors

aunsiels avatar mmerki avatar mmgreiner avatar

Watchers

 avatar  avatar  avatar  avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.