Giter VIP home page Giter VIP logo

rake's Introduction

Rapid Automatic Keyword Extraction (RAKE)

Rake

RAKE is an algorithm for extracting keywords (technically phrases, but I don't question scientific literature) from a document that have a high relevance or importance to the contents of the document. For example, the top five significant keywords in the text:

Compatibility of systems of linear constraints over the set of natural numbers. Criteria of compatibility of a system of linear Diophantine equations, strict inequations, and nonstrict inequations are considered. Upper bounds for components of a minimal set of solutions and algorithms of construction of minimal generating sets of solutions for all types of systems are given. These criteria and the corresponding algorithms for constructing a minimal supporting set of solutions can be used in solving all the considered types of systems and systems of mixed types.

are calculated to be:

Keyword Relevance
linear diophantine equations 10.666
minimal generating sets 10.333
minimal supporting set 8.833
upper bounds 6.0
natural numbers 6.0

Installing

This library isn't on a central repo yet, so nab the JAR URL from the releases page and toss it into whatever dependency manager you're using. That, or just download the JAR.

Using the API

Using the library is super straightforward. The main class only exports one public method, getKeywordsFromText(), and needs a language code in order to run. Any constant found in RakeLanguages can be used without issue. So for example:

public class Main {
    public static void main(String[] args) {
        String languageCode = RakeLanguages.ENGLISH;
        String rake = new Rake(languageCode);
        LinkedHashMap<String, Double> results = rake.getKeywordsFromText("Your text would go here."));
    	System.out.println(results);
    }
}

Contributing

If you wish to contribute to this library, you'll need Buck to build it. Once you have Buck installed, it's as simple as:

$ buck build :rake

Credit

Algorithm taken from here.

Implementation taken from here.

rake's People

Contributors

nickersoft avatar

Watchers

James Cloos avatar  avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.