Giter VIP home page Giter VIP logo

parallel-divsufsort's Introduction

Description

Parallel-DivSufSort is a parallel lightweight suffix array construction algorithm for byte alphabets written in C++. It is a implementation based on:

A detailed description and benchmarks of the algorithm can be found in the following work.

Julian Labeit, Julian Shun, and Guy E. Blelloch. Parallel Lightweight Wavelet Tree, Suffix Array and FM-Index Construction. DCC 2015.

Installation

The following steps have been tested on Ubuntu 14.04 with gcc 5.3.0 and cmake 2.8.12.

git clone https://github.com/jlabeit/parallel-divsufsort.git
cd parallel-divsufsort
mkdir build
cd build
cmake ..
make
make install

Note that in the default version the cilkplus implementation by gcc is used for parallelization. To change this setting edit parallelization settings in the CMakeLists.txt file.

Getting Started

An example application can be found in examples/main.cpp. The library provides two basic functions to build the suffix array over a text.

// 32 bit version.
uint8_t divsufsort(const uint8_t *T, int32_t *SA, int32_t n);
// 64 bit version.
uint8_t divsufsort(const uint8_t *T, int64_t *SA, int64_t n);

To use the library include the header divsufsort.h, link against the library divsufsort and libprange.

Benchmarks

See Benchmarks page for details.

parallel-divsufsort's People

Contributors

jlabeit avatar kloetzl avatar y-256 avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar

parallel-divsufsort's Issues

Visual C++ build?

Greetings,

I am interested in using this for a research project for which I build suffix arrays and inverse suffix arrays. I have so far been using the original divsufsort and although it seems to have some openmp capability built in, I don't seem to be able to get different performance turning openmp on and off (i.e. both compiler options give the same performance result).

My project is already built in VC++ and I was wondering if I could use your parallel version of divsufsort to improve my results. Do you have any notes or pointers for building your code in VC++ (2013 and above)? Would it be enough to swap some of the original divsufsort code with your version to get better results?

Sincerely,
Faizal

Segmentation fault

Hi,

I found another case, now with a larger instance ( ~50MB), which gives a segmentation fault.
The steps to reproduce it:

Let me know if you are able to reproduce the problem or if you need additional details,
Best,
Daniel

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.