SparkBWT

About the project

SparkBWT is a tool for computing the Burrows-Wheeler transform (BWT) on the Apache Spark framework.
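
For readers new to the transform, the BWT of a string is the last column of the lexicographically sorted list of its rotations. The sketch below is a naive single-machine illustration of that definition only; it is not the distributed algorithm SparkBWT implements. The class name NaiveBwt and the '$' end marker are assumptions made for this example, and the input is assumed not to contain '$'.

```java
import java.util.Arrays;

class NaiveBwt {
    // Assumed end marker: must not occur in the input and must sort before every input character.
    static final char SENTINEL = '$';

    // Builds every rotation of text + SENTINEL, sorts them, and returns the last column.
    // Quadratic memory use, so suitable only for small illustrative inputs.
    static String transform(String text) {
        String s = text + SENTINEL;
        int n = s.length();
        String[] rotations = new String[n];
        for (int i = 0; i < n; i++) {
            rotations[i] = s.substring(i) + s.substring(0, i);
        }
        Arrays.sort(rotations);
        StringBuilder lastColumn = new StringBuilder(n);
        for (String rotation : rotations) {
            lastColumn.append(rotation.charAt(n - 1));
        }
        return lastColumn.toString();
    }

    public static void main(String[] args) {
        System.out.println(transform("banana")); // prints annb$aa
    }
}
```

Sorting whole rotations is what makes this approach impractical for long sequences, which is the kind of cost a distributed implementation aims to avoid.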

Built with

The application is built with Maven and written mainly in Java and Scala. Performance-critical parts are written in C/C++ and integrated through JNI. The application runs on the Apache Spark framework.

Structure

The source code is in the src folder. It contains the main folder, which holds the application code, and the test folder, which holds the tests for the classes in main.

In src/main we can find:

  • java contains the JNI glue code and the code for the CLI.
  • native contains the native code, that is, the C++ sorting procedure based on the Radix-Sort algorithm.
  • scala contains the implementation of the algorithm in Apache Spark.
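
As a rough illustration of the Radix-Sort idea mentioned for the native folder, here is a minimal least-significant-digit radix sort. It is written in Java for readability and sorts plain non-negative integers; it is not the project's C++ procedure, and the class name RadixSortSketch is an assumption made for this example.

```java
import java.util.Arrays;

class RadixSortSketch {
    // Sorts non-negative ints in place, one byte (base 256) per pass,
    // starting from the least significant byte. Each pass is a stable counting sort.
    static void sort(int[] a) {
        int[] buffer = new int[a.length];
        for (int shift = 0; shift < 32; shift += 8) {
            int[] count = new int[257];
            // Histogram of the current byte, offset by one slot.
            for (int v : a) {
                count[((v >>> shift) & 0xFF) + 1]++;
            }
            // Prefix sums: count[d] becomes the first output index for byte value d.
            for (int d = 0; d < 256; d++) {
                count[d + 1] += count[d];
            }
            // Stable placement into the buffer, then copy back.
            for (int v : a) {
                buffer[count[(v >>> shift) & 0xFF]++] = v;
            }
            System.arraycopy(buffer, 0, a, 0, a.length);
        }
    }

    public static void main(String[] args) {
        int[] data = {170, 45, 75, 90, 802, 24, 2, 66};
        sort(data);
        System.out.println(Arrays.toString(data));
    }
}
```

Radix sort avoids element-to-element comparisons, so its running time grows linearly with the number of keys for a fixed key width.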

Getting started

Prerequisites

The project can be built automatically with Maven, provided the following tools are installed on the system:

  • make
  • g++

To build on Windows, use MinGW and CMake.

Build

To build the project from the command line:

git clone https://github.com/MR6996/spark-bwt
cd spark-bwt
mvn package -P [profile]

The available profiles are window and linux; choose the one that matches your operating system.

The build creates a target folder containing the jar file needed to run the application (it should be named spark-bwt.jar).

Usage

The tool is launched with spark-submit, the launcher provided by Apache Spark. It can run on a YARN cluster, and the usual spark-submit options can be used for configuration. A typical invocation is:

spark-submit [options] spark-bwt.jar <filename>

For help:

spark-submit spark-bwt.jar -h

License

The project is distributed under the GPL v3 license.

References

[1] Mario Randazzo, Simona E. Rombo. A Big Data Approach for Sequences Indexing on the Cloud via Burrows Wheeler Transform.
