Giter VIP home page Giter VIP logo

biom-io's Introduction

biom-io

In order to run large datasets (e.g. Global soil), increase heap space export NODE_OPTIONS="--max-old-space-size=6144" # Increase to 6 GB

Test result, Global soil:

Reading Global Soil dataset to Biom
Taxa: 722682 samples: 3200
toBiom: 11:55.952 (m:ss.mmm)
addReadCounts: 1:00.674 (m:ss.mmm)

biom-io's People

Contributors

thomasstjerne avatar

Stargazers

Daniel Swan avatar

Watchers

 avatar

Forkers

gbif

biom-io's Issues

Services for data review

Setup webservices that sends data from the hdf5 file to the client for geojson, taxonomic similarity between samples, taxonomy burst charts etc

Integrate to seq id backend for assigning taxonomy

If a user inputs data based on ITS,16s or COI markers, it should be optional to assign kingdom, phylum, order,family, genus, scientificName from the seq id webservice.

We will need a set of cutoff rules for species level (OTU), genus, family etc
i.e. if a match is 97% can we infer the genus and use that as a scientificName ?
Could be sth like:

  • > 99% - its a match to the OTU
  • 95 - 99% we snap to the genus
  • 90 - 95% we snap to the family

Can we use one ruleset for all databases avilable from the seq id backend? Or do we need a ruleset for each database?

Wipe files when processing is started

Use case:

  1. User have uploaded, mapped and processed data
  2. During the review the user finds something is not right
  3. User goes back and changes something (replace a file, edit the mapping...)
  4. User clicks the process button again

In this scenario, the the BIOM files, DWC files and the Processing report must be wiped before the new processing starts

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.