Giter VIP home page Giter VIP logo

negativedatasets's Introduction

NegativeDatasets

Citation

Katarzyna Sidorczuk, Przemysław Gagat, Filip Pietluch, Jakub Kała, Dominik Rafacz, Laura Bąkała, Jadwiga Słowik, Rafał Kolenda, Stefan Rödiger, Legana C H W Fingerhut, Ira R Cooke, Paweł Mackiewicz, Michał Burdukiewicz, Benchmarks in antimicrobial peptide prediction are biased due to the selection of negative data, Briefings in Bioinformatics, 2022;, bbac343, https://doi.org/10.1093/bib/bbac343.

Getting started

This repository contains the data and code necessary to reproduce the results from the paper Benchmarks in antimicrobial peptide prediction are biased due to the selection of negative data. It uses renv and targets packages to control the workflow and assure the reproducibility.

Some of the data files are too large to store them on GitHub but they can be downloaded using the links below:

  • UniProt data - Data directory with reviewed sequences and their annotation downloaded from UniProtKB release 2020_06. These sequences and their annotations were used to create negative data sets in our study.

  • Prediction results for architectures - Results directory with prediction results for all 660 models trained and tested in our study. These files are necessary for calculation of models’ performance and generation of plots and tables from the paper.

To reproduce the results clone the repo, set your path to the directories with data files and:

renv::restore()
targets::tar_make()

Content

_targets.R - reproducible pipeline for generation of all data sets and results processing,

data - data files used during the study, e.g. for creation of the positive dataset,

drafts - draft codes used for initial exploratory analyses,

functions - all functions used for running the pipeline and obtaining results,

presentations - presentation files for this project,

renv - renv package files,

reports - reports with initial analyses,

third-party - third-party executables used in the pipeline.

Important links

Contact

If you have any questions, suggestions or comments, contact Michal Burdukiewicz.

negativedatasets's People

Contributors

agosiewska avatar dominikrafacz avatar erdaradungaztea avatar ksidorczuk avatar michbur avatar

Watchers

 avatar  avatar  avatar  avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.