Giter VIP home page Giter VIP logo

Comments (5)

d4straub avatar d4straub commented on June 21, 2024

Thanks!
The idea here could be to add a parameter, e.g. --contamination_controls "sample1,sample2", and all sequences that appear in that control samples are removed from the ASV table (including the control samples itself).
More advanced for such a task (using control samples) might be decontam which is also in bioconda.

from ampliseq.

erikrikarddaniel avatar erikrikarddaniel commented on June 21, 2024

I would absolutely recommend Decontam. We have seen in actual projects that raw removal of ASVs found in negative controls risks both to remove true ASVs found in samples and miss contaminants. This is, of course, taking Decontam as the truth, but the results have looked intuitively good.

There are at least two ways of running Decontam, and I think it would be wise to allow both.

from ampliseq.

d4straub avatar d4straub commented on June 21, 2024

Alright, thanks, then it will be not worth the effort to implement the simple method above but rather immediately a proper one such as Decontam.

from ampliseq.

skose82 avatar skose82 commented on June 21, 2024

Hi all,

I wouldn't advise decontam until everything is known about how it removes an asv - exactly. We still need a clean feature which will simply remove anything in the control samples as a first pass for comparison with a second pass without removal. This is what we did before ampliseq and what most microbiologists do with every project - scan the controls and remove what they see as a legitimate contamination. To do this removal is time intensive and tedious and then you have to replot. It would be truly worthwhile to have this feature as an option, then we can look at the output and decide if it's worth using decontam instead or not. It certainly should be an option as it currently is not an option in decontam!

from ampliseq.

d4straub avatar d4straub commented on June 21, 2024

Hm thats a rather emotional plea for a simple method. I do think that the decontam documentation is not too ambiguous. Decontam implements a method that is using control samples, see here, I am not sure what your exact criticism is?
Manual manipulation is however the worst way of data processing in my opinion, it would be in any case better to automatize, i.e. standardize and make reproducible. I can live with having optional filters available. If you or someone else wants to implements that simple method because you feel its a method with future, I will not stand in your way (I cannot speak for others though).

from ampliseq.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.