Biodiversity of mudflat intertidal viromes along the Chinese coasts

Code availability

The R code for generating the figures and performing the data analysis is in the corresponding directories (Fig.1 to Fig.5).

The raw data for generating the figures and performing data analysis with the R code can be found in the corresponding subfolder.

Data availability

The outputs generated by this study are stored in the directory named "full_output". For more details, please download the file and locate the required file in the subfolder. The detailed description of the output files is as follows:

1. AMGs_align_output

This subfolder includes the alignment results of viral proteins against several functional gene databases: eggNOG (COG function), NCycDB (nitrogen metabolism), MCycDB (methane metabolism), PCycDB (phosphorus metabolism), and SCycDB (sulphur metabolism). It also contains the normalized abundances of viral protein clusters (vPCs) and virus-encoded auxiliary metabolic genes (vAMGs).

2. amoC-pmoC_output

This subfolder includes the NCycDB alignment outputs used to differentiate amoC and pmoC genes, as well as the maximum likelihood phylogenetic tree of amoC and pmoC genes constructed with IQ-TREE. The raw protein sequences used for inferring the phylogeny and the tree visualized with iTOL are also placed in this subfolder.

3. checkv_output

This subfolder includes the full outputs of CheckV v1.0.1 ('end_to_end' mode), including the completeness, contamination, and quality summary of intertidal viruses (20,102 vOTUs). All viral genomes used in this study were evaluated for contamination with CheckV v1.0.1, and the contamination was removed accordingly.
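As a minimal sketch of how the CheckV summary might be tabulated, the Python snippet below counts records per quality tier. It assumes the subfolder holds CheckV's usual tab-separated `quality_summary.tsv` with `contig_id` and `checkv_quality` columns. The toy rows and IDs are hypothetical and are not results from this study.

```python
import csv
import io
from collections import Counter

def count_quality_tiers(handle):
    """Count records per quality tier in a quality_summary.tsv-style table."""
    reader = csv.DictReader(handle, delimiter="\t")
    return Counter(row["checkv_quality"] for row in reader)

# Toy rows for illustration only (hypothetical IDs and values).
toy = io.StringIO(
    "contig_id\tcheckv_quality\tcompleteness\tcontamination\n"
    "vOTU_a\tComplete\t100.0\t0.0\n"
    "vOTU_b\tMedium-quality\t62.5\t0.0\n"
    "vOTU_c\tMedium-quality\t55.1\t0.0\n"
)
tiers = count_quality_tiers(toy)
print(tiers)
```

To run it on the real output, replace the toy handle with an opened file, for example `open("full_output/checkv_output/quality_summary.tsv")`. That path is an assumption about the layout of the downloaded directory.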

4. genomad_output

This subfolder includes the full outputs of geNomad v1.7.4 (score > 0.7), including the taxonomic assignment and marker gene annotation of intertidal viruses.
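The score cutoff above could be re-applied to a geNomad summary table roughly as follows. This is a sketch that assumes a tab-separated virus summary file with `seq_name` and `virus_score` columns. The function name, toy rows, and sequence names are hypothetical.

```python
import csv
import io

def viruses_above_score(handle, min_score=0.7):
    """Return sequence names whose virus score is strictly above min_score."""
    reader = csv.DictReader(handle, delimiter="\t")
    return [row["seq_name"] for row in reader
            if float(row["virus_score"]) > min_score]

# Toy rows for illustration only (hypothetical names and scores).
toy = io.StringIO(
    "seq_name\tvirus_score\ttaxonomy\n"
    "contig_1\t0.95\tViruses;Duplodnaviria\n"
    "contig_2\t0.70\tViruses\n"
    "contig_3\t0.81\tViruses;Monodnaviria\n"
)
kept = viruses_above_score(toy)
print(kept)
```

The comparison is strict, so a sequence scoring exactly 0.7 is excluded, matching the "score > 0.7" wording.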

5. iphop_output

This subfolder includes the full outputs of iPHoP v1.3.2 (estimated false discovery rate < 10%), including the host prediction of intertidal viruses.
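When a virus has several candidate hosts, one way to summarize the predictions is to keep the highest-scoring genus per virus. The sketch below assumes a comma-separated genus-level prediction table with `Virus`, `Host genus`, and `Confidence score` columns. The toy rows, IDs, and genus labels are hypothetical placeholders.

```python
import csv
import io

def best_host_per_virus(handle):
    """Map each virus to its (host genus, score) pair with the highest score."""
    best = {}
    for row in csv.DictReader(handle):
        virus = row["Virus"]
        score = float(row["Confidence score"])
        if virus not in best or score > best[virus][1]:
            best[virus] = (row["Host genus"], score)
    return best

# Toy rows for illustration only (hypothetical IDs and genus labels).
toy = io.StringIO(
    "Virus,Host genus,Confidence score\n"
    "vOTU_x,g__GenusA,91.0\n"
    "vOTU_x,g__GenusB,95.2\n"
    "vOTU_y,g__GenusC,90.4\n"
)
best = best_host_per_virus(toy)
print(best)
```

Keeping only the top hit is a simplification. Retaining every prediction above the cutoff may be preferable when a virus plausibly infects several genera.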

6. kegg-decoder_output

This subfolder includes the metabolic pathway annotations of microbial operational taxonomic units (mOTUs) belonging to Deltaproteobacteria, Thermodesulfobacteria, and Thaumarchaeota, generated with KEGG-Decoder. The list files give the completeness of each metabolic pathway for each mOTU.

7. viralRefseq_align_output

This subfolder includes the outputs used to identify virus-like genes within intertidal viruses based on the viral RefSeq database.

8. virsorter2_viralrecall_output

This subfolder includes the VirSorter2 ('NCLDV' mode) and ViralRecall outputs for the nucleocytoplasmic large DNA viruses (NCLDVs) identified in this study, and the VirSorter2 ('lavidaviridae' mode) output for the virophages identified in this study.

Note

Sequence information for the vOTUs, mOTUs, and vPCs can be obtained from https://zenodo.org/records/10827260.

For any other code or data inquiries, please open a GitHub issue or contact me: [email protected].
