Scripts for sequence processing and R analysis 16s amplicon sequencing of fecal samples from 150 cows on 15 different farms across california commerical diaries.
- For Submission to NCBI is a file that describes the demultiplexing of raw sequences and prepping them for NCBI submission.
- R_session_info is the output of R session to show all packages used and their version.
- Scripts_for_processing contains filtering steps (phyloseq), beta & alpha diversity and differential abundance testing (DESeq2). Figures 1-5 and Supplemental tables 1-3.
- Sequence coverage file are the scripts to describe the coverage of sequencing for this data set.
- Controls file are the scripts to show taxa in control samples.
- MaSaLin Analysis is the R scripts used to run and generate output from MaSaLin (Fig 6 & 7, Supplemental Table 4).
- Pathogen_Tree are qiime and other bash scripts used to make a reference tree of full lenght 16S rRNA sequences from RDP database.
- Seq_info.csv contains the sequence names and taxa names from RDP database used to make the references tree of 16S sequences for pathogens.
- Full_Tree.xml is the tree of near full length 16S rRNA sequences of pathogens of interest with reads from data set placed on the tree.
Paper is found here