Giter VIP home page Giter VIP logo

mscproject's Introduction

MScProject

"An examination of genomic resistance and virulence of Escherichia marmotae identified in deer faeces Portumna, Galway."

Scripts

This repository holds scripts used for a masters project which involved the assembly of 11 strains of Escherichia coli and Escherichia marmotae, and the investigation of their virulence and resistance gene content.

The scripts in this repository were used for the following:

  1. assembly_pipeline.sh - This bash script was used to trim raw paired-end reads in .fastq.gz files and assemble trimmed reads using the shovill pipeline. It also runs prokka for annotation of the assembly, quast for assemby quality statistics and ABRicate and AMRFinder for the identification of resistance, virulence and plasmid genes from assemblies.

  2. Project_Heatmaps.Rmd This is an R markdown document containing the Copy_number_matrix() function used in R to format ABRicate and AMRFinder outputs into copy number matrices. It also contains the commands where these matrices were used as input for the pheatmap() function to create the heatmaps available as supplementary figures in the manuscript. It requires names_total.csv and heatmaps_metadata.csv to correctly name strains in the matrices and provide metadata for heatmaps.

  3. Plasmid_contig_genes.Rmd This is an R markdown document containing the Plasmid_contents() function used in R to find virulence or resistance genes on the same sequence as a plasmid gene using the ABRicate PlasmidFinder data and another ABRicate or AMRFinder dataset. It also contains the commands used to get the output that was used for supplememtary table 2 of the manuscript. It requires names_total.csv to correctly name strains.

  4. database_amr.sh was a bash script used to run abricate and AMRFinder for .gff, .fna and .faa files associated with genomes downloaded on NCBI, and database_prokka.sh was a bash script used to re-annotate the E. marmotae genomes downloaded from NCBI to get the input .gff files used for Roary.

Commands

Outside of these scripts, the following commands were used to run other software:

  1. Running Roary: singularity run ../scripts/roary.img roary -p 8 roary/*gff

  2. Running Parsnp: singularity run ../scripts/parsnp.img parsnp -r ../phylogeny/assembled/HT073016-2_annotated.fna -d ../phylogeny/assembled/

mscproject's People

Contributors

amyobro avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.