Giter VIP home page Giter VIP logo

rp1_aluinsertion's Introduction

RP1_aluinsertion

The linux program grep to search FASTQ files for the 5' junction of between the reference sequence of exon 4 and the beginning of the Alu insertion in RP1 gene.

Most files without the insertion return a count of "0" though rarely a false-positive read count of 1 or 2 detected in minority of wildtype samples depending on the depth of coverage and the method of sequencing such as targeted, whole exome, or whole genome.

In the datafolder to execute this command to grep Alu-insertion in exon 4 of RP1

This RP1 Alu insertion with other truncated variant has been known to cause autosomal recessive macular dystrophy with retinitis pigmentosa.

This Alu insertion in RP1 gene seems to be frequently observed in East Asian (Korean, Japanese, ...), probably founder mutation.

Prerequisites (Ubuntu)

GNU parallels and python is required.

The name of fastq.gz files should be ${Sample}_R1.fast.gz and ${Sample}_R2.fastq.gz

sudo apt install moreutils
sudo apt install parallel
sudo apt-get install agrep

Install

git clone https://github.com/jin0008/RP1_aluinsertion.git

Set your PATH permanently

export PATH="$PATH:/where/to/RP1_aluinsertion"

To run

Run this bash script in the directory with fastq.gz files.

This code runs with multiple samples.

bash RP1alu.sh

Interpretation of results (See. sample.RP1_Alu.txt files)

VAF < 0.1 : No AluY insertion was found in exon 4 of RP1 at 8:55540494 position (hg19)

0.1 <= VAF < 0.3 : AluY insertion was suspected in exon 4 of RP1 at 8:55540494 position (hg19). Please recheck by visualizing AluY sequence at 8:55540494 position (hg19).

VAF >= 0.3 : AluY insertion was detected in exon 4 of RP1 at 8:55540494 position (hg19).

alt text

Or you can directly check this common Alu insertion in Integative Genomics Viewer (Broad Institute)

hg19 position: chr8:55540494

hg38 position: chr8:54627934

Please turn on "show sof-clipped bases" in the Preference-Alignments of Integrative Genomics Viewer.

alt text

alt text

R code to allow one substitution within 13 bp upstream to the RP1-Alu

To allow one substitution within 13 bp upstream to the RP1-Alu, agrep R code is used to detect such rare instance.

This R code has more flexible to find Alu insertion in patients who has rare variants within insertion point.

However, above bash script is much faster than R agrep code.

Running time will be reduced if you ungzip fastq.gz files before running.

Run this R code in the directory with fastq.gz files.

This code runs with multiple samples.

alugrep {number of thread}

if 24 threads are available

alugrep 24

In referring to work done using RP1 alu grep program, please cite:

Won D, Hwang JY, Shim Y, Byeon SH, Lee J, Lee CS, Kim M, Lim HT, Choi JR, Lee ST, Han J. In Silico identification of a common mobile element insertion in exon 4 of RP1. Sci Rep. 2021 June 28;11(1):13381.

rp1_aluinsertion's People

Contributors

jin0008 avatar

Watchers

 avatar  avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.