A vignette of how to analyze CUT&RUN data for the Hiebert Lab
java -classpath {SOFTWARE_DIR}/trimmomatic-0.39.jar org.usadellab.trimmomatic.TrimmomaticPE -phred33 -threads 8 \
5176-MB-1-TCGGATTC-CGCAACTA_S01_L005_R1_001.fastq.gz 5176-MB-1-TCGGATTC-CGCAACTA_S01_L005_R2_001.fastq.gz \
5176-MB-1_R1.nodap.paired.txt 5176-MB-1_R1.noadap.unpaired.txt 5176-MB-1_R2.noadap.paired.txt 5176-MB-1_R2.noadap.unpaired.txt \
ILLUMINACLIP:TruSeq_CD_adapter.txt:2:30:7 LEADING:15 TRAILING:15 MINLEN:15
nohup java -classpath {SOFTWARE_DIR}/trimmomatic-0.39.jar org.usadellab.trimmomatic.TrimmomaticPE -phred33 -threads 8 \
5176-MB-1-TCGGATTC-CGCAACTA_S01_L005_R1_001.fastq.gz 5176-MB-1-TCGGATTC-CGCAACTA_S01_L005_R2_001.fastq.gz \
5176-MB-1_R1.nodap.paired.txt 5176-MB-1_R1.noadap.unpaired.txt 5176-MB-1_R2.noadap.paired.txt 5176-MB-1_R2.noadap.unpaired.txt \
ILLUMINACLIP:TruSeq_CD_adapter.txt:2:30:7 LEADING:15 TRAILING:15 MINLEN:15 > 5176-trim-1.out &
From CUT&RUN protocol.io For mapping spike-in fragments, we also use the --no-overlap --no-dovetail options to avoid cross-mapping of the experimental genome to that of the spike-in DNA.(https://www.protocols.io/view/cut-amp-run-targeted-in-situ-genome-wide-profiling-14egnr4ql5dy/v3?step=113)
local: Local alignment searches for the best alignment of a substring of the input sequence. While it can find an alignment for the entire sequence, if another, shorter, alignment has a higher score, it will be chosen. End-to-end will compute the score over the entire matching of the input sequence and its alignment with the reference. If there are adapters/long mismatches/indels etc. the local will work best. If you have a good reason to believe that the input sequence should be fully matched to the reference, then select end-to-end
no-mixed: By default, when bowtie2 cannot find a concordant or discordant alignment for a pair, it then tries to find alignments for the individual mates. This option disables that behavior.
no-discordant: A discordant alignment is an alignment where both mates align uniquely, but that does not satisfy the paired-end constraints
bowtie2 -p 8 --local --very-sensitive-local --no-unal --no-mixed --no-discordant --phred33 -I 10 -X 700 --no-overlap --no-dovetail -x {GENOME_DIR}/hg19_ec \
-1 4617-MB-1_R1.noadap.paired.txt -2 4617-MB-1_R2.noadap.paired.txt -S 4617-MB-1.hg19scer.sam
##### sam to bam file
samtools view -S -b -@ 14 ${ALIGN}/9631-MB-${i}.hg19ec.sam -o ${BAM}/9631-MB-${i}.hg19ec.bam
##### read quality filter
samtools view -b -F 4 -q 10 -@ 14 ${BAM}/9631-MB-${i}.hg19ec.bam -o ${BAM}/9631-MB-${i}.hg19ec.F4q10.bam
##### sort
samtools sort -@ 12 ${BAM}/9631-MB-${i}.hg19ec.F4q10.bam -o ${BAM}/9631-MB-${i}.hg19ec.F4q10.sorted.bam
##### index
samtools index ${BAM}/9631-MB-${i}.hg19ec.F4q10.sorted.bam
##### selecting only the human chromosomes minus chrM
samtools view -@ 12 -bh ${BAM}/9631-MB-${i}.hg19ec.F4q10.sorted.bam chr1 chr2 chr3 chr4 chr5 chr6 chr7 chr8 chr9 chr10 chr11 chr12 chr13 chr14 chr15 chr16 chr17 chr18 chr19 chr20 chr21 chr22 chrX chrY > ${BAM}/9631-MB-${i}.hg19.F4q10.sorted.bam
##### index
samtools index ${BAM}/9631-MB-${i}.hg19.F4q10.sorted.bam