Comments (17)
What do the files in ths /mnt/lustre/references/speedseq/annotations/cnvnator_chroms
directory look like? Do your chromosomes have "chr" prefixes by any chance? SpeedSeq sv currently doesn't support chromosome prefixes because cnvnator's names are rigid, so I'm not sure how it will behave with them.
from speedseq.
Ah that's a good point. We're using hg19 so they are chr1.fa etc.
I've used the standalone version of CNVnator previously with hg19 formatted chromosomes and its worked fine. Perhaps it may be to do with the changes made to the CNVnator-multi program supplied with SpeedSeq?
from speedseq.
yeah, quite possibly the case. It comes down to this file mainly. I think Alexej may have modified it for more flexibility since the speedseq implementation was written.
from speedseq.
Hi Colby, sorry for the delay.
I've downloaded GRCh37 and set it up as per the user guide. Will be doing some testing over the next few days so shall update this ticket with new information in due course. Hopefully it'll be resolved by the corrected naming as you suspect.
from speedseq.
Hi Colby,
I've been able to run this today and no longer have the chromosome naming error.
However, what we now have is what looks like a root error (output below):
/opt/gridware/pkg/apps/speedseq/0.0.3a/gcc-4.4.6+root-5.34.30+samtools-0.1.19+python-2.7.3/bin/cnvnator-multi: error while loading shared libraries: libCore.so.5.34: cannot open shared object file: No such file or directory
Should I open a new ticket or do you think this related?
from speedseq.
I think this is a different issue, I've opened a new ticket here #44
from speedseq.
As per #44;
Command I used to run speedseq sv:
qsub -V -e e -o o -cwd -b Y -N speedseq_sv_1 speedseq sv -R /mnt/lustre/references/speedseq/GRCh37/grch37.fasta -o ERR194146 -g -k -d -v \
-B /mnt/archive/analysis/projects/HiSeq/CNV_Genome_Project/External_Data/Illumina_Platinum_Genomes/NA12877/bam_fromfastq_dl/ERR194146.bam \
-S /mnt/archive/analysis/projects/HiSeq/CNV_Genome_Project/External_Data/Illumina_Platinum_Genomes/NA12877/bam_fromfastq_dl/ERR194146.splitters.bam \
-D /mnt/archive/analysis/projects/HiSeq/CNV_Genome_Project/External_Data/Illumina_Platinum_Genomes/NA12877/bam_fromfastq_dl/ERR194146.discordants.bam
Error message:
Traceback (most recent call last):
File "/opt/gridware/pkg/apps/speedseq/0.0.3a/gcc-4.4.6+root-5.34.30+samtools-0.1.19+python-2.7.3/bin/cnvnator_wrapper.py", line 350, in <module>
chroms_list = get_chroms_list(args.bam)
File "/opt/gridware/pkg/apps/speedseq/0.0.3a/gcc-4.4.6+root-5.34.30+samtools-0.1.19+python-2.7.3/bin/cnvnator_wrapper.py", line 153, in get_chroms_list
proc = subprocess.Popen(['samtools', 'view', '-H', bam_fn], stdout = subprocess.PIPE)
File "/opt/gridware/pkg/apps/python/2.7.3/gcc-4.4.6/lib/python2.7/subprocess.py", line 679, in __init__
errread, errwrite)
File "/opt/gridware/pkg/apps/python/2.7.3/gcc-4.4.6/lib/python2.7/subprocess.py", line 1249, in _execute_child
raise child_exception
OSError: [Errno 2] No such file or directory
The structure of my references directory is as follows:
ls -l /mnt/lustre/references/speedseq/GRCh37/
-rw-rw-r-- 1 sb2 bioinfo 3147288981 Aug 25 14:40 grch37.fasta
-rw-rw-r-- 1 sb2 bioinfo 6563 Aug 26 16:18 grch37.fasta.amb
-rw-rw-r-- 1 sb2 bioinfo 869 Aug 26 16:18 grch37.fasta.ann
-rw-rw-r-- 1 sb2 bioinfo 3095694072 Aug 26 16:17 grch37.fasta.bwt
-rw-rw-r-- 1 sb2 bioinfo 713 Aug 27 13:06 grch37.fasta.fai
-rw-rw-r-- 1 sb2 bioinfo 773923497 Aug 26 16:18 grch37.fasta.pac
-rw-rw-r-- 1 sb2 bioinfo 1547847040 Aug 26 16:30 grch37.fasta.sa
ls -l /mnt/lustre/references/speedseq/annotations/cnvnator_chroms/
-rw-rw-r-- 1 sb2 bioinfo 137793664 Aug 25 13:35 10.fa
-rw-rw-r-- 1 sb2 bioinfo 137256629 Aug 25 13:35 11.fa
-rw-rw-r-- 1 sb2 bioinfo 136082764 Aug 25 13:35 12.fa
-rw-rw-r-- 1 sb2 bioinfo 117089380 Aug 25 13:35 13.fa
-rw-rw-r-- 1 sb2 bioinfo 109138703 Aug 25 13:35 14.fa
-rw-rw-r-- 1 sb2 bioinfo 104240253 Aug 25 13:35 15.fa
-rw-rw-r-- 1 sb2 bioinfo 91860670 Aug 25 13:35 16.fa
-rw-rw-r-- 1 sb2 bioinfo 82548468 Aug 25 13:35 17.fa
-rw-rw-r-- 1 sb2 bioinfo 79378540 Aug 25 13:35 18.fa
-rw-rw-r-- 1 sb2 bioinfo 60114471 Aug 25 13:35 19.fa
-rw-rw-r-- 1 sb2 bioinfo 253404802 Aug 25 13:35 1.fa
-rw-rw-r-- 1 sb2 bioinfo 64075950 Aug 25 13:35 20.fa
-rw-rw-r-- 1 sb2 bioinfo 48932064 Aug 25 13:35 21.fa
-rw-rw-r-- 1 sb2 bioinfo 52159647 Aug 25 13:35 22.fa
-rw-rw-r-- 1 sb2 bioinfo 247252699 Aug 25 13:35 2.fa
-rw-rw-r-- 1 sb2 bioinfo 201322807 Aug 25 13:35 3.fa
-rw-rw-r-- 1 sb2 bioinfo 194340184 Aug 25 13:35 4.fa
-rw-rw-r-- 1 sb2 bioinfo 183930518 Aug 25 13:35 5.fa
-rw-rw-r-- 1 sb2 bioinfo 173966988 Aug 25 13:35 6.fa
-rw-rw-r-- 1 sb2 bioinfo 161790978 Aug 25 13:35 7.fa
-rw-rw-r-- 1 sb2 bioinfo 148803426 Aug 25 13:35 8.fa
-rw-rw-r-- 1 sb2 bioinfo 143566992 Aug 25 13:35 9.fa
-rw-rw---- 1 sb2 bioinfo 16849 Aug 25 14:38 M.fa
-rw-rw---- 1 sb2 bioinfo 157858406 Aug 25 14:36 X.fa
-rw-rw---- 1 sb2 bioinfo 60363129 Aug 25 14:36 Y.fa
Is there a key file that I am missing?
from speedseq.
is samtools in your path? also, can you should the header of /mnt/archive/analysis/projects/HiSeq/CNV_Genome_Project/External_Data/Illumina_Platinum_Genomes/NA12877/bam_fromfastq_dl/ERR194146.bam
from speedseq.
Ah, samtools wasn't loaded at the time. It looks like it isn't a linked dependency to speedseq qhen I load that on our cluster.
Here's my bam header for you; chromosome numbering looks fine to me?
@HD VN:1.3 SO:coordinate
@SQ SN:1 LN:249250621
@SQ SN:2 LN:243199373
@SQ SN:3 LN:198022430
@SQ SN:4 LN:191154276
@SQ SN:5 LN:180915260
@SQ SN:6 LN:171115067
@SQ SN:7 LN:159138663
@SQ SN:8 LN:146364022
@SQ SN:9 LN:141213431
@SQ SN:10 LN:135534747
@SQ SN:11 LN:135006516
@SQ SN:12 LN:133851895
@SQ SN:13 LN:115169878
@SQ SN:14 LN:107349540
@SQ SN:15 LN:102531392
@SQ SN:16 LN:90354753
@SQ SN:17 LN:81195210
@SQ SN:18 LN:78077248
@SQ SN:19 LN:59128983
@SQ SN:20 LN:63025520
@SQ SN:21 LN:48129895
@SQ SN:22 LN:51304566
@SQ SN:X LN:155270560
@SQ SN:Y LN:59373566
@SQ SN:M LN:16569
@RG ID:ERR194146 SM:ERR194146
@PG ID:bwa PN:bwa CL:/opt/gridware/pkg/apps/bwa/0.7.8/gcc-4.4.6/bin/bwa mem -t 1 -M -R @RG\tID:ERR194146\tSM:ERR194146 /mnt/lustre/references/speedseq/GRCh37/grch37.fasta ../fastq_dl/ERR194146_1.subset.fastq ../fastq_dl/ERR194146_2.subset.fastq VN:0.7.8-r455
from speedseq.
yeah, the samtools thing is an oversight, i didn't realize that it was being called.
BAM header looks fine, is the problem solved if you add samtools to your path?
from speedseq.
Looks OK so far!
I'll be able to check it all on Tuesday (long weekend here in the UK) and hopefully everything will have run hunky dory and I'll be able to close the task for you.
Thanks (again!) for the troubleshooting help. 😄
from speedseq.
sounds good, appreciate the detailed bug report. i'll change speedseq to explicitly use the embedded sambamba rather than the samtools in PATH
from speedseq.
OK, cool. I've asked our syadmin to pop samtools into the load path for speedseq for the time being so that should make sure I don't see it again.
from speedseq.
Hi Colby, sorry for the late reply.
With samtools in my PATH its worked fine.
I've got an Error (below) which I think is related to me qsubbing the jobs to our cluster via ssh (rather than running speedseq natively). Could you confirm this, and explain what I'm missing out on. This happens with ssh -X (my default) and ssh -Y.
Error in <TGClient::TGClient>: can't open display "localhost:11.0", switchingt to batch mode...
In case you run from a remote ssh session, reconnect with ssh -Y
from speedseq.
I also see this error when we submit speedseq jobs through LSF. I'm not sure what CNVnator is attempting to do with X11, but as far as I can tell, you can ignore this message. Does your output complete and look correct?
from speedseq.
Yeah that makes sense. Everything else looked fine so I'm definitely at the
point where it's working as intended.
Thanks for the help again!
On Wed, 2 Sep 2015 at 16:40, Colby Chiang [email protected] wrote:
I also see this error when we submit speedseq jobs through LSF. I'm not
sure what CNVnator is attempting to do with X11, but as far as I can tell,
you can ignore this message. Does your output complete and look correct?—
Reply to this email directly or view it on GitHub
#40 (comment).
from speedseq.
@cc2qe you can mark this issue closed. Following all the helpful comments has resolved the error.
from speedseq.
Related Issues (20)
- compilation of freebayes module fails on Ubuntu 18.04 HOT 2
- Error -running speedseq SV - missing pair end parameters:mean stdev
- Parallel link is dead HOT 1
- problems about vcf files.
- Warning: --split_bam (-S) is deprecated
- speedseq var and sv only on genomic regions
- Read block operation failed with error HOT 1
- is there a container? HOT 2
- Linker error when doing `make cnvnator`
- -M parameter for BWA MEM alignment
- error: static assertion failed: comparison object must be invocable as const
- Bam file not generated using speedseq align
- mismatched line lengths at line 3 within sequence
- Speedseq is runnning infinitely HOT 1
- [Question] Is ARM64 supported ? HOT 1
- configure: error: cannot guess build type; you must specify one
- error: cannot guess build type; you must specify one
- make :can not find -lCore -lRIO -lHist -lGraf -lGpad -lTree -lMathCore
- Are CRAM files supported by SpeedSeq?
- Compile Errors: *** [Makefile:59: cnvnator] Error 1
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from speedseq.