rdresearch's Issues
BGISEQ500: Picard dedup error
With BGISEQ500 sequencing data, running Picard dedup on the BAM files from multiple lanes fails with the following error:
Exception in thread "main" htsjdk.samtools.util.RuntimeIOException: Read error; BinaryCodec in readmode; file: /path/lane.dedup.realn.recal.bam
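See bcbio/bcbio-nextgen#1392 (comment).
Based on that discussion, the error is most likely caused by a BAM file that was cut off before it was fully written. However, the log shows no problem with BAM generation; the only message is a single warning: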
WARNING 2017-06-07 13:57:57 AbstractDuplicateFindingAlgorithm Default READ_NAME_REGEX '[a-zA-Z0-9]+:[0-9]:([0-9]+):([0-9]+):([0-9]+).*' did not match read name 'CL100009742L1C007R055_331190'. You may need to specify a READ_NAME_REGEX in order to correctly identify optical duplicates. Note that this message will not be emitted again even if other read names do not match the regex.
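One way to test the truncation hypothesis directly is to look for the BGZF end-of-file marker, the fixed 28-byte empty block that the SAM/BAM specification places at the end of every complete BAM file. The sketch below is only an illustration: the function name `has_bgzf_eof` is made up here, and a present marker does not prove that the rest of the file is intact. It only shows that the file was closed properly.

```python
import os

# 28-byte empty BGZF block that terminates a complete BAM file
# (value taken from the SAM/BAM format specification).
BGZF_EOF = bytes.fromhex(
    "1f8b0804 00000000 00ff0600 42430200 1b000300 00000000 00000000"
)


def has_bgzf_eof(path):
    """Return True if the file at `path` ends with the BGZF EOF marker.

    Hypothetical helper for illustration; a missing marker suggests the
    file was truncated while it was being written.
    """
    if os.path.getsize(path) < len(BGZF_EOF):
        return False
    with open(path, "rb") as handle:
        # Jump to the last 28 bytes and compare them with the marker.
        handle.seek(-len(BGZF_EOF), os.SEEK_END)
        return handle.read() == BGZF_EOF
```

Calling `has_bgzf_eof("/path/lane.dedup.realn.recal.bam")` on each per-lane BAM before merging would flag files that were cut off. `samtools quickcheck` performs a similar check from the command line.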
How the difference between BGISEQ500 and ILLUMINA read IDs affects the analysis has yet to be determined.
Sequencing platform | BGISEQ500 | ILLUMINA |
---|---|---|
Read ID | CL100009742L1C001R003_23451 | E00495:53:H772NALXX:6:1101:13393:1450 |
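The difference between the two ID styles can be checked against the default pattern quoted in the warning. The sketch below uses Python's `re.search`, which looks for the pattern anywhere in the name; Picard's own Java matching may behave differently, so this only illustrates that the colon-separated pattern has nothing to latch onto in a BGISEQ500 name. `BGISEQ_NAME` is a hypothetical pattern that assumes the layout flowcell, `L` + lane, `C` + column, `R` + row, `_` + read index. That layout is an assumption inferred from the IDs shown here, and the pattern is not a drop-in value for READ_NAME_REGEX.

```python
import re

# Default pattern as quoted in the Picard warning.
PICARD_DEFAULT = re.compile(
    r"[a-zA-Z0-9]+:[0-9]:([0-9]+):([0-9]+):([0-9]+).*"
)

# Hypothetical pattern for BGISEQ500 names (assumed layout, see lead-in).
BGISEQ_NAME = re.compile(
    r"^([A-Za-z]+[0-9]+)L([0-9]+)C([0-9]+)R([0-9]+)_([0-9]+)$"
)

bgiseq_id = "CL100009742L1C001R003_23451"
illumina_id = "E00495:53:H772NALXX:6:1101:13393:1450"


def default_regex_hits(read_name):
    """Return True if the default pattern is found anywhere in the name."""
    return PICARD_DEFAULT.search(read_name) is not None


def split_bgiseq_name(read_name):
    """Split a BGISEQ500-style name into its assumed components, or None."""
    match = BGISEQ_NAME.match(read_name)
    return match.groups() if match else None


print(default_regex_hits(bgiseq_id))
print(default_regex_hits(illumina_id))
print(split_bgiseq_name(bgiseq_id))
```

Whether the column and row fields are a suitable stand-in for the coordinates that optical-duplicate detection expects is a separate question that this sketch does not answer.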
HiSeq4000: Picard dedup error
Exception in thread "main" htsjdk.samtools.SAMException: Value was put into PairInfoMap more than once. 1: RGID_170523_X167_FCHKL23ALXX_L8_HUMrskEAAAAAAA-13:ST-E00167:8:2103:23155
BGISEQ500: GATK IndelRealigner error
Error encountered while processing BGISEQ500 data:
ERROR MESSAGE: SAM/BAM file SAMFileReader{/path/lane.dedup.bam} is malformed: BAM file has a read with mismatching number of bases and base qualities. Offender: CL100015487L2C006R076_221351 [1 bases] [0 quals]. You can use --defaultBaseQualities to assign a default base quality for all reads, but this can be dangerous in you don't know what you are doing.
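Inspecting the BAM file shows one read that is a single base long but whose base quality is "*". Reads like this need to be filtered out before realignment: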
CL100015487L2C006R076_221351 117 chr1 248458064 0 * = 248458064 0 A * PG:Z:MarkDuplicates RG:Z:RGID_CL100015487_L02_1 AS:i:0 XS:i:0
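A minimal sketch of such a filter, assuming the BAM has been converted to tab-separated SAM text (for example with `samtools view -h`). The function names are made up for illustration. A record is dropped when its SEQ field holds bases but its QUAL field is "*"; header lines and all other records pass through unchanged.

```python
def lacks_base_qualities(sam_line):
    """Return True if a SAM record has bases (SEQ) but no qualities (QUAL)."""
    fields = sam_line.rstrip("\n").split("\t")
    if len(fields) < 11:
        # Not a complete alignment record; leave the decision to the caller.
        return False
    seq, qual = fields[9], fields[10]
    return seq != "*" and qual == "*"


def drop_reads_without_quals(lines):
    """Yield header lines and every record that has base qualities."""
    for line in lines:
        if line.startswith("@") or not lacks_base_qualities(line):
            yield line
```

The generator could be fed from `sys.stdin` and written to `sys.stdout`, so that it sits between `samtools view -h` and `samtools view -b`. Note that it removes only the offending record: the mate of a dropped read stays in the file with its pairing flags unchanged, which may or may not matter to downstream tools.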