Giter VIP home page Giter VIP logo

chm13-issues's People

Contributors

aphillippy avatar arangrhie avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

chm13-issues's Issues

CHM13.GFF3.V4 annotations cause the phase error in bcftools

Hello,

I have issues with gff annotation using bcftools. At first, I used the following command
bcftools csq -f chm13_grch38_Y_added.fa -g chm13.draft_v1.1.gene_annotation.v4.gff3 ../longread/samtools/T15/T15_chm13_sorted.vcf -O z -o T15_chm13_sorted_annotated.vcf
then got the error message saying Could not parse the line, "Parent=transcript:" not present so I adjusted gff file using zcat chm13.draft_v1.1.gene_annotation.v4.gff3.gz \ | gawk 'BEGIN{IGNORECASE=0} $0 ~/^#/ || ($3 != "gene" && $3 != "transcript" ) { $0=gensub(/Parent=(.+?_T[^;]+)/, "Parent=transcript:\\1", "g", $0); print; next} $0 !~/^#/ { gsub(/ID=/, "ID="$3":"); $0=gensub(/Parent=(.+?_G[^;]+)/, "Parent=gene:\\1", "g", $0); print }' \ | bgzip -c > chm13.draft_v1.1.gene_annotation.v4.fix.gff3.gz to fix the error message.
But then I still get the phase error message
Error: GFF3 assumption failed for transcript CHM13_T0000003, CDS=111940: phase!=len%3 (phase=2, len=379)
and got the following message after --force running it.
Warning: GFF3 assumption failed for transcript CHM13_T0110841, CDS=23883573: phase!=len%3 (phase=0, len=607)
If bcftools seems to cause phase errors which tool do you recommend to use to annotate variant calls on CHM13?

Seems like others are experiencing the same issue here

marbl/CHM13#31

Cheers,
DK

Excessive clipping in Winnowmap alignments

We found excessive clipping in the Winnowmap alignment, particularly in the HSat2 and 3 arrays.
We are investigating this using alternative alignment methods and expect to have an update to the issues.bed track.

how to merge SV callings from Pariament2 and sinffle results

I had two SV Calling resultings from Pariament2 and sinffle software showing in blow:
Pariament2:
image
sinffle:
image
Frist,there are no base information instead of “DEL001207SUR” in the pariament2 resulting,how to transform it into base format just like sinffle resulting.
Second,how to merge two SV calling resulting according to your SVs format showing blow?
image

potential issues in chm13.draft_v1.0.fasta identified by using another independent assembly

I generate an assembly with Peregrine2021 (0.4.1, main:eb7a2dc+) and find it may be useful for identifying some potential issues in the current T2T assembly although it may be equally likely to be issues in my assembly results.

  • Method summary:
    • Generate independent assembly (not using T2T’s overlaps and graphs to avoid ascertainment biases)
    • Try to separate the assembly results to two haplotypes assembly-wise
    • Call variants with dipcall, generate a VCF file for the variants seen between the assembly and T2T chm13.draft_v1.0.fasta
    • Going through some dense variant call clusters and check if there are alignment anomaly manually
  • For the pick 10 regions picked, four have nearby low coverage alignment signal from the bed files. Six do not. The alignment files in the IGV used are from markerandlength.idxsession (from Arang, the polishing team):
  • I don’t have a systematic to generate a bed from from the VCF files that I have. However, if anyone is interesting, I can provide the VCF file.

Regions:
chr1:132,379,149-132,387,405
chr2:91,194,176-91,258,718
chr9:65,106,334-65,426,773
chr9:68,183,834-68,263,943
chr9:74,317,466-74,319,968
chr10:37,180,330-37,252,004
chr11:18,996,964-19,068,834
chr12:35,546,839-35,617,750
chr13:9,812,837-9,873,599
chr16:42,993,455-43,044,690

image

image

image

image

image

image

image

image

image

image

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.