Giter VIP home page Giter VIP logo

i5knal_ogs's People

Contributors

taiwaness avatar

Watchers

 avatar  avatar

Forkers

nal-i5k

i5knal_ogs's Issues

[intra-model] Error detection based on multiple gff lines within a model

Relatively easy to re-implement

  • [multiple gff lines of intra-model] Redundant gene length (Ema0001)
  • [multiple gff lines of intra-model] Internal stops in amino acid sequence (Ema0002)
  • [multiple gff lines of intra-model] child feature not within parent boundaries (Ema0003)
  • [multiple gff lines of intra-model] Incomplete gene feature (Ema0004)
  • [multiple gff lines of intra-model] pseudogene has unusual child features, such as mRNA (Ema0005)

Future improvement for I5KNAL_OGS/lib/gff3_to_fasta

Translator method in the current version of gff3_to_fasta (v0.0.1) only considers standard codons and universal stop codons for generating peptide sequences. However, the genetic codes used for translation might be different for different species. Therefore, as suggested by @childers, it will be great if the translator method in this program can further consider to incorporate multiple codon tables for multiple species.

NCBI has compiled related information for codon tables for different species on April 30, 2013 (The Genetic Codes). These suggested codon tables can be incorporated into this program in the future.

Before this program incorporated this improvement, a list of other translation tools is suggested for users: http://molbiol-tools.ca/Translation.htm

Two tools with translation function having options for choosing genetic codes are recommended:

  • ExPASy โ€“ Translate tool (ExPASy, University of Geneva, Switzerland). This site useful if users have a gene which begins with an alternative start codon.
  • Translate Nucleic Acid Sequence Tool (University of Massachusetts Medical School, U.S.A.) which permits choice of reading frame(s) and genetic code.

[inter-model] Error detection for multiple gff lines across models

Relatively hard to re-implement

  • [multiple gff lines of inter-models] Duplicate transcripts/exon/CDS (Emr0001)
  • [multiple gff lines of inter-models] merged gene parent (Emr0002)
  • [multiple gff lines of inter-models] split gene parent (Emr0003)
  • [multiple gff lines of inter-models] distant isoforms (Emr0004)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.