Giter VIP home page Giter VIP logo

Comments (6)

rbouckaert avatar rbouckaert commented on June 27, 2024

This may be due to the nexus file using X to represent missing values. Both nucleotide and amino acid data types should probably be able to deal with this.

from beast2.

alexeid avatar alexeid commented on June 27, 2024

X is not a valid code for missing data in DNA/RNA. We should stick to established codes. It might be valid for amino acid sequences but that should be confirmed before we allow it.

Perhaps a better error message (i.e. popup saying illegal character in sequence) would be sufficient?

Cheers
Alexei

Sent from my iPhone

On 17/02/2014, at 8:28 AM, rbouckaert [email protected] wrote:

This may be due to the nexus file using X to represent missing values. Both nucleotide and amino acid data types should probably be able to deal with this.


Reply to this email directly or view it on GitHub.

from beast2.

rbouckaert avatar rbouckaert commented on June 27, 2024

http://en.wikipedia.org/wiki/FASTA_format suggests X is used as "masked" for nucleotide, and "any" for amino acid coding. Since "X" is not used for anything else in nucleotide and amino acid encoding in BEAST, we can keep to the same convention as for FASTA, and interpret and "X" the same as a question-mark.

from beast2.

alexeid avatar alexeid commented on June 27, 2024

Lets not introduce our own conventions. IUPAC is the convention for nucleotides.

Alexei

Sent from my iPhone

On 24/03/2014, at 9:20 AM, rbouckaert [email protected] wrote:

http://en.wikipedia.org/wiki/FASTA_format suggests X is used as "masked" for nucleotide, and "any" for amino acid coding. Since "X" is not used for anything else in nucleotide and amino acid encoding in BEAST, we can keep to the same convention as for FASTA, and interpret and "X" the same as a question-mark.


Reply to this email directly or view it on GitHub.

from beast2.

rbouckaert avatar rbouckaert commented on June 27, 2024

How do you suggest to interpret the 'X' in nucleotide alignments allegedly produced by clustalX?
At least the FASTA parser needs converting 'X's.

from beast2.

tgvaughan avatar tgvaughan commented on June 27, 2024

@alexeid seems to be right regarding X not being used this way under the IUPAC nucelotide convention (in fact, it represents "xanthine"). It does represent ambiguity in amino acid sequences though.

Still, if X is defined in FASTA this way, shouldn't the parser read it this way?

from beast2.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.