Giter VIP home page Giter VIP logo

Comments (6)

ukemi avatar ukemi commented on July 29, 2024

Interesting. In other places in the imports, I've seen enablers that are labeled something like 'unknown ubiquitin ligase'.

from pathways2go.

nataled avatar nataled commented on July 29, 2024

@ukemi at least that restricts the possibilities! Just having 'protein' as enabler, seems to me, would be most accurately interpreted as "any protein will do".

from pathways2go.

deustp01 avatar deustp01 commented on July 29, 2024

But to restrict in a consistent way, we need an ontology structure and curators and users trained in its use. It's probably easier to train curators and users to understand that "unknown protein" means just that: the activity has an enabler whose identity has not yet been discovered, and not that anything can enable it.

from pathways2go.

ukemi avatar ukemi commented on July 29, 2024

Yes, and in the OWL instance world I believe that having protein there means some protein, not all proteins.

from pathways2go.

ukemi avatar ukemi commented on July 29, 2024

@deustp01 If a reaction uses a protein that has isoforms, but you don't know which specific isoform is being used, do you annotate to the generic protein identifier?

from pathways2go.

deustp01 avatar deustp01 commented on July 29, 2024

@ukemi by default we always annotate to the canonical / default isoform specified by UniProt unless there is experimental evidence that specifies the use of a different isoform so, yes, in effect we are annotating to the generic identifier because we haven't really examined the possibility of isoform usage.

This is also our rationale for not routinely making sets of all of the isoforms of a UniProt that should be able to enable a particular function and using that set, instead of a single EWAS instance corresponding to the canonical UniProt isoform as the enabler. There is also a biology issue here like the one for paralogs. We assume that all paralogs / all isoforms are equally competent enablers, by default ignoring differences in tissue- or state-specific expression of these variant form that might be pointing to real differences in function (as in the case of the sets of glycolytic enzymes where all set members have the same catalytic activity but are expressed in different tissues and subject to different regulators of their activity).

This leads to problems when UniProt re-edits a SwissProt entry to change the isoform that is the canonical / default one - then our numbering of positions in the protein sequence, e.g. to indicate start and end coordinates and coordinates of specific modified residues can be thrown off. @nataled 's QA tests have enabled us to clean up (almost) all of the 20-year legacy mess caused by these UniProt - Reactome branchings, and we have just introduced a new QA check whereby any change in the checksum of a UniProt entry (which should be triggered by any of these changes in the identity of the canonical sequence) causes all EWASs / proteoforms that refer to the changed UniProt to be flagged for manual review.

from pathways2go.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.