Giter VIP home page Giter VIP logo

Comments (2)

clange avatar clange commented on August 12, 2024

@muhammadaliqasmi I discussed this metric with @nfriesen and will give you a more detailed guide for its implementation below. Please stop reading at the horizontal line. @jerdeb below that line there are some further questions that still need clarification (by you looking into the literature or even talking to the use case partners).

This metric is actually very similar to UndefinedClassesOrProperties. We need to compute the ratio of resources referenced in our dataset that are defined in ontologies that are members of the OBO Foundry.

@jerdeb's comment above that “we need to check only for the the type of an instance” (i.e. for the objects of triples whose predicate is rdf:type) is wrong because there are OBO Foundry ontologies (e.g. http://svn.code.sf.net/p/obi/code/releases/2014-03-29/obi.owl) that also define properties and individuals, which means that we need to do the check described below for all predicates and objects of all triples in our dataset. (By the reasoning of #31 I think we can skip the subjects.)

Therefore, @muhammadaliqasmi, I think we can do the following:

  1. check whether the URIs start with a string that's, for now, in a hard-coded list, which for now contains http://purl.obolibrary.org/obo/ as the only element.
  2. check, using VocabularyReader, and similarly to UndefinedClassesOrProperties metric, whether the class/property with this URI is defined.

From this we know that

  • the resource that our dataset reuses is defined in an ontology, …
  • … which is a member of the OBO Foundry.

E.g., for http://purl.obolibrary.org/obo/OBI_9991118, this is the case.


@jerdeb open questions start here:

Should this metric return "true or false", or a ratio (i.e. how of the classes/properties (also individuals?) used in a dataset are from ontologies in the OBO Foundry). "ratio" makes more sense IMHO, as the question with "true or false" is when we should return true: if all classes/properties/individuals that our dataset reuses are from OBO Foundry ontologies, or if some of them are?

Secondly, is the above “OBO Foundry membership check” actually right? Dereferencing http://purl.obolibrary.org/obo/OBI_9991118 takes me to http://www.ontobee.org/browser/rdf.php?o=OBI&iri=http://purl.obolibrary.org/obo/OBI_9991118 (and it also gives some meaningful RDF/XML to a linked data client). So I think our job is to see whether the stuff we can download by dereferencing that URI has a certain structure that looks like “OBO Foundry”.

I don't know whether this mechanism

  • fails for ontologies in the list of “candidate ontologies and other ontologies of interest” on http://www.obofoundry.org/. These are less reputable ontologies, and we'd like our metric to “fail” on them.
  • works with those ontologies that are written in the language OBO, not in OWL. I know OBO is somehow compatible to OWL, but I don't know what URIs the stuff declared in these OBO ontologies has, and whether the stuff from OBO is also published as linked data.

from quality.

muhammadaliqasmi avatar muhammadaliqasmi commented on August 12, 2024

This metric detects non reputable resources by retrieving URI of resources from data sets and prefix match with "http://purl.obolibrary.org/obo/".

Metric value = (total number of NOT reputable resources ) / ( total number of resources )

Metric Value Range : [0 - 1] , Best Case : 0, Worst Case : 1

(for further changes/improvements, kindly re-open this issue).

.

from quality.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.