Comments (2)
@muhammadaliqasmi I discussed this metric with @nfriesen and will give you a more detailed guide for its implementation below. Please stop reading at the horizontal line. @jerdeb below that line there are some further questions that still need clarification (by you looking into the literature or even talking to the use case partners).
This metric is actually very similar to UndefinedClassesOrProperties. We need to compute the ratio of resources referenced in our dataset that are defined in ontologies that are members of the OBO Foundry.
@jerdeb's comment above that “we need to check only for the the type of an instance” (i.e. for the objects of triples whose predicate is rdf:type
) is wrong because there are OBO Foundry ontologies (e.g. http://svn.code.sf.net/p/obi/code/releases/2014-03-29/obi.owl) that also define properties and individuals, which means that we need to do the check described below for all predicates and objects of all triples in our dataset. (By the reasoning of #31 I think we can skip the subjects.)
Therefore, @muhammadaliqasmi, I think we can do the following:
- check whether the URIs start with a string that's, for now, in a hard-coded list, which for now contains
http://purl.obolibrary.org/obo/
as the only element. - check, using VocabularyReader, and similarly to UndefinedClassesOrProperties metric, whether the class/property with this URI is defined.
From this we know that
- the resource that our dataset reuses is defined in an ontology, …
- … which is a member of the OBO Foundry.
E.g., for http://purl.obolibrary.org/obo/OBI_9991118, this is the case.
@jerdeb open questions start here:
Should this metric return "true or false", or a ratio (i.e. how of the classes/properties (also individuals?) used in a dataset are from ontologies in the OBO Foundry). "ratio" makes more sense IMHO, as the question with "true or false" is when we should return true: if all classes/properties/individuals that our dataset reuses are from OBO Foundry ontologies, or if some of them are?
Secondly, is the above “OBO Foundry membership check” actually right? Dereferencing http://purl.obolibrary.org/obo/OBI_9991118 takes me to http://www.ontobee.org/browser/rdf.php?o=OBI&iri=http://purl.obolibrary.org/obo/OBI_9991118 (and it also gives some meaningful RDF/XML to a linked data client). So I think our job is to see whether the stuff we can download by dereferencing that URI has a certain structure that looks like “OBO Foundry”.
I don't know whether this mechanism
- fails for ontologies in the list of “candidate ontologies and other ontologies of interest” on http://www.obofoundry.org/. These are less reputable ontologies, and we'd like our metric to “fail” on them.
- works with those ontologies that are written in the language OBO, not in OWL. I know OBO is somehow compatible to OWL, but I don't know what URIs the stuff declared in these OBO ontologies has, and whether the stuff from OBO is also published as linked data.
from quality.
This metric detects non reputable resources by retrieving URI of resources from data sets and prefix match with "http://purl.obolibrary.org/obo/".
Metric value = (total number of NOT reputable resources ) / ( total number of resources )
Metric Value Range : [0 - 1] , Best Case : 0, Worst Case : 1
(for further changes/improvements, kindly re-open this issue).
.
from quality.
Related Issues (20)
- Consider splitting UndefinedClassesOrProperties metric in two
- First draft of D3.2 for internal reviewers
- Quality Problems Reporting in EmptyAnnotationValue, LabelsUsingCapitals, WhitespaceInAnnotation HOT 3
- Implement Problem Reporting for SPARQLAccessibility, RDFAccessibility metrics HOT 3
- Consider centralising some implementation that all metrics have in common HOT 1
- OntologyHijacking Metric
- Review Dereferencibility, NoDereferencedBackLinks/ForwardLinks, UnstructuredData
- Consistent metric names HOT 2
- Publish proper qprob ontology HOT 1
- RDF/XML vocabularies are outdated
- daQ and other vocabularies are not really published in a LOD-conforming way
- ProblemList issues.
- Completeness metrics against gold standard
- User-friendly labels for metrics
- Prepare for Y2 review meeting
- Define collaboration with crawling
- Approximate metric implementations are called "estimate"
- Ask question
- maven compile errors HOT 2
- FileNotFoundException HOT 5
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from quality.