Giter VIP home page Giter VIP logo

typing-www2018's Introduction

Are All People Married? Determining Obligatory Attributes in Knowledge Bases

© 2018 Jonathan Lajus & Fabian M. Suchanek

An attribute is obligatory for a class in a Knowledge Base (KB), if all instances of the class have the attribute in the real world. For example, hasBirthDate is an obligatory attribute for the class Person, while hasSpouse is not. In this paper, we propose a new way to model incompleteness in KBs. From this model, we derive a method to automatically determine obligatory attributes – using only the data from the KB. Our algorithm can detect such attributes with a precision of up to 90%.

Publication

Full paper at the Web Conference (WWW), 2018

Datasets

We ran our system on two datasets

Experimental Data

For each experiment, we report:

  • The results given by the different algorithms (here);
  • The gold standard used (here);
  • The per-instances evaluation for each methods and relations (here).

Raw results

In "data/Results/results" we provide tab-separated-value files of any class returned by any method, parameter and predicate.

IMPORTANT NOTE: here we only output upper-classes in the taxonomy

"predicate \t class" stands for "class(x) => \exists y. predicate(x, y)" and "predicate-1 \t class" stands for "class(y) => \exists x. predicate(x, y)"

Evaluation

We provide tab-separated-value files with the results of the experiments. They contain for every method, parameter, and predicate:

  • The number of true positives
  • The number of predicted instances
  • The number of instances derived from the gold standard
  • The precision, recall, and F1 per predicate
  • There is also their "NF" (for "New Facts") counterpart where we do not consider any instance already having the predicate in the KB (not used in the paper).

Source Code

The source code in the form of a Maven project can be found in the directory src

Dependencies not available in Maven are in src/lib

Runnables

All the jar files used for the experiments can be found in the run directory.

Refer to this documentation for usage information.

typing-www2018's People

Contributors

lajus avatar

Stargazers

 avatar  avatar  avatar  avatar

Watchers

 avatar  avatar

Forkers

lnnnnnn

typing-www2018's Issues

yago dateset

Hi, I really appreciate your job!

(1) Issues related to the generation of file “uni_yagoICounts50.tsv”

In the original file:
uni_yagoICounts50.tsv:<wordnet_country_108544813> owl:Thing 6754
yago_full_results.tsv: <wordnet_country_108544813> GS Size = 6754

input:
java -Xmx50g -cp lib/*:typing-1.0.jar amie.data.utils.PrintCounts -l 50 uni_yagoTransitiveType.tsv(from http://resources.mpi-inf.mpg.de/yago-naga/yago3.1/yagoTransitiveType.tsv.7z)
Recalculated result:
<wordnet_country_108544813> owl:Thing 49130
Do you have any other treatment for file yagoTransitiveType.tsv ?

(2) Question about file "yagoFacts.tsv"
Written in the paper, Yago dataset, "With this, our dataset contains more than 5 million instances"
But I found a total of 12430701 instances in file yagoFacts.tsv(from http://resources.mpi-inf.mpg.de/yago-naga/yago3.1/yagoFacts.tsv.7z).

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.