THE ACL RD-TEC 2.0: A dataset for evaluation of term and entity recognition in computational linguistics

#What does this repository contain?

The dataset is organised in the following directories:

  • `readme.txt': this file.
  • `/documents': contains the annotation guidelines as well as a paper that describes the annotation process.
  • `/annotations_during_guideline_dev': materials related to the development of the guidelines, including the files annotated during this process and the guidelines used by the annotators.
  • `/annoitation_files': contains all annotated files, grouped per annotator; for ease of use, the files annotated by both annotators are collected again in the directory double_annotated_files.
  • `/pos_tagged_vertical_files': annotation files converted to the familiar vertical format (i.e., one token or annotation tag per line). These files also contain automatically obtained part-of-speech tags and lemmas, produced with the Stanford CoreNLP library.
  • `/raw_abstract_txt': contains the abstract text files, segmented and corrected for OCR errors. These files do not contain any annotation.
  • `/licenses': license files.
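
As a rough illustration of how the vertical files might be read, here is a minimal Python sketch. The exact layout of the files is not described in this readme, so the sketch rests on assumptions: annotation tags are taken to be XML-like lines starting with `<`, and token lines are taken to be tab-separated columns in the order token, part-of-speech tag, lemma. The function name `read_vertical`, the `VerticalLine` record and the sample text are hypothetical; check a real file from `/pos_tagged_vertical_files' before relying on any of this.

```python
from typing import Iterable, Iterator, NamedTuple, Optional


class VerticalLine(NamedTuple):
    kind: str             # "tag" for an annotation tag line, "token" otherwise
    text: str             # the tag itself, or the token's surface form
    pos: Optional[str]    # part-of-speech tag, if that column is present
    lemma: Optional[str]  # lemma, if that column is present


def read_vertical(lines: Iterable[str]) -> Iterator[VerticalLine]:
    """Yield one record per non-empty line of a vertical file.

    ASSUMPTIONS (not taken from the dataset documentation):
    - annotation tags are XML-like and start with "<";
    - token lines hold tab-separated columns: token, POS, lemma.
    A token that itself starts with "<" would be misread as a tag.
    """
    for raw in lines:
        line = raw.rstrip("\n")
        if not line.strip():
            continue  # skip blank lines
        if line.lstrip().startswith("<"):
            yield VerticalLine("tag", line.strip(), None, None)
        else:
            cols = line.split("\t")
            pos = cols[1] if len(cols) > 1 else None
            lemma = cols[2] if len(cols) > 2 else None
            yield VerticalLine("token", cols[0], pos, lemma)


# Hypothetical sample; real files may use different tags and columns.
SAMPLE = (
    "<term>\n"
    "machine\tNN\tmachine\n"
    "translation\tNN\ttranslation\n"
    "</term>\n"
    "systems\tNNS\tsystem\n"
)

records = list(read_vertical(SAMPLE.splitlines()))
tokens = [r.text for r in records if r.kind == "token"]
```

To read an actual file, the same function can be given an open file handle in place of the sample lines, since it accepts any iterable of strings.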

#History

ACL RD-TEC 2.0 was developed by Dr. Anne-Kathrin Schumann and Behrang QasemiZadeh as the second version of ACL RD-TEC, in order to provide annotation of terms in context.

#Other links and related materials

#Contact us

If you have questions, or would like to change or contribute to this resource, please contact Anne-Kathrin Schumann (ak47schumann at gmail.com) or Behrang QasemiZadeh (zadeh at phil.hhu.de).

#BIBLIOGRAPHY

Behrang QasemiZadeh and Anne-Kathrin Schumann. "The ACL RD-TEC 2.0: A Language Resource for Evaluating Term Extraction and Entity Recognition Methods." In Proceedings of LREC, 2016.

Schumann, A.-K. and QasemiZadeh, B. (2015a). The ACL RD-TEC Annotation Guidelines. Saarland University and National University of Ireland, ver. 2.6 edition. Available from http://pars.ie/publications/papers/pre-prints/acl-rd-tec-guidelines-ver2.pdf

QasemiZadeh, Behrang and Schumann, Anne-Kathrin, 2016, The ACL RD-TEC 2.0, LINDAT/CLARIN digital library at Institute of Formal and Applied Linguistics, Charles University in Prague, http://hdl.handle.net/11372/LRT-1661.

#Useful Links:


Last edited by BQ, 08.03.2016
