Name: CLARIN-PL
Type: Organization
Bio: CLARIN (Common Language Resources and Technology Infrastructure) is a pan-European research infrastructure intended for the humanities and social sciences
Location: Wrocław, Poland
Blog: http://clarin-pl.eu
CLARIN-PL's Projects
✨Argilla: the open-source data curation platform for LLMs
Source code for paper "Capturing Human Perspectives in NLP: Questionnaires, Annotations, and Biases" published at the 2nd Workshop on Perspectivist Approaches to NLP at the 6th European Conference on Artificial Intelligence (NLPerspectives2 @ ECAI 2023)
Code, datasets and results of the ChatGPT evaluation presented in paper "ChatGPT: Jack of all trades, master of none"
Press texts portal processing
An advanced, extensible web front-end for the Manatee-open corpus search engine
Clarin Files Share App for Nextcloud
☁️ Nextcloud server, a safe home for all your data
CLARIN-PL digital library based on DSpace
CLEX — Knowledge-based Information Extractionfrom Documents with Complex Layouts
Source code used in article
Deep Neural Entities Recognition (Bi-LSTM + Bi-GRU)
Wordnet Visual Editor
Open source annotation tool for machine learning practitioners.
A simple client for doccano API.
Minimalistic Dockerfiles used in CLARIN-PL
Embeddings: State-of-the-art Text Representations for Natural Language Processing tasks, an initial version of library focus on the Polish Language
Inforex is a web system for text corpora construction.
An advanced web front-end for the Manatee-open corpus search engine
This is the way: designing and compiling LEPISZCZE, a comprehensive NLP benchmark for Polish
Temporal storage for LEPISZCZE datasets descriptions
Generic framework for information extraction tasks, including recognition of named entities, temporal expressions, spatial expressions and events.
The main idea is to identify all geographical names in the literary text (or a corpus) and map them onto the geographical map. The task goes beyond Named Entity Recognition (NER), as NER must be combined with geo-location.
Source code for paper "Towards Model-Based Data Acquisition for Subjective Multi-Task NLP Problems" published at the 13th ICDM Workshop on Sentiment Elicitation from Natural Text for Information Retrieval and Extraction (SENTIRE) organized during the 23rd IEEE International Conference on Data Mining (ICDM 2023)
Multi Tier Annotation Search