kaveeshbaddage / misinformationcorpussinhala
This project is forked from lirneasia/misinformationcorpussinhala
A dataset of 3576 Sinhala documents, drawn from Sri Lankan news websites and fact-checking operations, each annotated as CREDIBLE, FALSE, PARTIAL or UNCERTAIN. For each document, the dataset records the content, the classification, the web domain from which the document was retrieved, and the date on which it was published.
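As a rough illustration of the four fields described above, the following sketch models one record and tallies documents per label. The field names (`content`, `label`, `domain`, `published`), the `Document` class, and the sample rows are all hypothetical placeholders; the repository's actual file format and column names are not stated here and may differ.

```python
from collections import Counter
from dataclasses import dataclass
from datetime import date

# The four annotation classes named in the dataset description.
LABELS = {"CREDIBLE", "FALSE", "PARTIAL", "UNCERTAIN"}


@dataclass(frozen=True)
class Document:
    """One record; field names are assumptions, not the dataset's real schema."""
    content: str     # text of the document
    label: str       # one of LABELS
    domain: str      # web domain the document was retrieved from
    published: date  # publication date

    def __post_init__(self):
        # Reject labels outside the four documented classes.
        if self.label not in LABELS:
            raise ValueError(f"unknown label: {self.label!r}")


def label_counts(docs):
    """Count how many documents carry each label."""
    return Counter(d.label for d in docs)


# Placeholder rows for illustration only; these are not taken from the corpus.
sample = [
    Document("placeholder text 1", "CREDIBLE", "example.com", date(2020, 1, 1)),
    Document("placeholder text 2", "FALSE", "example.org", date(2020, 1, 2)),
    Document("placeholder text 3", "CREDIBLE", "example.com", date(2020, 1, 3)),
]

counts = label_counts(sample)
for label in sorted(LABELS):
    print(label, counts[label])
```

Validating the label at construction time is one possible design choice: it surfaces a misspelled or hyphen-split label (such as a broken "UN- CERTAIN") as soon as a record is loaded rather than later during analysis.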