juhijj / text-and-web-analysis Goto Github PK
View Code? Open in Web Editor NEWThis project contains the designing of Information filtering models using Python programming language. It aims at building communication between users and web information systems. The dataset is a subset of RCV1 data collection which cannot be shared for ethical reasons. It contains a set of documents of different topics and includes topic definition, topic number, title, description and narrative.