stephaniemak / boolean-search-engine Goto Github PK
View Code? Open in Web Editor NEWThis project forked from imuqtadir/boolean-search-engine
Project parses the news corpus and retrieves all the relevant information from the article such as Author, Date, Place, Titile etc. by parsing it and indexes these fields in separate indexes. The user when enters his search query, then a boolean query is formulated using AND, OR and NOT and relevant results are retrieved along with snippets. We use Okapi BM25 model for tf-idf in order to rank the documents.