Giter VIP home page Giter VIP logo

kshishtawy / query-system-based-on-positional-index-boolean-model-and-vector-space-model Goto Github PK

View Code? Open in Web Editor NEW
0.0 1.0 0.0 48 KB

A versatile information retrieval tool implementing a Boolean retrieval model and Vector Space model. Perform precise phrase queries, rank documents by similarity scores between the documents and the query, and gain insights from processed text data collection

Python 16.36% Jupyter Notebook 83.64%
boolean-retrieval-model natural-language-processing positional-indexing python query-system vector-space-model

query-system-based-on-positional-index-boolean-model-and-vector-space-model's Introduction

Query System

Overview

Welcome to the Query System project, a powerful tool for information retrieval based on a Boolean retrieval model coupled with a Vector Space model. This versatile system allows users to perform phrase queries, retrieve documents ranked by similarity scores, and gain insights from processed text data.

Features

  • Boolean Retrieval Model: A robust Boolean retrieval model forms the foundation of the system, enabling users to execute precise and complex queries using operators such as "AND," "OR," and "NOT."

  • Vector Space Model: Leveraging a Vector Space model enhances the system's capability to retrieve documents ranked based on similarity scores, providing users with relevant and ordered results.

  • Phrase Queries: The system supports phrase queries, allowing users to search for specific phrases within the document collection.

  • Internal Workflow: A streamlined internal workflow processes the collection of text data, employs Natural Language Processing (NLP) techniques, and generates basic insights. The workflow also constructs a positional index of the collection for efficient query processing.

Project Structure

The code is organized into two main parts:

  1. Query-System-notebooks: This section contains Jupyter Notebooks explaining the underlying logic, implementation details, and examples of using the Query System.

  2. Query-System-CMD: Find the main executable file for running the Query System via the command line. Refer to this file to interact with the system using the command-line interface.

Usage

Explore the Query-System-notebooks Jupyter Notebooks file for in-depth explanations and examples of using the Query System.

Example usage: Example usage of the CMD version of the query system

Getting Started

To run the Query System on your collection of files, follow these steps:

  1. Clone the repository to your local machine.
  2. Navigate to the Query-System-CMD directory.
  3. Open the Query-System-CMD python file and change the commented line of code to your collection directory that you want to run the query system on
  4. Run the main executable file to start the system in the command-line interface.

query-system-based-on-positional-index-boolean-model-and-vector-space-model's People

Contributors

kshishtawy avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.