This is a fork of https://github.com/mahesh-tws/lmstudio-rag
Start the LM Studio server on port 1234.
This repo performs 3 functions:
- Scrapes a website (thoughtswin systems in this example), following links under the same path up to a given depth, and writes the scraped data to the data directory. You can point it at other data sources or use it as a starter.
- Runs an embedding model to embed the text into a Chroma vector database persisted on disk (chroma_db directory).
- Runs a chatbot that uses the embeddings to answer questions about the website.
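The same-path, depth-limited crawl described in the first item can be sketched roughly as below. This is a minimal illustration and not the repo's actual scraper: the names `is_under_same_path` and `crawl`, and the `fetch_links` callback (a stand-in for downloading a page and extracting its links), are hypothetical.

```python
from collections import deque
from urllib.parse import urlparse


def is_under_same_path(start_url, candidate_url):
    """Return True if candidate_url is on the same host and under the start URL's path."""
    start, cand = urlparse(start_url), urlparse(candidate_url)
    # Normalise both paths to end with "/" so "/docs" does not match "/docs-old".
    base_path = start.path if start.path.endswith("/") else start.path + "/"
    cand_path = cand.path if cand.path.endswith("/") else cand.path + "/"
    return start.netloc == cand.netloc and cand_path.startswith(base_path)


def crawl(start_url, fetch_links, max_depth):
    """Breadth-first crawl from start_url, following links at most max_depth hops away.

    fetch_links(url) is a hypothetical callback that returns the links found on a page.
    Returns the URLs in the order they were visited.
    """
    seen = {start_url}
    queue = deque([(start_url, 0)])
    visited = []
    while queue:
        url, depth = queue.popleft()
        visited.append(url)
        if depth == max_depth:
            continue  # do not follow links beyond the depth limit
        for link in fetch_links(url):
            if link not in seen and is_under_same_path(start_url, link):
                seen.add(link)
                queue.append((link, depth + 1))
    return visited
```

Passing the link extractor as a callback keeps the traversal logic separate from any HTTP or HTML-parsing library, so the depth and path rules can be checked without network access.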
main.py runs all 3 functions. Once scraping and embedding have completed, they do not need to be rerun for the same website; you can simply run chatbot.py.
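One way to decide whether the scrape and embed steps can be skipped is to check whether their output directories already have content. The sketch below assumes the directory names data and chroma_db mentioned above; the function `stages_to_run` is hypothetical and is not part of main.py.

```python
from pathlib import Path


def stages_to_run(data_dir="data", db_dir="chroma_db"):
    """Return the pipeline stages that still need to run, in order.

    Hypothetical helper: a stage counts as done if its output directory
    exists and is non-empty.
    """
    def has_content(path):
        p = Path(path)
        return p.is_dir() and any(p.iterdir())

    need_scrape = not has_content(data_dir)
    # Fresh scrape output always needs re-embedding.
    need_embed = need_scrape or not has_content(db_dir)

    stages = []
    if need_scrape:
        stages.append("scrape")
    if need_embed:
        stages.append("embed")
    stages.append("chat")
    return stages
```

Note that this check cannot tell whether the existing data belongs to the same website; when switching sites, the old data and chroma_db directories would need to be cleared first.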
Prerequisite: Run an LM Studio Server
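A quick way to confirm the server is reachable before running anything else is to query its OpenAI-compatible model listing endpoint. The sketch below assumes the default address http://localhost:1234; the helper names `models_url` and `server_is_up` are hypothetical.

```python
import http.client
import urllib.request


def models_url(host="localhost", port=1234):
    """Build the URL of the OpenAI-compatible model listing endpoint."""
    return f"http://{host}:{port}/v1/models"


def server_is_up(url=None, timeout=2.0):
    """Return True if the server answers the model listing request with HTTP 200."""
    try:
        with urllib.request.urlopen(url or models_url(), timeout=timeout) as resp:
            return resp.status == 200
    except (OSError, http.client.HTTPException):
        # Connection refused, timeout, HTTP error status, or malformed response.
        return False


if __name__ == "__main__":
    if server_is_up():
        print("LM Studio server is reachable.")
    else:
        print("LM Studio server is not reachable; start it on port 1234 first.")
```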