moj-analytical-services / airflow-pdf2embeddings Goto Github PK
View Code? Open in Web Editor NEWNLP tool for scraping text from a corpus of PDF files, embedding the sentences in the text and finding semantically similar sentences to a given search query.
License: MIT License