Giter VIP home page Giter VIP logo

hariharan-jayakumar / financial-document-summarizer Goto Github PK

View Code? Open in Web Editor NEW

This project forked from madhavsankar/question-answering-system

5.0 1.0 0.0 266.05 MB

To automate the process of reviewing financial documents by extracting relevant information and generating a report in a question-answer format.

Jupyter Notebook 79.34% Dockerfile 2.21% Python 5.06% Shell 3.50% Java 9.31% Batchfile 0.58%

financial-document-summarizer's Introduction

Motivation

This project aims to automate the process of report generation by making use of two modules - question generation from text and answer extraction from text. Firstly, question extraction modules are used to generate a set of generic questions from a training corpus of financial documents. This generated questions are fed to the test document and answer extraction modules extract the answers for these questions. The question-answer pairs are compiled in the form of a report. We also use the novel idea of using training documents to obtain a set of questions and use the knowledge gained during this for answer extraction. We also generate a generalized set of financial specific questions that can be commonly used across any similar test financial data.

Environment

Getting Started

Run the setup.ipynb on Jupyter notebook.

Deployment

The input can be given as Input.pdf (a sample is given, can be overwritten). A set of generalized questions are generated from this file. Questions from multiple such files can be combined. The test file for which summarization in the form of question answers is needed is given as Test.pdf. Answers for the generlized questions are generated from this file and given as a form of summarization.

Authors

Acknowledgments

Thanks to Dr Rajeswari Sridhar, Head Of Department, CSE, NIT Trichy for her guidance.

Thanks to Neural Networks and Deep Learning lab, MIPT for the inspiration for the Question Generation module and Answer Extraction module.

financial-document-summarizer's People

Contributors

hariharan-jayakumar avatar madhavsankar avatar

Stargazers

Rakesh Siva avatar Preetha Kumaresan avatar  avatar  avatar  avatar

Watchers

James Cloos avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.