Giter VIP home page Giter VIP logo

hhousen / docsum Goto Github PK

View Code? Open in Web Editor NEW
63.0 3.0 13.0 117 KB

A tool to automatically summarize documents abstractively using the BART or PreSumm Machine Learning Model.

Home Page: https://haydenhousen.com/projects/docsum/

License: GNU General Public License v3.0

Python 75.82% Jupyter Notebook 24.18%
machine-learning nlp summarization automatic-summarization python pdf xml-parser abstractive-summarization abstractive-text-summarization text-summarization

docsum's Introduction

Hi there ๐Ÿ‘‹

HHousen's Stats

HHousen Profile Views

GitHub Trophies ๐Ÿ† HHousen's Trophies

docsum's People

Contributors

hhousen avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar

docsum's Issues

Link to pressum is broken

Hi i am trying to run the pressum from cmd_summarizer.py but getting this error. i think the path to huggingface is broken. Can you please guide me to run presumm.

Error : Traceback (most recent call last):
File "cmd_summarizer.py", line 51, in
summarizer = presumm.PreSummSummarizer()
File "/content/docsum/presumm/presumm.py", line 28, in init
model = BertAbs.from_pretrained("bertabs-finetuned-cnndm")
File "/usr/local/lib/python3.6/dist-packages/transformers/modeling_utils.py", line 877, in from_pretrained
**kwargs,
File "/usr/local/lib/python3.6/dist-packages/transformers/configuration_utils.py", line 347, in from_pretrained
config_dict, kwargs = cls.get_config_dict(pretrained_model_name_or_path, **kwargs)
File "/usr/local/lib/python3.6/dist-packages/transformers/configuration_utils.py", line 400, in get_config_dict
raise EnvironmentError(msg)
OSError: Can't load config for 'bertabs-finetuned-cnndm'. Make sure that:

  • 'bertabs-finetuned-cnndm' is a correct model identifier listed on 'https://huggingface.co/models'

  • or 'bertabs-finetuned-cnndm' is the correct path to a directory containing a config.json file

Help How To run

hello,sorry if i use this space but i am very newbie whitout code knowledge
Is possible run on my local pc whit anaconda or ubutu or better whit google collab?

S3 bucket model not found error

Traceback (most recent call last):
File "run_summarization.py", line 325, in
main()
File "run_summarization.py", line 310, in main
evaluate(args)
File "run_summarization.py", line 34, in evaluate
model = BertAbs.from_pretrained("bertabs-finetuned-cnndm")
File "D:\anaconda3\lib\site-packages\transformers\modeling_utils.py", line 546, in from_pretrained
**kwargs,
File "D:\anaconda3\lib\site-packages\transformers\configuration_utils.py", line 202, in from_pretrained
config_dict, kwargs = cls.get_config_dict(pretrained_model_name_or_path, **kwargs)
File "D:\anaconda3\lib\site-packages\transformers\configuration_utils.py", line 272, in get_config_dict
raise EnvironmentError(msg)
OSError: Couldn't reach server at 'https://s3.amazonaws.com/models.huggingface.co/bert/remi/bertabs-finetuned-cnndm-extractive-abstractive-summarization-config.json' to download pretrained model configuration file.

I'm trying to run the code but facing the above error. The code fails to download the config file saying the file not found.
Could you please help me to resolve this issue?
i tried to access the model directly using the provided link but it raises the error "no such key"

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.