
gpt-2-pytorch's Introduction

GPT2-Pytorch with Text-Generator

Better Language Models and Their Implications

Our model, called GPT-2 (a successor to GPT), was trained simply to predict the next word in 40GB of Internet text. Due to our concerns about malicious applications of the technology, we are not releasing the trained model. As an experiment in responsible disclosure, we are instead releasing a much smaller model for researchers to experiment with, as well as a technical paper. (from the OpenAI blog)

This repository is a simple PyTorch implementation of the GPT-2 text generator, with the original code condensed into a compact form.

Quick Start

  1. Download the GPT-2 pre-trained PyTorch model that huggingface/pytorch-pretrained-BERT has already converted. (Thanks for sharing! It solved my problem of transferring the TensorFlow checkpoint (ckpt) file to a PyTorch model.)
$ git clone https://github.com/graykode/gpt-2-Pytorch && cd gpt-2-Pytorch
# download huggingface's pytorch model 
$ curl --output gpt2-pytorch_model.bin https://s3.amazonaws.com/models.huggingface.co/bert/gpt2-pytorch_model.bin
# setup requirements; if using macOS, run the additional setup described below
$ pip install -r requirements.txt
  2. Now you can run it like this:
  • Text from the book 1984 by George Orwell:
$ python main.py --text "It was a bright cold day in April, and the clocks were striking thirteen. Winston Smith, his chin nuzzled into his breast in an effort to escape the vile wind, slipped quickly through the glass doors of Victory Mansions, though not quickly enough to prevent a swirl of gritty dust from entering along with him."
  3. You can also quick-start in Google Colab.

Options

  • --text : the prompt sentence to begin generation with.
  • --quiet : suppress extraneous output such as the "================" separators.
  • --nsamples : number of samples drawn per batch when multinomial sampling is used.
  • --unconditional : if set, generate unconditionally (without a prompt).
  • --batch_size : batch size.
  • --length : number of tokens to generate (must be less than the context size).
  • --temperature : sampling temperature applied to the output distribution (default 0.7).
  • --top_k : sample only from the k most likely tokens at each step (default 40).

See here for more detail about the temperature and top_k options; a minimal sketch of how they affect sampling follows below.
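For illustration, here is a minimal sketch (not the repository's exact code) of how temperature and top-k filtering are typically applied to the next-token logits before sampling:

import torch
import torch.nn.functional as F

def sample_next_token(logits, temperature=0.7, top_k=40):
    # logits: 1-D tensor of vocabulary scores for the next token
    logits = logits / temperature                    # <1 sharpens, >1 flattens the distribution
    if top_k > 0:
        values, _ = torch.topk(logits, top_k)        # values are sorted in descending order
        cutoff = values[-1]                          # k-th largest logit
        logits[logits < cutoff] = -float("inf")      # mask everything below the cutoff
    probs = F.softmax(logits, dim=-1)
    return torch.multinomial(probs, num_samples=1)   # draw one token id

Lower temperatures and smaller top_k make the output more conservative; higher values make it more diverse.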

Dependencies

  • PyTorch 0.4.1+
  • regex 2017.4.5

Mac OS Setup

$ python3 -m venv venv
$ source venv/bin/activate
$ pip install torch tqdm
# libomp provides the OpenMP runtime that PyTorch needs on macOS
$ brew install libomp
# use a UTF-8 locale so text encoding/decoding works correctly
$ export LC_ALL=en_US.UTF-8
$ export LANG=en_US.UTF-8
$ pip install -r requirements.txt

Author

License

  • OpenAI/GPT2 follows the MIT license; huggingface/pytorch-pretrained-BERT follows the Apache license.
  • I follow the MIT license, in line with the original GPT2 repository.

Acknowledgement

Thanks to Jeff Wu (@WuTheFWasThat) and Thomas Wolf (@thomwolf) for allowing their code to be referenced.

gpt-2-pytorch's People

Contributors

graykode, raveenb


gpt-2-pytorch's Issues

Use my finetuned model?

I would very much like to know how I can use my own fine-tuned model that I trained using Colab to generate text. I have a bunch of checkpoints but I am uncertain how to proceed from here and (re)produce a bin file.
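One possible route, sketched here under the assumption that the checkpoints are ordinary PyTorch state dicts (the checkpoint path and wrapper key below are hypothetical), is to re-save just the model weights as the single .bin file that main.py loads:

import torch

# Load a fine-tuning checkpoint (path is hypothetical) and extract the weights.
checkpoint = torch.load("checkpoints/finetune-step-1000.pt", map_location="cpu")

# Some trainers wrap the weights under a key such as "model_state_dict";
# fall back to the object itself if it is already a plain state dict.
state_dict = checkpoint.get("model_state_dict", checkpoint)

# Save in the single-file format expected by main.py.
torch.save(state_dict, "gpt2-pytorch_model.bin")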

Cannot recognize <|endoftext|>

Thank you for this project! It is very helpful for understanding how GPT-2 synthesizes text.

I also noticed that GPT2/encoder.py does not implement the ability to recognize special tokens, as the HuggingFace tokenizer does.

The part of source code in HuggingFace's repo is at https://github.com/huggingface/transformers/blob/c836f77266be9ace47bff472f63caf71c0d11333/src/transformers/tokenization_utils.py#L516-L520

I understand that it is not critical, because only one special token, <|endoftext|>, is in use (wangkuiyi/huggingface-tokenizer-in-cxx#11).

So, just saying.
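For anyone who needs it, a small workaround sketch (assuming an encoder object that exposes encode(), such as the BPE encoder from GPT2/encoder.py or the HuggingFace tokenizer) is to split on the special token yourself and splice its known id, 50256, back in between the segments:

ENDOFTEXT = "<|endoftext|>"
ENDOFTEXT_ID = 50256  # id of <|endoftext|> in the standard GPT-2 vocabulary

def encode_with_endoftext(enc, text):
    # Split the text on the special token, encode each segment normally,
    # and insert the special token's id between segments.
    ids = []
    for i, segment in enumerate(text.split(ENDOFTEXT)):
        if i > 0:
            ids.append(ENDOFTEXT_ID)
        if segment:
            ids.extend(enc.encode(segment))
    return ids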

Missing requirements

It needs these packages as well, so I guess they need to go into requirements.txt:

torch
tqdm

Can we use transfer learning on GPT2?

Hi, I am new to this field. Can we do transfer learning with a new dataset that contains domain-specific content (food, electronics, and so on) and train the model?
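This repository focuses on text generation (inference), but a rough fine-tuning sketch using the separate Hugging Face transformers library (an external package, not part of this repo; domain_texts below is a placeholder for your own corpus) could look like this:

import torch
from transformers import GPT2LMHeadModel, GPT2Tokenizer

tokenizer = GPT2Tokenizer.from_pretrained("gpt2")
model = GPT2LMHeadModel.from_pretrained("gpt2")
optimizer = torch.optim.AdamW(model.parameters(), lr=5e-5)

# Placeholder corpus; replace with your own domain text (food, electronics, ...).
domain_texts = ["An example sentence about food.", "An example sentence about electronics."]

model.train()
for epoch in range(3):
    for text in domain_texts:
        input_ids = tokenizer.encode(text, return_tensors="pt")
        loss = model(input_ids, labels=input_ids)[0]  # language-modeling loss
        loss.backward()
        optimizer.step()
        optimizer.zero_grad()

model.save_pretrained("gpt2-finetuned")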

Discrepancy in Parameter Size of Smallest Model

I have been using an implementation of GPT-2 from your repository and noticed that the size of the smallest GPT-2 model available in the repository differs from the smallest model mentioned in the original GPT-2 paper.
Specifically, the parameter count of the smallest model in the repository is about 124M, but the smallest model in the original paper is listed as 117M.

I am curious to know why there is this difference.
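As an independent check, the Hugging Face transformers library (an external package, not this repository's code) reports roughly 124M parameters for the small model, of which the token-embedding matrix alone (50257 x 768) accounts for about 38.6M:

from transformers import GPT2LMHeadModel

model = GPT2LMHeadModel.from_pretrained("gpt2")
n_params = sum(p.numel() for p in model.parameters())
print(f"{n_params / 1e6:.1f}M parameters")  # about 124M for the small model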

Invalid Syntax

I installed Python 2 and followed the instructions in the readme, but I'm getting an 'Invalid Syntax' error at the closing quote of the following command. I retyped the command in case of a copy/paste artifact, but I get the same error.

main.py --text "It was a bright cold day in April, and the clocks were striking thirteen. Winston Smith, his chin nuzzled into his breast in an effort to escape the vile wind, slipped quickly through the glass doors of Victory Mansions, though not quickly enough to prevent a swirl of gritty dust from entering along with him."

GPT-2 implementation problem

"Hi, I am reading the GPT-2 paper and encountering a problem with the following phrase related to implementation:

'A modified initialization method is used to account for the accumulation on the residual path with model depth. We scale the weights of residual layers at initialization by a factor of 1/√N, where N is the number of residual layers.'

My problem is that we normalize after accumulation (addition then normalization). So, why do we need to scale weights? Aren't we doing this to reduce the impact of accumulation?"
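For context: GPT-2 uses pre-layer normalization (the LayerNorm sits at the input of each sub-block), so the residual stream itself is never renormalized between blocks and its variance grows with every residual addition; the 1/√N scaling of the residual projections at initialization compensates for that growth. A hedged PyTorch sketch of the idea (the c_proj name follows GPT-2's layer naming, and N = 2 * n_layer because each block adds two residual branches, attention and MLP):

import math
import torch.nn as nn

def init_gpt2_weights(model, n_layer, base_std=0.02):
    # Standard normal init everywhere, then rescale the residual-path output
    # projections (c_proj) by 1/sqrt(2 * n_layer), as described in the paper.
    for name, param in model.named_parameters():
        if name.endswith("c_proj.weight"):
            nn.init.normal_(param, mean=0.0, std=base_std / math.sqrt(2 * n_layer))
        elif name.endswith("weight") and param.dim() >= 2:
            nn.init.normal_(param, mean=0.0, std=base_std)
        elif name.endswith("bias"):
            nn.init.zeros_(param)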

Help Increasing the amount of training/fine-tuning text to about 10k words

Hello,
I am trying to train/fine-tune the GPT-2 model using your wrapper. I have successfully gotten it to train from a text file, but I would like to train the model on a larger amount of text, around 10,000 words on a specific topic/domain, and have it generate 500-1000 words. However, I keep getting a strange error when I try this.
How do I increase the amount of training/fine-tuning text from the current limit to about 10,000 words?
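One common cause of errors with long inputs is exceeding GPT-2's 1024-token context window; a simple workaround sketch is to tokenize the whole corpus once and split it into context-sized blocks before training (the tokenizer here is any object with an encode() method, e.g. GPT2Tokenizer from the transformers library or this repo's BPE encoder):

def make_training_blocks(tokenizer, text, block_size=1024):
    # Encode the whole corpus once, then slice it into non-overlapping blocks
    # no longer than GPT-2's 1024-token context window.
    ids = tokenizer.encode(text)
    return [ids[i:i + block_size] for i in range(0, len(ids), block_size)]

# Usage sketch:
# blocks = make_training_blocks(tokenizer, open("corpus.txt").read())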

training

Is there any way to train GPT-2 using my own text corpus?
