Comments (5)
Could you please provide more details on what you're trying to do?
I couldn't understand from your description.
from apps.
I couldn't find the file this line requires in the dataset (https://github.com/hendrycks/apps/blob/main/train/tune_apps_gpt.py#L159). But I think it is just a json file with list of training folders.
from apps.
I'll update you later tonight. I need to redownload the apps dataset as I thought we included it in there.
Otherwise I'll create a new one and upload it to the git repo. After that I'll update the README and this issue.
from apps.
Not a big issue because the json file can be inferred from the APPSBaseDataset.py file. Btw, I wonder how many gpus need to fine-tune the gpt-neo on apps. I saw the batch size per replica is only 2.
from apps.
I added the instructions here: https://github.com/hendrycks/apps/blob/main/train/README.md
and the script here: https://github.com/hendrycks/apps/blob/main/train/apps_create_split.py
As for how many GPUs I believe it is listed in the paper. I can't remember the numbers off hand.
from apps.
Related Issues (20)
- Show a data instance in the readme HOT 2
- Computation of the accuracy scores when there are compilation and runtime errors HOT 7
- evaluation on multiple solutions at once causes memory leak HOT 14
- Nan test case average HOT 5
- Test case average of solutions in real dataset HOT 8
- Running instructions HOT 4
- Request for scripts of fine-tuning HOT 3
- Problems with fine-tuning
- Problems With APPS HOT 4
- Too Long Problems HOT 8
- Unable to run pre-trained (1.5B) model on test set HOT 2
- answer_type calculation is different for train/val and eval HOT 1
- Steps About Generated Code Solutions Post-processing HOT 1
- About Solutiions' validity HOT 1
- check5 in function "run_test" seem to bring some wrong result HOT 2
- Can this dataset test for chatgpt?(gpt 3.5?) HOT 10
- Problem in ground-truth solutions HOT 2
- Asking for scripts for pre-processing HOT 2
- Request for pretrained models HOT 2
- DeepSpeed config and TrainingArguments mismatch HOT 2
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from apps.