Comments (2)
It`s still not working properly. But I think that has something to do with padding and my sequence lengths. I have to investigate that further, but thank you for your help! :)
I found a small thing that accelerated the data loading time a lot:
DiffuSeq/diffuseq/text_datasets.py
Lines 163 to 166 in bea43e1
here a line is loaded twice. By reading it once and then accessing the src and trg I saved a lot of time.
with open(path, 'r') as f_reader:
for row in f_reader:
line = json.loads(row)
sentence_lst['src'].append(line['src'].strip())
sentence_lst['trg'].append(line['trg'].strip())
from diffuseq.
Hi,
Maybe you can try to add keep_in_memory = True
in function raw_datasets.map
DiffuSeq/diffuseq/text_datasets.py
Line 78 in bea43e1
If it doesn't work, you can try to split your datasets into separate folds and load them respectively in different training steps.
from diffuseq.
Related Issues (20)
- Problem about Running Time on Dialogue dataset HOT 1
- Some questions about different losses HOT 2
- On the calculation of Xt partial loss in Zt HOT 6
- Questions about "decoder_nll" HOT 3
- About Transformer Model HOT 1
- About loss in training_losses_seq2seq() when time step t=0 HOT 5
- facing an error while trying to execute pip install -r requirements.txt in terminal HOT 13
- Error when decoding HOT 1
- Only one gpu
- About the usage of `batch` in dataset (form of `batch`, `cond`) HOT 2
- Dataset(2) in "text_datasets.py" HOT 1
- the Multi-GPU training acutally duplicates data in each GPU ? HOT 19
- Where decode "src" and "trg" ? HOT 1
- Nothing generated from decode HOT 2
- NCCL error
- License HOT 1
- Question About top-p sampling HOT 1
- q<n> metrics HOT 1
- NLL for q0-q2 is 0 but for q3 is >2 HOT 5
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from diffuseq.