Comments (6)
@dumitrac I think we need to pull 885a1f0 into main to fix this.
from olmo.
Thank you! I will reach out to ROCm team to get an estimate on when they plan to release this pytorch 2.2 image.
from olmo.
Sorry I will close the issue once the PR has been merged and I am able to verify that it works.
from olmo.
@prakamya-mishra I see that device_mesh requires Pytorch 2.2 (link).
Are you able to upgrade yours from 2.1.2 and confirm if that resolves it?
from olmo.
@dumitrac @2015aroras I am using the rocm/pytorch docker image from docker hub, and I could not find any pytorch 2.2 image tag. Is there a release of pytorch 2.2 for rocm?
from olmo.
I couldn't find a docker image for pytorch 2.2 for rocm either.
Then, it looks like @2015aroras 's PR is the quickest way to unblock you (#561).
Thank you both.
from olmo.
Related Issues (20)
- Key 'https://olmo_checkpoints' not in 'TrainConfig' HOT 1
- How the 1B and 7B model are initialized?
- Tokenizer with relative path import fails when using olmo as pip library
- Multi node training
- Resuming training on unsharded checkpoint HOT 5
- What did OLMo 1B converge to? HOT 1
- Issue with tokenizer wrapper
- start_index not getting reset in data loader when moving to new epoch HOT 4
- Cannot convert internal OLMo checkpoint to HF HOT 2
- Can long text be splitted into short texts?
- Is there explicitly instruction-following data in the version of Dolma used to train v1? HOT 1
- DDP training tries to save sharded checkpoint on the last step
- Does global_train_batch_size support gradient accumulation? HOT 1
- mlp_ratio not adjusted in config if mlp_hidden_size is set
- Initial Loss increased from 10 (0.3.0 v) to 60 (0.4.0) ! HOT 9
- Model ladder has no documentation
- Olmo 0724 `-hf` checkpoints don't load the proper config when instantiating with OLMoForCausalLM HOT 2
- why CrossEntropyLoss is zero,i HOT 2
- Gflops computation is faulty for FSDP due to bug in `OLMo.num_params()`
- Number of tokens Olmo-1B was trained: 2T or 3T?
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from olmo.