Comments (10)
/assigntome
from xla.
@duncantech I am thinking of training the latest Gemma model, indeed, with Pytorch XLA. Is it okay then?
from xla.
I think the geema model should work out of box. Take a look at https://github.com/google/gemma_pytorch#try-it-out-with-pytorchxla. Feel free to give it a try and see if we can improve anything.
from xla.
I think the geema model should work out of box. Take a look at https://github.com/google/gemma_pytorch#try-it-out-with-pytorchxla. Feel free to give it a try and see if we can improve anything.
Ok. I will look into the gemma part.
For a different model I am trying with, a few things I need to know: do I need to use any free cloud tpu provider, for example, Kaggle or Colab tpu, or is it necessary to do it with the v5 in Google Cloud?
from xla.
That part I think @duncantech can answer.
from xla.
You can work with a free TPU provider if you'd like to get things started.
We should also be able to give a small amount of v5es to try with too.
from xla.
@sitamgithub-MSIT we haven't heard an update in a bit and just wondering if you're still working on the issue?
from xla.
@sitamgithub-MSIT we haven't heard an update in a bit and just wondering if you're still working on the issue?
Yes I am working to it. I am checking this example in the hugging face for Gemma. I am thinking about reproducing the same for CodeGemma, though.
from xla.
Related Issues (20)
- Create a glossary HOT 1
- Try running Resnet example on GPU HOT 1
- Create a distributed and single device example
- Try using the CPU PJRT plugin HOT 9
- Try running inference on an ARM CPU HOT 4
- Add a table on hardware compatability HOT 3
- Update diagrams to work with dark mode HOT 1
- Test export HLO instructions HOT 7
- Add example for training small LLM HOT 3
- How do I know which pytorch parameter corresponds to which parameter in hlo ir HOT 9
- Distributed spmd training with multiple compilations HOT 4
- RuntimeError: isDifferentiableType(variable.scalar_type()) INTERNAL ASSERT FAILED when using torch.repeat HOT 2
- In-place operations on an DLPack aliased XLA tensor does not propagate. HOT 8
- [RFC] PR Cherrypicking Process After a Release Branch Cut HOT 1
- Incomplete Checkpoints for Non-Sharded Parameters During SPMD Training in PyTorch XLA HOT 4
- Delete main branch HOT 4
- TPU Initialization Failed HOT 3
- How to convert hlo.pb to hlo text? HOT 3
- The combination of inplace ops and custom op resulted in incorrect results HOT 3
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from xla.