Comments (3)
Hi @danieldietzel,
Could you provide your YAML config? We will help you check that it is correct.
from lumina-t2x.
Hi Pommes,
I ran through the steps again and realized the second item in this config is supposed to be the LLM checkpoint, not the lumina model.
https://github.com/Alpha-VLLM/Lumina-T2X/blob/main/lumina_next_t2i/configs/infer/settings.yaml
Now my YAML is:
- settings:

  model:
    ckpt: 'C:\models\lumina'
    ckpt_lm: 'C:\models\gemma'
    token: ""

  transport:
    path_type: "Linear"     # option: ["Linear", "GVP", "VP"]
    prediction: "velocity"  # option: ["velocity", "score", "noise"]
    loss_weight: "velocity" # option: [None, "velocity", "likelihood"]
    sample_eps: 0.1
    train_eps: 0.2

  ode:
    atol: 1e-6        # Absolute tolerance
    rtol: 1e-3        # Relative tolerance
    reverse: false    # option: true or false
    likelihood: false # option: true or false

  infer:
    resolution: "1024x1024"  # option: ["1024x1024", "512x2048", "2048x512", "(Extrapolation) 1664x1664", "(Extrapolation) 1024x2048", "(Extrapolation) 2048x1024"]
    num_sampling_steps: 60   # range: 1-1000
    cfg_scale: 4.            # range: 1-20
    solver: "euler"          # option: ["euler", "dopri5", "dopri8"]
    t_shift: 4               # range: 1-20 (int only)
    ntk_scaling: true        # option: true or false
    proportional_attn: true  # option: true or false
    seed: 0                  # range: any number
But I get this:
TypeError: NextDiT.forward_with_cfg() got an unexpected keyword argument 'ntk_factor'
[rank0]: AttributeError: 'NoneType' object has no attribute 'float'
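One way to narrow down the first error is to check which keyword arguments the installed method actually accepts. The sketch below uses only the standard library. `forward_with_cfg_stub` is a hypothetical stand-in written for illustration, not the real `NextDiT.forward_with_cfg` signature; in the repo environment you would pass the real method instead.

```python
import inspect


def accepts_kwarg(func, name):
    """Return True if `func` can be called with keyword argument `name`."""
    params = inspect.signature(func).parameters
    if name in params:
        # Positional-only parameters cannot be passed by keyword.
        return params[name].kind is not inspect.Parameter.POSITIONAL_ONLY
    # A **kwargs parameter accepts any keyword.
    return any(p.kind is inspect.Parameter.VAR_KEYWORD for p in params.values())


# Hypothetical stand-in for illustration only; the real method's
# parameters are not shown in this thread.
def forward_with_cfg_stub(x, t, cfg_scale):
    return x


if __name__ == "__main__":
    for kw in ("cfg_scale", "ntk_factor"):
        print(kw, accepts_kwarg(forward_with_cfg_stub, kw))
```

If the check returns False for `ntk_factor` on the real method, the sampling script and the model definition may come from different versions of the code, which would be worth ruling out before looking at the config.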
I am on Windows, by the way, if it helps. I had to change every dist.init_process_group("nccl") to dist.init_process_group("gloo") to get this far, and I am not sure whether that breaks things.
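Rather than editing each call by hand, the backend name could be chosen in one place. This is a minimal sketch, not code from the repo: `pick_backend` is a hypothetical helper, and it assumes Gloo is the right fallback whenever NCCL is unavailable, which has not been tested with Lumina. `torch.distributed.is_available()` and `torch.distributed.is_nccl_available()` are existing PyTorch calls.

```python
import sys


def pick_backend(platform_name, nccl_available):
    """Choose a torch.distributed backend name.

    NCCL is preferred when it is available; Gloo is the fallback,
    including on Windows (assumption: NCCL is not usable there).
    """
    if platform_name.startswith("win") or not nccl_available:
        return "gloo"
    return "nccl"


if __name__ == "__main__":
    try:
        import torch.distributed as dist
        nccl_ok = dist.is_available() and dist.is_nccl_available()
    except ImportError:
        # PyTorch is not installed; fall back to Gloo for the demo.
        nccl_ok = False
    backend = pick_backend(sys.platform, nccl_ok)
    print(backend)
    # In the repo's scripts one would then call:
    # dist.init_process_group(backend)
```

Keeping the choice in a single helper means the scripts stay unchanged on Linux while still running on a machine without NCCL.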
This may not impact performance, but we have not tested whether the code runs correctly on the Gloo backend. You could try running the mini version of Lumina-Next-T2I at https://github.com/Alpha-VLLM/Lumina-T2X/tree/main/lumina_next_t2i_mini