Comments (3)
I have the same question.
The first for loop modifies the following blocks:
down_blocks.0.attentions.0.transformer_blocks.0
down_blocks.0.attentions.1.transformer_blocks.0
down_blocks.1.attentions.0.transformer_blocks.0
down_blocks.1.attentions.1.transformer_blocks.0
down_blocks.2.attentions.0.transformer_blocks.0
down_blocks.2.attentions.1.transformer_blocks.0
up_blocks.1.attentions.0.transformer_blocks.0
up_blocks.1.attentions.1.transformer_blocks.0
up_blocks.1.attentions.2.transformer_blocks.0
up_blocks.2.attentions.0.transformer_blocks.0
up_blocks.2.attentions.1.transformer_blocks.0
up_blocks.2.attentions.2.transformer_blocks.0
up_blocks.3.attentions.0.transformer_blocks.0
up_blocks.3.attentions.1.transformer_blocks.0
up_blocks.3.attentions.2.transformer_blocks.0
mid_block.attentions.0.transformer_blocks.0
The second for loop modifies:
up_blocks.1.attentions.1.transformer_blocks.0.attn1
up_blocks.1.attentions.2.transformer_blocks.0.attn1
up_blocks.2.attentions.0.transformer_blocks.0.attn1
up_blocks.2.attentions.1.transformer_blocks.0.attn1
up_blocks.2.attentions.2.transformer_blocks.0.attn1
up_blocks.3.attentions.0.transformer_blocks.0.attn1
up_blocks.3.attentions.1.transformer_blocks.0.attn1
up_blocks.3.attentions.2.transformer_blocks.0.attn1
Every entry is the attn1 module of a block already covered by the first for loop, so the second set is a subset of the first.
According to the code comment, the first block of the lowest resolution should not have extended attention registered. However, the first for loop registers extended attention for that block as well.
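To double-check the subset claim, here is a small sketch that rebuilds both listings and compares them. The names are reconstructed from the two lists above, not read from the library, and second_loop_layout is a hypothetical helper of mine that just encodes which up-block attentions appear in the second list.

```python
# Rebuild the block names from the two listings in this thread.
# Nothing here is imported from tokenflow or diffusers; it is plain string work.
first_loop = (
    [f"down_blocks.{b}.attentions.{a}.transformer_blocks.0"
     for b in range(3) for a in range(2)]
    + [f"up_blocks.{b}.attentions.{a}.transformer_blocks.0"
       for b in (1, 2, 3) for a in range(3)]
    + ["mid_block.attentions.0.transformer_blocks.0"]
)

# Hypothetical layout: up-block index -> attention indices seen in the second list.
second_loop_layout = {1: [1, 2], 2: [0, 1, 2], 3: [0, 1, 2]}
second_loop = [
    f"up_blocks.{b}.attentions.{a}.transformer_blocks.0.attn1"
    for b, attn_indices in second_loop_layout.items()
    for a in attn_indices
]

# Strip the trailing ".attn1" to get the transformer block that owns each module.
second_parents = {name.rsplit(".", 1)[0] for name in second_loop}

is_subset = second_parents <= set(first_loop)
only_first = sorted(set(first_loop) - second_parents)
up_only_first = [name for name in only_first if name.startswith("up_blocks")]

print("second is a subset of first:", is_subset)
print("up blocks touched only by the first loop:", up_only_first)
```

If the listings are right, the only up block that the first loop touches and the second loop skips is up_blocks.1.attentions.0, which is presumably the block the code comment is talking about.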
from tokenflow.
Same question.
I think the valid function should be register_extended_attention_pnp, where a list injection_schedule is defined.
Lines 203 to 214 in 8ae24e9
The injection is activated according to injection_schedule.
Lines 124 to 130 in 8ae24e9
Lines 86 to 91 in 8ae24e9
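A toy sketch of what "activated according to injection_schedule" could look like, assuming the schedule is a list of timesteps and injection only happens when the current timestep is in that list. make_gated_forward, plain, injected and the timestep values are all hypothetical names for illustration; this is not the tokenflow code.

```python
from typing import Any, Callable, List, Optional, Tuple


def make_gated_forward(
    plain: Callable[[Any], Tuple[str, Any]],
    injected: Callable[[Any], Tuple[str, Any]],
    injection_schedule: Optional[List[int]],
) -> Callable[[Any, int], Tuple[str, Any]]:
    """Return a forward function that uses `injected` only on scheduled timesteps."""

    def forward(x: Any, t: int) -> Tuple[str, Any]:
        # Assumption: injection is gated by membership of t in the schedule.
        if injection_schedule is not None and t in injection_schedule:
            return injected(x)
        return plain(x)

    return forward


def plain(x: Any) -> Tuple[str, Any]:
    return ("plain", x)


def injected(x: Any) -> Tuple[str, Any]:
    return ("injected", x)


# Hypothetical timesteps; a module given an empty schedule never injects.
scheduled = make_gated_forward(plain, injected, [981, 961])
never = make_gated_forward(plain, injected, [])

print(scheduled("h", 981)[0], scheduled("h", 1)[0], never("h", 981)[0])
```

Under this assumption, a module registered with an empty schedule never takes the injected path, while one registered with the real schedule injects on the listed timesteps.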
BTW, I tried removing the first loop (L203-L206) and the result did not change. However, removing the second loop (L208-L214) made the result worse.