In the paper, you mention that IA^3 is compatible with multi-task batching, a requirem

Hi <a class="user-mention notranslate" data-hovercard-type="user" data-hovercard-url="

Multi-task batching about t-few HOT 4 OPEN

r-three commented on August 24, 2024 1

Multi-task batching

from t-few.

Comments (4)

dptam commented on August 24, 2024 1

Hi @einarbmag, sorry for the long delay.
I could be wrong, but I think mixture-of-experts might have an implementation for this to use different experts within a batch. @muqeeth might know more about this.

from t-few.

muqeeth commented on August 24, 2024 1

Hi @einarbmag, here is one possible implementation we can use for a batch containing examples from multiple tasks:

Assume B is the batch size, N is the number of tasks, and H is the hidden dimension at which IA^3 is applied.

Task indices T are represented by a B x N tensor. This tensor is one-hot, where the index corresponding to the task index is set to 1 for each example.
IA^3 vectors V are defined as an N x H tensor.

We can obtain the required IA^3 vectors for each example by using L_batch = torch.matmul(T, V).

Then, we modify the input activations, which have the shape (B x num_tokens x H), by multiplying them with L_batch unsqueezing along the sequence dimension.

from t-few.

dodoyeon commented on August 24, 2024

Hi I have to use the mixed task batch so I'll do it if I need to,..
Did you implement IA3 mixed task batch?

from t-few.

dptam commented on August 24, 2024

Hi @dodoyeon sorry we did not, but Muqeeth's sketch above can provide a starting point!

from t-few.

Recommend Projects

Multi-task batching about t-few HOT 4 OPEN

Comments (4)

Related Issues (20)

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent