Hi, thanks for your great job! I have a question on how to visualize the attention wei

Hi, You can use this flag (<a href="https://github.com/shariqiqbal28

Yes that is correct. The attention weights are calculated as part of the state-a

Hi, You can use this flag (<a href="https://github.com/

How to visualize the attention weights between agents in the testing phase? about maac HOT 8 CLOSED

shariqiqbal2810 commented on July 28, 2024

How to visualize the attention weights between agents in the testing phase?

from maac.

Comments (8)

shariqiqbal2810 commented on July 28, 2024

Hi,

You can use this flag (https://github.com/shariqiqbal2810/MAAC/blob/master/utils/critics.py#L162) to return the attention weights of each each agent over the other agents for all the time points that are passed in as input.

from maac.

soada commented on July 28, 2024

Thanks for the instructions! But I still have two questions:

Is the flag used for returning the attention weights of the samples collected in the training process?
Can I obtain the attention weights on the fixed time-step in the evaluating process (the decentralized execution process)?

from maac.

shariqiqbal2810 commented on July 28, 2024

The flag is simply used whenever you call the forward pass on the critic module. Example:

critic = AttentionCritic(sa_sizes)
rets = critic(return_q=True, return_attend=True)
# rets[0][0] contains Q-value for agent 0 corresponding to inputs and rets[0][1] contains attention weights for agent 0

As such, the attention weights are calculated for whatever states and actions you pass into the critic during the forward pass, so you can calculate the attention weights both during training and execution if you would like.

from maac.

soada commented on July 28, 2024

Thanks for your advice! In the execution process, the agents only get observations.

Should I first get the actions from the policies and then send the obs-action pair to the critic to calculate the attention weights? Is there any method to calculate the weights depending only on observations?
As for Figure 6 in your article, when the rover is paired with different towers, are the attention weights calculated in the training process averaged over several times or execution process?
If the attention weights are dynamically changed within an episode, then how to make a visualization? Thanks very much!

from maac.

shariqiqbal2810 commented on July 28, 2024

Yes that is correct. The attention weights are calculated as part of the state-action value prediction network, so there is no way to get them without inputting actions.
For Figure 6, the "attention entropy" is reported as an average over all data points in the mini-batch provided during training. It's important to note here that Figure 6 is not plotting the actual attention weights, but rather their entropy (i.e. how uniformly the attention weights are distributed).
You can simply plot the attention weights on a per timepoint basis.

from maac.

soada commented on July 28, 2024

Thank you very much! In fact, the figure I have mentioned is the following one (maybe figure 7 in your final version), whose caption is " Attention weights when subjected to different Tower pairings for Rover 1 in Rover-Tower environment
":

Are the attention weights calculated in the training process averaged over several times or execution process?

from maac.

shariqiqbal2810 commented on July 28, 2024

Oh I see. These are calculated from a single timepoint during execution.

from maac.

GoingMyWay commented on July 28, 2024

Hi,

You can use this flag (https://github.com/shariqiqbal2810/MAAC/blob/master/utils/critics.py#L162) to return the attention weights of each each agent over the other agents for all the time points that are passed in as input.

Hi, sir, is the all_attend_probs[i] the attention weights of agent i or it is the attention weights of other agents except itself?

from maac.

How to visualize the attention weights between agents in the testing phase? about maac HOT 8 CLOSED

Comments (8)

Related Issues (20)

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent