DQL still learning at evaluation time about cyberbattlesim HOT 1 OPEN

blumu commented on June 10, 2024

DQL still learning at evaluation time


Comments (1)

blumu commented on June 10, 2024

That's a fair point, though I think it is fair game to assume the agent can continue to learn and adjust its policy during evaluation, as long as its state (the Q-function) is reset at the beginning of each episode to what it was after the learning phase. That reset might not currently be happening and may need to be fixed.
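To illustrate the reset idea, here is a minimal sketch using a tabular Q-learner as a stand-in. It is not the CyberBattleSim DeepQLearnerPolicy: the class TabularQLearner, the evaluate helper, and the transition format are all hypothetical names made up for this example. The point is only the pattern: snapshot the learned Q-function once, restore it at the start of every evaluation episode, and let the agent keep learning within the episode.

```python
import copy
from collections import defaultdict


class TabularQLearner:
    """Hypothetical tabular stand-in for a Q-learning agent (not the CyberBattleSim class)."""

    def __init__(self, learning_rate=0.5, gamma=0.9):
        self.learning_rate = learning_rate
        self.gamma = gamma
        self.q = defaultdict(float)  # maps (state, action) -> estimated value

    def update_q_function(self, state, action, reward, next_state, actions):
        # Standard one-step Q-learning update.
        best_next = max(self.q[(next_state, a)] for a in actions)
        target = reward + self.gamma * best_next
        self.q[(state, action)] += self.learning_rate * (target - self.q[(state, action)])


def evaluate(agent, episodes, actions=(0, 1)):
    """Run evaluation episodes, restoring the trained Q-function before each one.

    `episodes` is a list of episodes; each episode is a list of
    (state, action, reward, next_state) transitions (an assumed format).
    Returns the Q-table as it stood at the end of each episode.
    """
    snapshot = copy.deepcopy(agent.q)  # state after the learning phase
    end_of_episode_tables = []
    for transitions in episodes:
        agent.q = copy.deepcopy(snapshot)  # reset before every episode
        for state, action, reward, next_state in transitions:
            # The agent is still allowed to learn within the episode.
            agent.update_q_function(state, action, reward, next_state, actions)
        end_of_episode_tables.append(dict(agent.q))
    agent.q = copy.deepcopy(snapshot)  # leave the agent as it was after training
    return end_of_episode_tables


agent = TabularQLearner()
agent.q[("s0", 0)] = 1.0  # pretend this value came from the learning phase
episode = [("s0", 0, 2.0, "s1")]
tables = evaluate(agent, [episode, episode])
```

Because the Q-table is restored each time, two identical episodes end with identical tables, so nothing learned in one evaluation episode leaks into the next. For a neural-network Q-function the analogous step would presumably be saving and reloading the network weights (for instance with PyTorch's state_dict and load_state_dict), but that is an assumption about how one might implement it, not a description of the current code.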

Also, if we really want to handicap the agent and prevent it from learning during an episode, then I suggest we add a freeze_learning: bool parameter to the DeepQLearnerPolicy agent. If set to true, the function update_q_function becomes a no-op.

CyberBattleSim/cyberbattle/agents/baseline/agent_dql.py

Line 330 in 4fd228b

def update_q_function(self,
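A rough sketch of what the freeze_learning flag could look like, again on a hypothetical tabular learner rather than the real DeepQLearnerPolicy (whose update_q_function signature is not reproduced here). The class name FreezableQLearner and its constructor arguments are assumptions for illustration only.

```python
from collections import defaultdict


class FreezableQLearner:
    """Hypothetical tabular Q-learner with a freeze_learning switch."""

    def __init__(self, learning_rate=0.5, gamma=0.9, freeze_learning=False):
        self.learning_rate = learning_rate
        self.gamma = gamma
        self.freeze_learning = freeze_learning
        self.q = defaultdict(float)  # maps (state, action) -> estimated value

    def update_q_function(self, state, action, reward, next_state, actions):
        if self.freeze_learning:
            # Evaluation mode: leave the Q-function untouched.
            return
        best_next = max(self.q[(next_state, a)] for a in actions)
        target = reward + self.gamma * best_next
        self.q[(state, action)] += self.learning_rate * (target - self.q[(state, action)])


learning_agent = FreezableQLearner(freeze_learning=False)
learning_agent.update_q_function("s0", 0, 1.0, "s1", actions=(0, 1))

frozen_agent = FreezableQLearner(freeze_learning=True)
frozen_agent.update_q_function("s0", 0, 1.0, "s1", actions=(0, 1))
```

With the flag off, the update changes the value for ("s0", 0); with it on, the call returns immediately and the table stays empty. Putting the check at the top of the update keeps the rest of the training and evaluation loop unchanged, which is the appeal of the no-op approach.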


