soft module codebase notes #79

NirViaje · 2021-08-23T04:32:25Z

https://rchalyang.github.io/SoftModule/

"epoch_frames" : 200
"batch_size" : 1280

`torchrl/algo/off_policy/twin_sac_q.py`

Get the sparse loss here:

        """
        Policy Loss
        """
        if not self.reparameterization:
            raise NotImplementedError
        else:
            assert log_probs.shape == q_new_actions.shape
            policy_loss = ( alpha * log_probs - q_new_actions).mean()

        std_reg_loss = self.policy_std_reg_weight * (log_std**2).mean()
        mean_reg_loss = self.policy_mean_reg_weight * (mean**2).mean()

        policy_loss += std_reg_loss + mean_reg_loss
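
The arithmetic in the snippet above can be checked by hand with a small dependency-free sketch. This is not the code from `twin_sac_q.py`: the function name `sac_policy_loss`, the helper `_mean`, and the use of flat Python lists in place of tensors are all assumptions made for illustration, and the weights are passed as arguments instead of being read from `self`.

```python
def _mean(values):
    """Arithmetic mean of an iterable of floats (hypothetical helper)."""
    values = list(values)
    return sum(values) / len(values)


def sac_policy_loss(log_probs, q_new_actions, mean, log_std,
                    alpha, std_reg_weight, mean_reg_weight):
    """Scalar policy loss for one batch, mirroring the snippet above.

    All batch arguments are flat lists of floats standing in for the
    tensors used in the original code (an assumption of this sketch).
    """
    # Counterpart of the shape assert in the original, for flat lists.
    assert len(log_probs) == len(q_new_actions)

    # Reparameterized SAC term: mean(alpha * log_prob - Q(s, a_new)).
    policy_loss = _mean(
        alpha * lp - q for lp, q in zip(log_probs, q_new_actions)
    )

    # L2 penalties on the policy head outputs (log-std and mean).
    std_reg_loss = std_reg_weight * _mean(s ** 2 for s in log_std)
    mean_reg_loss = mean_reg_weight * _mean(m ** 2 for m in mean)

    return policy_loss + std_reg_loss + mean_reg_loss


if __name__ == "__main__":
    # Made-up numbers, chosen only to make the arithmetic easy to follow.
    loss = sac_policy_loss(
        log_probs=[-1.0, -3.0],
        q_new_actions=[2.0, 4.0],
        mean=[2.0, 0.0],
        log_std=[1.0, -1.0],
        alpha=0.5,
        std_reg_weight=0.1,
        mean_reg_weight=0.01,
    )
    print(loss)
```

Any extra term, such as a sparsity penalty, would be added to the returned sum in the same way as the two regularizers.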

mujoco210-linux-x86_64.tar.gz

local_debug_logger-master.zip

NirViaje · 2021-09-13T08:25:48Z

CoBERL

v-MPO
