The proxy should not be copied into each environment instance #288

alexhernandezgarcia · 2024-02-19T04:57:01Z

Currently, the proxy is set as an attribute of the environments and the base environment implements the methods proxy2reward() and reward2proxy() that determine the conversion between proxy outputs and reward. The environment also implements the methods reward() and reward_batch(), which call the proxy and the conversion methods. This is probably not ideal for various reasons.

I do not see any longer a good reason to keep the proxy and these methods within the environment. It seems possible and a good idea to completely detach the environment and the proxy. Some proxies need information from the environment, which is currently set via the call to Env.setup_proxy(), which calls the proxy's setup() method. But this could just be done elsewhere.

Now, in terms of alternatives, I am not completely settled on what the best option would be. In particular, where should the methods that convert between proxy and reward go?

In the (base) proxy?
In the GFlowNet agent?

The text was updated successfully, but these errors were encountered:

alexhernandezgarcia · 2024-03-20T14:53:25Z

It seems that it would be better to have the methods in the base proxy class, so as to make it easier for non-GFN baselines to re-use the GFlowNet code.

alexhernandezgarcia · 2024-06-04T13:31:00Z

Done in PR #299

alexhernandezgarcia self-assigned this Feb 19, 2024

alexhernandezgarcia added the help wanted Extra attention is needed label Mar 19, 2024

alexhernandezgarcia pinned this issue Mar 19, 2024

alexhernandezgarcia mentioned this issue Apr 2, 2024

Detachment of env and proxy; avoiding copies of the environments #299

Merged

10 tasks

alexhernandezgarcia closed this as completed Jun 4, 2024

alexhernandezgarcia unpinned this issue Jun 4, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

The proxy should not be copied into each environment instance #288

The proxy should not be copied into each environment instance #288

alexhernandezgarcia commented Feb 19, 2024 •

edited

Loading

alexhernandezgarcia commented Mar 20, 2024

alexhernandezgarcia commented Jun 4, 2024

The proxy should not be copied into each environment instance #288

The proxy should not be copied into each environment instance #288

Comments

alexhernandezgarcia commented Feb 19, 2024 • edited Loading

alexhernandezgarcia commented Mar 20, 2024

alexhernandezgarcia commented Jun 4, 2024

alexhernandezgarcia commented Feb 19, 2024 •

edited

Loading