Replies: 2 comments
-
Hey man, this is a common problem in machine learning: if we overfit the training set, test performance will often decrease because the algorithm learns the training data too well. There are many ways to address this across different algorithms; I suggest googling "overfitting" together with whichever algorithm you are using. But as I said, this is a very well-explored, fundamental issue in ML/RL, and it's something you will have to take the time to learn about thoroughly.
-
Obviously not. You should stop training earlier to avoid overfitting.
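One common way to decide when to stop is early stopping: evaluate periodically on held-out episodes and keep the checkpoint with the best evaluation reward rather than training for a fixed number of episodes. Here is a minimal sketch of that idea, assuming a generic train/evaluate loop; the function names (`train_step`, `evaluate`) and the patience values are hypothetical, not from any particular library:

```python
def train_with_early_stopping(train_step, evaluate, max_iters=100,
                              eval_every=10, patience=3):
    """Return (best_iter, best_reward).

    train_step(i) runs one chunk of training; evaluate() returns the
    mean reward on held-out evaluation episodes. Training stops once
    evaluation reward fails to improve `patience` times in a row.
    """
    best_reward = float("-inf")
    best_iter = 0
    bad_evals = 0
    for i in range(1, max_iters + 1):
        train_step(i)
        if i % eval_every == 0:
            reward = evaluate()
            if reward > best_reward:
                best_reward, best_iter = reward, i
                bad_evals = 0  # improvement: reset the patience counter
            else:
                bad_evals += 1
                if bad_evals >= patience:
                    break  # no improvement for `patience` evaluations
    return best_iter, best_reward

# Toy demo: evaluation reward rises, then falls (simulating overfitting).
rewards = iter([1.0, 2.0, 3.0, 2.5, 2.0, 1.5, 1.0])
best_iter, best_reward = train_with_early_stopping(
    lambda i: None, lambda: next(rewards))
# Stops after three evaluations without improvement; the best checkpoint
# is the one at iteration 30 with reward 3.0.
```

In practice you would save the model parameters whenever `best_reward` improves and restore that checkpoint at the end, instead of just recording the iteration number.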
-
The graph below shows the Q value, actor loss, and reward over the course of training. The model is SAC.
As far as I know, the algorithm's objective is to maximise the Q value, so I'm confused about why the Q value decreases during training.
I also notice that with more training episodes the loss gets smaller, but the results get worse in testing. If that's the case, how many episodes should I train for?
Thanks
Jason