Skip to content

Inquiry Regarding Loss Score Interpretation in Reinforcement Learning with reinvent4 #79

Answered by halx
jackhu5 asked this question in Q&A
Discussion options

You must be logged in to vote

Hi,

many thanks for your interest in REINVENT and welcome to the community!

the loss function is documented in our paper, see eqs. 5, 6 and discussion following eq 7. Please note carefully that the equations there are written with the log likelihood while the TensorBoard output logs the negative log likelihood (NLL) which, confusingly, is a positive number.

Practically thinking, what you want to achieve in reinforcement learning (RL) is that the new model (agent) generates highly scoring SMILES with low NLL. This will, somewhat naturally, mean that the very same SMILES would probably not score well with the prior, which is used as a reference or "regularizer", and generate compounds with …

Replies: 1 comment 1 reply

Comment options

You must be logged in to vote
1 reply
@jackhu5
Comment options

Answer selected by jackhu5
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
question Further information is requested
2 participants