Is the order of the args in lstsq in the APS agent's regress meta function correct? #30

raymondchua · 2023-02-10T03:52:32Z

I am curious why, in the regress_meta function of the APS agent, reward is the first argument to lstsq rather than rep. According to the torch documentation, torch.linalg.lstsq(A, B) finds the X that minimizes ||AX - B||_F, with A as the first argument and B as the second. Since the problem we are trying to solve is finding w that minimizes ||rep * w - reward||, shouldn't A be rep instead?

task = torch.linalg.lstsq(reward, rep)[0][:rep.size(1), :][0]
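To make the question concrete, here is a minimal sketch comparing the two argument orders on synthetic data. The shapes are assumptions for illustration only (rep as an (n, d) matrix, reward as an (n, 1) column), and the names n, d, w_true, w_hat and x_swapped are hypothetical and do not come from the repository.

```python
import torch

torch.manual_seed(0)

# Assumed shapes, for illustration only: n samples, d-dimensional features.
n, d = 256, 5
rep = torch.randn(n, d, dtype=torch.float64)     # plays the role of A
w_true = torch.randn(d, 1, dtype=torch.float64)  # weights we hope to recover
reward = rep @ w_true                            # noiseless targets, shape (n, 1)

# torch.linalg.lstsq(A, B) minimizes ||A X - B||_F.
# With A = rep and B = reward, X is the weight vector w, shape (d, 1).
w_hat = torch.linalg.lstsq(rep, reward).solution

# With the arguments swapped, the problem becomes ||reward X - rep||_F,
# so X has shape (1, d): each column of rep is regressed on reward.
x_swapped = torch.linalg.lstsq(reward, rep).solution

print(w_hat.shape, x_swapped.shape)
print(torch.allclose(w_hat, w_true, atol=1e-6))
```

Under these assumed shapes, the swapped call returns a (1, d) row whose j-th entry is the projection coefficient of the j-th column of rep onto reward, which in general differs from the w that minimizes ||rep * w - reward||.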
