Implement ForgettingPeriodic agent #12

ali-tny · 2021-07-15T21:44:26Z

ie, a Periodic agent that forgets observations from the past, implemented
via setting the chance of including an observation as an exponential decay.

The default value of the exponential decay constant is chosen to include
observations from a year ago with probability 0.05. I think this is probably good for environments
longer than a year, but could probably be higher for environments only of a single year.

We'll borrow a lot of the functionality when making the forgetting version, so we just let the concrete class define it's own UCB method and pull the rest of the functionality into the base. For the forgetting agent, we'll need the full history of the conversions (which is also sufficient for the regular Periodic agent), so we refactor to allow that.

ie, a Periodic agent that forgets observations from the past, implemented via setting the chance of including an observation as an exponential decay. The default value of the exponential decay constant is chosen to include observations from a year ago with probability 0.05.

codecov-commenter · 2021-07-15T21:46:01Z

Codecov Report

Merging #12 (83f34da) into main (6266824) will not change coverage.
The diff coverage is 0.00%.

@@          Coverage Diff          @@
##            main     #12   +/-   ##
=====================================
  Coverage   0.00%   0.00%           
=====================================
  Files          2       2           
  Lines         54      73   +19     
=====================================
- Misses        54      73   +19

Impacted Files	Coverage Δ
pachinko/time_period_step_agent.py	`0.00% <0.00%> (ø)`

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update 6266824...83f34da. Read the comment docs.

DBCerigo

Yea nice. Good work on the refactor for the generalising/abstraction, did the right amount (not too much) of it (imo).

Just minors/enhancement suggestions.

I guess we need to figure out how we are going to add agent results/analysis in the repo so we can discuss it (without having to be in the same room) ay? Possibly we have a convention branch, like agent_performance (name tbd), in which we commit the main run notebook, that outputs the history plots and summary df etc. in nb for all the agent-envs. Thoughts?

DBCerigo · 2021-07-31T08:17:50Z

pachinko/time_period_step_agent.py

        ]
-        # Set conversion rate to infinity for unchosen actions
-        # to ensure all actions are tested at least once
+        # Set conversion rate to infinity for unchosen actions to ensure all actions are tested at


nit: small bit of imprecision in comment that could cause a confusion: we're not "set"ting the conversion rate (that's set by the env), we are setting the agents initial belief in the conversion rate. Maybe just adding "beliefs"/"estimates"/"inferences" after the word "rate" would help.

DBCerigo · 2021-07-31T08:25:59Z

pachinko/time_period_step_agent.py

+# A list containing a tuple of (num_successes, timestep) for each time an action was picked,
+# representing the full history of that action's conversions
+ConversionHistory = List[Tuple[int, int]]


Enhancement: would it be worth defining types like:

NumberSuccesses = int Timestep = int

to enable

ConversionHistory = List[Tuple[NumberSuccesses, Timestep]]

Given the importance of that variable for the agents etc., I found myself having to refer back to the commend on line 64 repeatedly, so I kept forgetting which was with the 2uple. Having it in the actual type def instead of having to stitch that with the comment would help a lot I think.

Thoughts?

ali-tny added 3 commits July 15, 2021 21:53

Correct RandomAgent and EpsilonGreedy docstrings

a2ed248

ali-tny requested a review from DBCerigo July 15, 2021 21:51

DBCerigo approved these changes Jul 31, 2021

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Implement ForgettingPeriodic agent #12

Implement ForgettingPeriodic agent #12

ali-tny commented Jul 15, 2021

codecov-commenter commented Jul 15, 2021

DBCerigo left a comment

DBCerigo Jul 31, 2021

DBCerigo Jul 31, 2021

Implement ForgettingPeriodic agent #12

Are you sure you want to change the base?

Implement ForgettingPeriodic agent #12

Conversation

ali-tny commented Jul 15, 2021

codecov-commenter commented Jul 15, 2021

Codecov Report

DBCerigo left a comment

Choose a reason for hiding this comment

DBCerigo Jul 31, 2021

Choose a reason for hiding this comment

DBCerigo Jul 31, 2021

Choose a reason for hiding this comment