Skip to content

Counterfactual Regret Minimization (CFR) without generating all the states apriori #1239

Answered by lanctot
PURANJAY14 asked this question in Q&A
Discussion options

You must be logged in to vote

Ah yes, but to be honest I really don't remember why it does it that way. There was a deep technical reason. @jblespiau, do you remember?

@PURANJAY14 if I were you, I would make an alternative tabular policy called TabularPolicyDict that simply wraps a python dictionary but otherwise exposes the necessary methods in the Policy base class. (And initializing the state policy to uniform + adding it to the dictionary upon first call). Then replace all the ones in cfr.py with your custom dict.

Hope this helps!

Replies: 3 comments 2 replies

Comment options

You must be logged in to vote
0 replies
Comment options

You must be logged in to vote
0 replies
Comment options

You must be logged in to vote
2 replies
@PURANJAY14
Comment options

@lanctot
Comment options

Answer selected by PURANJAY14
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
2 participants