Counterfactual Regret Minimization (CFR) without generating all the states apriori #1239
-
Hi, |
Beta Was this translation helpful? Give feedback.
Replies: 3 comments 2 replies
-
CFR should not require getting all states. I just checked cfr.py and I don't see an import for get_all_states.py or where it gets all the states. Can you point to where you think the CFR implementation gets all the states? Oh wait, do you mean information states? You may want to look into MCCFR in that case. |
Beta Was this translation helpful? Give feedback.
-
Ah yes, but to be honest I really don't remember why it does it that way. There was a deep technical reason. @jblespiau, do you remember? @PURANJAY14 if I were you, I would make an alternative tabular policy called TabularPolicyDict that simply wraps a python dictionary but otherwise exposes the necessary methods in the Policy base class. (And initializing the state policy to uniform + adding it to the dictionary upon first call). Then replace all the ones in cfr.py with your custom dict. Hope this helps! |
Beta Was this translation helpful? Give feedback.
Ah yes, but to be honest I really don't remember why it does it that way. There was a deep technical reason. @jblespiau, do you remember?
@PURANJAY14 if I were you, I would make an alternative tabular policy called TabularPolicyDict that simply wraps a python dictionary but otherwise exposes the necessary methods in the Policy base class. (And initializing the state policy to uniform + adding it to the dictionary upon first call). Then replace all the ones in cfr.py with your custom dict.
Hope this helps!