TODO: write up https://ai.stanford.edu/~ang/papers/icml06-usinginaccuratemodelsinrl.pdf
- Offline Reinforcement Learning: Tutorial, Review, and Perspectives on Open Problems
- Hindsight Experience Replay
- Accelerating Online Reinforcement Learning with Offline Datasets
- Temporal difference learning and TD-Gammon
- An object-oriented representation for efficient reinforcement learning
- Using Inaccurate Models in Reinforcement Learning
- https://bair.berkeley.edu/blog/2020/07/11/auction/
- The Cognitive Systems Paradigm
- A Framework for Intelligence and Cortical Function Based on Grid Cells in the Neocortex
- Encoding Reality: Prediction-Assisted Cortical Learning Algorithm in Hierarchical Temporal Memory
- Deep Neuroevolution: Genetic Algorithms are a Competitive Alternative for Training Deep Neural Networks for Reinforcement Learning
- Evolution Strategies as a Scalable Alternative to Reinforcement Learning
- Deep Double Descent: Where Bigger Models and More Data Hurt
- Explore implementing XOR in a small NN with linearity and nonlinearity
- Gated Linear Networks
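
Starting point for the XOR item above: a minimal sketch, not a trained model. It assumes a 2-2-1 network whose weights are picked by hand (the names `tiny_net`, `relu`, and `identity` are made up for this note) and runs the same weights once with a ReLU hidden layer and once with a purely linear one. The linear variant collapses to an affine function of the inputs, and any affine function f satisfies f(0,0) + f(1,1) = f(0,1) + f(1,0), which XOR violates (0 vs. 2).

```python
# Hypothetical sketch: XOR with a tiny 2-2-1 network and hand-chosen weights.
# No training loop; the point is only to compare linear vs. nonlinear hidden units.

def relu(x):
    return max(0.0, x)

def identity(x):
    # "Linear" activation: the hidden layer does nothing nonlinear.
    return x

def tiny_net(x1, x2, activation):
    # Hidden layer (weights and biases picked by hand, an assumption of this sketch):
    #   h1 = act(x1 + x2)
    #   h2 = act(x1 + x2 - 1)
    h1 = activation(x1 + x2)
    h2 = activation(x1 + x2 - 1.0)
    # Output layer: y = h1 - 2 * h2
    return h1 - 2.0 * h2

INPUTS = [(0, 0), (0, 1), (1, 0), (1, 1)]
XOR = [0, 1, 1, 0]

relu_out = [tiny_net(a, b, relu) for a, b in INPUTS]
linear_out = [tiny_net(a, b, identity) for a, b in INPUTS]

if __name__ == "__main__":
    print("target:", XOR)
    print("relu  :", relu_out)
    print("linear:", linear_out)
    # Affine functions must satisfy f(0,0) + f(1,1) == f(0,1) + f(1,0).
    print("affine identity holds for linear net:",
          linear_out[0] + linear_out[3] == linear_out[1] + linear_out[2])
```

Possible next step: replace the hand-picked weights with a small gradient-descent loop and check whether it finds an equivalent solution.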