racer

Black-box, gradient-free optimization of car-racing policies.

Example Race

Our best-performing agent, the tuned genetic program (red), races against the untuned genetic program (blue).

Inference Pipeline

Results

Method	Max reward	Mean reward	Mean # function evaluations to reach a reward of 900
Nelder-Mead	713.2	N/A	N/A
NN + Generation-based Evolution Strategy (3 repetitions)	925.7	923.3	9.8k
NN + Iterative Evolution Strategy (4 repetitions)	915.5	906.3	50.3k
Genetic Program (4 repetitions)	928.2	917.2	5.8k
Tuned Genetic Program	930.6	N/A	N/A

Our best results were achieved by fine-tuning the constants of the best genetic program using evolution strategies.
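The idea of tuning constants with an evolution strategy can be illustrated with a minimal sketch. It is not the repository's implementation: `toy_reward`, `tune_constants`, and every parameter value below are hypothetical, and the toy objective only stands in for the reward that an actual race episode would return. The sketch uses a simple elitist (1+λ) scheme with a shrinking step size.

```python
import random


def toy_reward(constants):
    # Hypothetical stand-in for an episode reward: highest (0) when the
    # constants match an arbitrary target vector.
    target = [0.5, -1.2, 2.0]
    return -sum((c - t) ** 2 for c, t in zip(constants, target))


def tune_constants(reward_fn, initial, sigma=0.3, population=20,
                   generations=60, seed=0):
    """Elitist (1+lambda) evolution strategy over a vector of constants."""
    rng = random.Random(seed)
    best = list(initial)
    best_reward = reward_fn(best)
    for _ in range(generations):
        # Sample offspring by adding Gaussian noise to the current best.
        offspring = [[c + rng.gauss(0.0, sigma) for c in best]
                     for _ in range(population)]
        rewards = [reward_fn(o) for o in offspring]
        i = max(range(population), key=rewards.__getitem__)
        if rewards[i] > best_reward:
            # Keep the best offspring only if it improves on the parent.
            best, best_reward = offspring[i], rewards[i]
        else:
            # No improvement: shrink the mutation step size.
            sigma *= 0.9
    return best, best_reward


tuned, tuned_reward = tune_constants(toy_reward, initial=[0.0, 0.0, 0.0])
print(tuned, tuned_reward)
```

Because the parent is only replaced by a strictly better offspring, the returned reward is never worse than that of the initial constants. Only reward values are used, so no gradients of the objective are required.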

Environment

The graphics, physics engine, and reward calculation are adapted from OpenAI Gym.

To improve performance, we rewrote the graphics pipeline, yielding a ~40x sequential speedup. Our modifications allow evaluations to run headless and in parallel. We also added a feature that evaluates multiple agents simultaneously, so that a "race" can be visualized. All experiments were performed on the ETH Euler supercomputer.
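Evaluating several agents simultaneously can be pictured as stepping each agent's state in lockstep, so that every frame contains all cars. The sketch below is an assumption-laden toy, not the repository's environment: `CarState`, `step`, `race`, and the one-dimensional track dynamics are all hypothetical.

```python
from dataclasses import dataclass


@dataclass
class CarState:
    position: float = 0.0  # progress along a hypothetical straight track
    velocity: float = 0.0


def step(state, throttle, dt=0.1):
    # Toy dynamics: throttle is clipped to [-1, 1]; cars cannot reverse.
    throttle = max(-1.0, min(1.0, throttle))
    state.velocity = max(0.0, state.velocity + throttle * dt)
    state.position += state.velocity * dt


def race(policies, n_steps=100):
    """Advance every agent by one step per frame, all on the same clock."""
    states = {name: CarState() for name in policies}
    for _ in range(n_steps):
        for name, policy in policies.items():
            step(states[name], policy(states[name]))
        # A renderer could draw all cars here, once per shared frame.
    return states


# Two hypothetical policies: full throttle versus half throttle.
policies = {
    "red": lambda state: 1.0,
    "blue": lambda state: 0.5,
}
results = race(policies)
print({name: round(s.position, 2) for name, s in results.items()})
```

Each agent keeps its own state, so the agents do not interact; sharing the frame counter is what allows them to be drawn together as a race.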

