Evaluation and Benchmarking (e.g. Atari Learning Environment, OpenAI Gym, GVGAI)