Federation University Australia
Browse

Dataset from ES-C51: Expected Sarsa Based C51 Distributional Reinforcement Learning Algorithm

Download (17.35 kB)
dataset
posted on 2025-10-15, 00:15 authored by Rijul Tandon, Peter VamplewPeter Vamplew, Cameron Foale
<p dir="ltr">This dataset contains the results of experiments comparing the performance of the standard Q-learning based distributional deep reinforcement learning algorithm QL-C51, and a novel variant which uses Expected-Sarsa temporal difference updates (ES-C51). Each algorithm was executed for 10 separate runs with independent seeds on 22 environments (Acrobot, Cartpole, and the Atari-10 environments with and without stochasticity). Results are reported for each run in terms of the mean episodic reward over the last 10% of learning episodes. Full details are in the corresponding paper.</p>

History

Usage metrics

    Federation University Australia

    Licence

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC