
Extended Data Fig. 2: Learning the distribution of returns improves performance of deep RL agents across multiple domains.

From: A distributional code for value in dopamine-based reinforcement learning


a, DQN and distributional TD agents share an identical nonlinear network architecture. b, c, After training a classical or a distributional DQN agent on MsPacman, we freeze the agent and then train a separate linear decoder to reconstruct frames from the agent’s final-layer representation. Reconstructions are shown for each agent. The distributional model’s representation supports substantially better reconstruction. d, At a single frame of MsPacman (not shown), the agent’s value predictions together represent a probability distribution over future rewards. Reward predictions of individual RPE channels are shown as tick marks, ranging from pessimistic (blue) to optimistic (red), and a kernel density estimate is shown in black. e, Atari-57 experiments, with single runs of prioritized experience replay (ref. 40) and double DQN (ref. 41) agents shown for reference. The benefit of distributional learning exceeds that of these other popular innovations. f, g, The performance pay-off of distributional RL can be seen across a wide diversity of tasks. Here we give another example, a humanoid motor-control task in the MuJoCo physics simulator. A prioritized experience replay agent (ref. 14) is shown for reference. Traces show individual runs; averages are shown in bold.
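
The decoding analysis in panels b and c can be illustrated with a minimal sketch: fit a linear (here, closed-form ridge) decoder from a frozen agent’s final-layer features to the corresponding frames, then reconstruct. The arrays, dimensions and regularization strength below are illustrative placeholders, not the paper’s data or exact method.

import numpy as np

rng = np.random.default_rng(0)

# Placeholder data: final-layer activations of a frozen agent and the
# flattened greyscale frames they were computed from.
n_frames, n_features = 1000, 512          # e.g. the size of DQN's final hidden layer
frame_shape = (84, 84)                    # standard Atari preprocessing resolution
features = rng.standard_normal((n_frames, n_features))
frames = rng.random((n_frames, frame_shape[0] * frame_shape[1]))

# Ridge-regression decoder: W = (X^T X + lam I)^{-1} X^T Y, with a bias column.
lam = 1e-2
X = np.hstack([features, np.ones((n_frames, 1))])
W = np.linalg.solve(X.T @ X + lam * np.eye(X.shape[1]), X.T @ frames)

# Reconstructed frames and mean reconstruction error.
reconstructions = (X @ W).reshape(n_frames, *frame_shape)
mse = np.mean((X @ W - frames) ** 2)
print(f"mean reconstruction error: {mse:.4f}")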
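Panel d summarizes a set of per-channel reward predictions with a kernel density estimate. The sketch below shows one way to produce such a summary, assuming the channel predictions are available as a simple array; the values used here are made up for illustration.

import numpy as np
from scipy.stats import gaussian_kde

# Hypothetical reward predictions of individual channels,
# ordered from pessimistic to optimistic (the tick marks in panel d).
channel_predictions = np.array([0.1, 0.3, 0.4, 0.6, 0.9, 1.4, 2.0, 3.1])

# Kernel density estimate over the predictions (the black curve in panel d).
kde = gaussian_kde(channel_predictions)
grid = np.linspace(channel_predictions.min() - 1.0, channel_predictions.max() + 1.0, 200)
density = kde(grid)

print(grid[np.argmax(density)])  # mode of the implied return distribution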
