DQN mnist & mountain car performance

Hi,

while working on a PyTorch DQN agent for BSuite experiments, I noticed quite bad results on the mnist and mountain car experiments. I see that a similar question was addressed [here](https://github.com/deepmind/bsuite/issues/14), but the thread was closed.

To further investigate, I created a new conda environment, downloaded and installed a fresh copy of BSuite and ran the DQN agent from the baselines. The only settings I've changed were "bsuite_id" to "SWEEP" and the save path. 

When you compare the results from both agents with the barplot on page 16 of the BSuite manuscript, you notice that both agents have worse performance on mnist and mountaincar and better performance on catch.
![barplot](https://user-images.githubusercontent.com/42319502/77575938-8c0ce980-6ed4-11ea-9d72-3533a41c23f0.png)

Were there any changes on the environments that I missed? The DQN agent from the manuscript did use the default parameters from the baseline directory, correct?

Thanks,
Peter

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

DQN mnist & mountain car performance #20

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

DQN mnist & mountain car performance #20

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions