Analysis of AQuaGAIL hyperparameters
For each environment, we sample 1000 different configurations of hyperparameters at random, and report statistics over the configurations.
Success rate learning curves

Hyperparameters influence breakdown
For each hyperparameter, we aggregate all configurations for each
possible value of the hyperparameter and compute the following statistic:
the average of final success rate for the top 50% of configurations (since poorly performing configurations are uninformative). Some
plot bars are "missing", it is because the success rate is 0 for all configurations.
Analysis of discriminator number of layers

Analysis of discriminator number of hidden units per layer

Analysis of regularization

Analysis of discriminator weight decay

Analysis of observation normalization

Analysis of discriminator learning rate

Analysis of GAIL reward balance

Analysis of aquadem learning rate

Analysis of temperature

Analysis of number of actions

Analysis of aquadem input dropout rate

Analysis of aquadem hidden dropout rate

Analysis of DQN learning rate

Analysis of n step

Analysis of epsilon

Analysis of discriminator input dropout rate

Analysis of discriminator hidden dropout rate
