Model | Hyperparameter | Explored Space |
---|---|---|
Baseline MLP | Learning rate | {1e-5, 1r-4, 1e-3, 1e-2, 0.1} |
Hidden size | {5, 10, 20, 30, 40, 50, 60, 70, 80, 90, 100, 110, 120, 130} | |
Weight decay | {1e-5, 1r-4, 1e-3, 1e-2, 0.1} | |
Dropout | {0.3, 0.4, 0.5, 0.6, 0.7, 0.8, 0.9} |