How to get the true Pareto Front of mountain car environment? #3

HONG-ZI · 2022-12-11T15:10:56Z

Paper [1] give the true Pareto Front of mountain car environment, but it did not present the corresponding computing process. Is the true Pareto Front computed by “Exhaustion”？

[1] P. Vamplew, J. Yearwood, R. Dazeley, and A. Berry, “On the Limitations of Scalarisation for Multi-objective Reinforcement Learning of Pareto Fronts,” in AI 2008: Advances in Artificial Intelligence, vol. 5360.

Amp1874 · 2023-01-27T10:32:40Z

If I remember correctly it was a depth-first search, with leafs terminated if they were inferior to states which had previously been found earlier in the tree. We also used a similar approach to find the Pareto front for the MOPuddleWorld problem, but I no longer trust those results - several people have reported being unable to reproduce them. Unfortunately the code was lost when that research assistant's contract ended.

As a result we've largely moved away from comparing results against the "true front", and instead use other metrics like hypervolume, with appropriately chosen reference points. The exception is the Deep Sea Treasure problem, where it is simple to calculate the actual front.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

How to get the true Pareto Front of mountain car environment? #3

How to get the true Pareto Front of mountain car environment? #3

HONG-ZI commented Dec 11, 2022 •

edited

Loading

Amp1874 commented Jan 27, 2023 •

edited

Loading

How to get the true Pareto Front of mountain car environment? #3

How to get the true Pareto Front of mountain car environment? #3

Comments

HONG-ZI commented Dec 11, 2022 • edited Loading

Amp1874 commented Jan 27, 2023 • edited Loading

HONG-ZI commented Dec 11, 2022 •

edited

Loading

Amp1874 commented Jan 27, 2023 •

edited

Loading