Add continuous maze example. #216

mohsen-ghaffari1992 · 2023-09-22T11:10:28Z

@wasowski
Please consider reviewing this example and let me know whether it is what we want.

src/main/scala/symsim/examples/concrete/continuousmaze/ContinuousMaze.scala

wasowski · 2023-10-09T10:03:23Z

src/main/scala/symsim/examples/concrete/continuousmaze/ContinuousMaze.scala

+  val TimeHorizon: Int = 2000
+
+  def isFinal (s: MazeState): Boolean =
+    s == MazeState (MAZE_LENGTH, MAZE_WIDTH) || s == MazeState (MAZE_LENGTH, MAZE_WIDTH - 1)


Should isFinal be defined on MazeState or on MazeObservableState?

According to the agent class, it should be defined on MazeState.

Then I think the code is incorrect, either here or in the agent class. If isFinal is meant to be checked on the observable state, then the set of final states should be an entire square, not just two points. And if symsim calls isFinal on the observable state, then it should require the observable state type, not the system state type. One side or the other is confused: it does not make sense to check an observable-state condition on a continuous state ...

Maybe I wasn't clear: I meant that it is defined on MazeState. If it were defined on MazeObservableState, you would be right.
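
To make the point about "an entire square, not just two points" concrete, here is a minimal sketch of an isFinal that is defined on the continuous state but tests membership in a region. Everything in it is an assumption for illustration: the field names, the maze dimensions, the observe helper, and the choice of flooring as the discretization are hypothetical and not taken from the PR.

```scala
// Hypothetical stand-ins for the PR's types; field names are assumptions.
case class MazeState (x: Double, y: Double)
case class MazeObservableState (x: Int, y: Int)

// Assumed maze dimensions, chosen only for this sketch.
val MazeLength: Int = 4
val MazeWidth: Int = 3

// The two final cells, mirroring the two final points in the diff above.
val FinalCells: Set[MazeObservableState] =
  Set (MazeObservableState (MazeLength, MazeWidth),
       MazeObservableState (MazeLength, MazeWidth - 1))

// Discretize a continuous position to the grid cell that contains it.
def observe (s: MazeState): MazeObservableState =
  MazeObservableState (math.floor (s.x).toInt, math.floor (s.y).toInt)

// Final if the continuous position lies anywhere inside a final cell,
// rather than only when it equals one exact point.
def isFinal (s: MazeState): Boolean =
  FinalCells.contains (observe (s))
```

With this shape, a state such as MazeState (4.5, 3.2) counts as final even though it is not exactly equal to any corner point, which avoids relying on exact equality of Double coordinates.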

wasowski · 2023-10-09T10:07:30Z

src/main/scala/symsim/examples/concrete/continuousmaze/ContinuousMaze.scala

+    case MazeState (_, _) => -1.0
+
+
+  def distort (a: MazeAction): Randomized[MazeAction] = a match


In the continuous version of the problem, it would be more natural to change the distance moved, not the direction. But anyhow, how are you going to treat randomization in the symbolic executor?

I have an idea: we may not need to do anything about randomization. Let's discuss it in detail in the meeting. As a short answer: when there is randomization, the behavior of a2 is considered instead of a1. Since we analyze the program for all actions, we should not need to worry about randomized actions.

But here a good randomization would not select a different action, but would result in adding random noise (for instance Gaussian) to the distance travelled.
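
As a rough illustration of that suggestion, the sketch below keeps the chosen action and perturbs only the distance travelled with Gaussian noise. It is a sketch under stated assumptions: it uses scala.util.Random directly instead of symsim's Randomized type, and the names MazeState, MazeAction, StepSize, NoiseStdDev, noisyDistance and move are all hypothetical rather than taken from the PR.

```scala
import scala.util.Random

// Hypothetical stand-ins for the PR's types; names are assumptions.
case class MazeState (x: Double, y: Double)

enum MazeAction:
  case Up, Down, Left, Right

// Assumed parameters, chosen only for this sketch.
val StepSize: Double = 1.0
val NoiseStdDev: Double = 0.1

// Nominal step length plus zero-mean Gaussian noise.
def noisyDistance (rng: Random): Double =
  StepSize + NoiseStdDev * rng.nextGaussian ()

// The direction is exactly the one requested; only the distance is random.
def move (s: MazeState, a: MazeAction, rng: Random): MazeState =
  val d = noisyDistance (rng)
  a match
    case MazeAction.Up    => s.copy (y = s.y + d)
    case MazeAction.Down  => s.copy (y = s.y - d)
    case MazeAction.Left  => s.copy (x = s.x - d)
    case MazeAction.Right => s.copy (x = s.x + d)
```

Unlike a distort function that swaps one action for another, this keeps the action space untouched and moves the randomness into the transition itself, which also means the symbolic executor would have to treat the noise term as an additional input.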

wasowski · 2023-10-09T10:15:15Z

src/test/scala/symsim/examples/concrete/continuousmaze/SarsaExperiments.scala

@@ -0,0 +1,19 @@
+package symsim
+package examples.concrete.continuousmaze


I think the package name should be continuousMaze not continuousmaze.

I tried to keep it consistent with the other packages, e.g. simplemaze, simplebandit, ...
Should I rename it?

wasowski · 2023-10-09T10:16:38Z

I have left my comments, but I am confused why tests for cartpole are failing. Have we added cartpole to main? (I might have simply forgotten...)

mohsen-ghaffari1992 · 2023-10-09T14:31:20Z

@wasowski
Thanks for the comments!
I edited the code based on your comments and pushed the new version.
Regarding cart pole: yes, we have added it to main.

wasowski · 2023-10-11T09:41:19Z

BTW. Andreas tells me that this example is basically the same thing that is known as the random walk example.

mohsen-ghaffari1992 · 2023-10-11T11:15:13Z

Is it?
As far as I know, "random walk" refers to simple-maze type problems. I googled it a little and I still think my understanding is correct.

Add continuous maze.

05a7747

wasowski force-pushed the continuous_maze branch from 7a10ee0 to 05a7747 Compare October 9, 2023 08:36