optimal-royal-game: Use dynamic programming (policy iteration) to estimate the optimal state-value function for the Royal Game of Ur.
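
For readers unfamiliar with the technique, here is a minimal sketch of policy iteration on a tiny made-up MDP. It is not this repository's code and does not model the Royal Game of Ur: the transition table `P`, the action names `slow` and `fast`, the discount `GAMMA`, and the helpers `q_value`, `evaluate`, and `policy_iteration` are all hypothetical, chosen only to show the alternation between policy evaluation and greedy policy improvement.

```python
# Hypothetical toy MDP (NOT the Royal Game of Ur).
# P[state][action] is a list of (probability, next_state, reward) outcomes.
# State 2 is terminal: it has no actions and its value stays 0.
P = {
    0: {"slow": [(1.0, 1, 0.0)],
        "fast": [(0.8, 2, 1.0), (0.2, 0, 0.0)]},
    1: {"slow": [(1.0, 2, 1.0)],
        "fast": [(0.5, 2, 1.0), (0.5, 0, 0.0)]},
    2: {},
}
GAMMA = 0.9  # assumed discount factor for this sketch


def q_value(V, s, a):
    """Expected return of taking action a in state s, then following V."""
    return sum(p * (r + GAMMA * V[s2]) for p, s2, r in P[s][a])


def evaluate(policy, tol=1e-12):
    """Iterative policy evaluation: sweep until values stop changing."""
    V = {s: 0.0 for s in P}
    while True:
        delta = 0.0
        for s, a in policy.items():
            new = q_value(V, s, a)
            delta = max(delta, abs(new - V[s]))
            V[s] = new
        if delta < tol:
            return V


def policy_iteration():
    """Alternate evaluation and greedy improvement until the policy is stable."""
    # Start from an arbitrary policy: the first listed action in each state.
    policy = {s: next(iter(acts)) for s, acts in P.items() if acts}
    while True:
        V = evaluate(policy)
        stable = True
        for s in policy:
            best = max(P[s], key=lambda a: q_value(V, s, a))
            # Switch only on a strict improvement to avoid cycling on ties.
            if q_value(V, s, best) > q_value(V, s, policy[s]) + 1e-9:
                policy[s] = best
                stable = False
        if stable:
            return policy, V


policy, V = policy_iteration()
print(policy, V)
```

On this toy problem the loop stabilizes after one improvement step. A full game solver would need the same two phases over a far larger state space, with dice-roll probabilities and the opponent's moves folded into the transition model.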