mdft

Here is 1 public repository matching this topic...

Rahgooy / soft_constraint_irl

In this project, we use the maximum entropy principle in Inverse reinforcement learning to learn soft constraints from demonstrations obtained from an agent interacting with a non-deterministic MDP. In the second part of this project, we implement various strategies (orchestrators) to mix conflicting policies (e.g. pragmatic vs ethical). In one …

reinforcement-learning gradient-descent cognitive-models mdft