Multi-agent-Multi-target-path-planning-in-MDPs

This is the code base for the paper F. Nawaz and M. Ornik, "Multi-agent, Multi-target Path Planning in Markov Decision Processes," in IEEE Transactions on Automatic Control, doi: 10.1109/TAC.2023.3286807.

Run the code VI_heur_multi.m for multi-agent planning, and VI_heur_single.m for single-agent planning. The MDP, initial state, and targets should be given as inputs. The functionality of each code is given below.

P_data_stoch.m: generate a random MDP/graph/grid-world.
Partition_Transfers_Swaps.m: Creates a heuristic partition of the target states, given a complete graph and number of agents.
Partitions_opt_k.m: Creates the optimal partition of the target states for the given complete graph and k number of agents.
VI_HT_fn.m: Generates the complete graph that corresponds to a given MDP, initial state and targets, where the weights of the graph are the optimal hitting time between targets and initial state.
VI_heur_fn.m: Computes the suboptimal value function used in the single agent heuristic in Algorithm 1.
VI_heur_multi.m: Implements the heuristic path planning algorithm for multiple agents (Algorithm 1), given the MDP, initial state, number of agents, and the assignment of targets to agents.
VI_heur_single.m: Implements the heuristic path planning algorithm for single agent (Algorithm 1), given the MDP, initial state, and targets.
VI_opt_fn.m: Implements the optimal policy iteration method and computes the optimal value function for single agent.
heur_impl_fn.m: The agent implements the planned path based on the suboptimal value function, until the agent reaches an unvisited target.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

README.md

Multi-agent-Multi-target-path-planning-in-MDPs

Files

README.md

Latest commit

History

README.md

File metadata and controls

Multi-agent-Multi-target-path-planning-in-MDPs