MDP: Dynamic Supplier Selection
Model-Based RL: Learning Machine Degradation Dynamics
Monte Carlo RL: Atlanta Commute
RL for Warehouse Robot Routing: SARSA vs. Q-Learning (interactive)