Proactive Decision Making for Autonomous Systems

Reinforcement Learning Setup used for Simplex decision Logic

Adaptive decision-making for mitigation is a decisive step of my dynamic assurance framework. Once a problem has been detected, and the system risk is assessed, this information needs to be proactively used for selecting a suitable control action that mitigates the system’s risk. Simplex Architecture has been widely utilized as a mitigation strategy in Cyber-Physical Systems. This architecture integrates the system to be assured with a safe controller and provides a decision logic to switch between the decisions of these controllers. However, the decision logic is always designed “offline” and used at runtime for control action selection. Such an offline logic cannot optimally balance the safety vs. mission-critical information (e.g., performance) of the system employing the architecture. I have been working towards a “proactive” and “adaptive” decision logic to address this limitation. I have used reinforcement learning in my previous work, and I plan to use Monte Carlo Tree Search (MCTS) to learn an online decision logic at runtime. Recommended Reading

Shreyas Ramakrishna
Shreyas Ramakrishna
Senior Architect, System Safety Engineer