Proactive Decision Making for Autonomous Systems
Adaptive decision-making for mitigation is a decisive step of my dynamic assurance framework. Once a problem has been detected, and the system risk is assessed, this information needs to be proactively used for selecting a suitable control action that mitigates the system’s risk. Simplex Architecture has been widely utilized as a mitigation strategy in Cyber-Physical Systems. This architecture integrates the system to be assured with a safe controller and provides a decision logic to switch between the decisions of these controllers. However, the decision logic is always designed “offline” and used at runtime for control action selection. Such an offline logic cannot optimally balance the safety vs. mission-critical information (e.g., performance) of the system employing the architecture. I have been working towards a “proactive” and “adaptive” decision logic to address this limitation. I have used reinforcement learning in my previous work, and I plan to use Monte Carlo Tree Search (MCTS) to learn an online decision logic at runtime. Recommended Reading