Performs state-driven multi-candidate lookahead planning and ranking. Invoke when selecting the best incident response action under time/risk constraints.
Select the best response action via multi-candidate simulation, rollout-based estimation, and explicit scoring.
rollout_count, depth, penalty weights)Return JSON only:
{
"recommended_action": {},
"ranked_actions": [],
"reasons": {},
"expected_recovery_effect": {},
"risk_alerts": [],
"executable": {
"shell": "",
"api": ""
},
"explainability": {}
}
Score = state_gain - time_penalty - risk_penaltyhallucination=true).