
How to Improve Accuracy in Success and Safety Testing
This appendix refines Proposition 4.1 by improving statistical efficiency in hypothesis testing. It minimizes false positives and optimizes rejection regions for success and guardrail metrics, ensuring more reliable decision-making.