AI Safety Takes Center Stage in Government Regulations

“If you can test and evaluate, you can determine: is this model safe and responsible to deploy or is it not?”

Government officials and tech leaders are emphasizing the importance of testing and evaluating AI models to strike a balance between regulatory frameworks and innovation. The Department of Defense continually tests and evaluates AI models to ensure they align with its Responsible Artificial Intelligence (RAI) Toolkit. The National Institute of Standards and Technology’s (NIST) U.S. AI Safety Institute (AISI) is likewise working to advance the science of AI safety through direct testing of AI systems, with a focus on “frontier” generative AI models, and plans to build a suite of evaluations to assess models’ performance, capabilities, and risks. Industry leaders agree that a regulatory framework grounded in empirical data from testing and evaluation is key to balancing innovation with responsibility. Officials are also stressing the importance of an international perspective on AI safety, with efforts underway to launch a global network of AI Safety Institutes to enable aligned, interoperable standards and evaluations.