美國國家標準與技術研究院(National Institute of Standards and Technology,NIST)於2023年1月26日發布了人工智能風險管理框架(AI Risk Management Framework,AI RMF 1.0)。這個框架旨在幫助組織更好地管理與人工智能相關的風險,提高AI系統的可信賴性。
NIST AI RMF 是一個自願性框架,旨在改善組織在設計、開發、使用和評估AI產品、服務和系統時納入可信賴性考量的能力。
根據NIST的框架,可信賴的AI系統應具備以下特性:
Valid & Reliable (有效性和可靠性):
Validation is the confirmation, through the provision of objective evidence, that the requirements for a specific intended use or application have been fulfilled
Reliability is defined in the same standard as the "ability of an item to perform as required, without failure, for a given time interval, under given conditions"
Safe (安全性):
Safe: AI systems should "not under defined conditions, lead to a state in which human life, health, property, or the environment is endangered".
Secure and Resilient (安全和彈性):
Resilient: AI systems can withstand unexpected adverse events or unexpected changes in their environment or use – or if they can maintain their functions and structure in the face of internal and external change and degrade safely and gracefully when this is necessary.
Secure: AI systems that can maintain confidentiality, integrity, and availability through protection mechanisms that prevent unauthorized access and use.
Accountable & Transparent (問責和透明):
Transparency: It reflects the extent to which information about an AI system and its outputs is available to individuals interacting with such a system – regardless of whether they are even aware that they are doing so.
Explainable and Interpretable (可解釋和可解讀):
Explainability: It refers to a representation of the mechanisms underlying AI systems’ operation.
Interpretability: It refers to the meaning of AI systems’ output in the context of their designed functional purposes
隱私增強(Privacy-Enhanced):
Privacy refers generally to the norms and practices that help to safeguard human autonomy, identity, and dignity.
These norms and practices typically address freedom from intrusion, limiting observation, or individuals’ agency to consent to disclosure or control of facets of their identities (e.g., body, data, reputation).
公平性和有害偏見管理(Fair with Harmful Bias Managed):
Fairness includes concerns for equality and equity by addressing issues such as harmful bias and discrimination.