metadata: id: FINOS-AIR type: GuidanceCatalog gemara-version: "0.20.0" description: "" author: id: finos name: FINOS type: Human version: 0.1.0 mapping-references: - id: NIST-800-53 title: NIST SP 800-53r5 version: rev5 url: "https://nvlpubs.nist.gov/nistpubs/SpecialPublications/NIST.SP.800-53r5.pdf#%5B%7B%22num%22%3A342%2C%22gen%22%3A0%7D%2C%7B%22name%22%3A%22XYZ%22%7D%2C88%2C310%2C0%5D" - id: AIR-PRIN title: Example Principles Document for the Framework version: 0.1.0 title: AI Governance Framework type: Framework front-matter: The following framework has been developed by FINOS (Fintech Open Source Foundation). groups: - id: DET title: Detective description: Detection and Continuous Improvement - id: PREV title: Preventive description: Prevention and Risk Mitigation guidelines: - id: AIR-DET-011 group: DET title: Human Feedback Loop for AI Systems objective: A Human Feedback Loop is a critical detective and continuous improvement mechanism that involves systematically collecting, analyzing, and acting upon feedback provided by human users, subject matter experts (SMEs), or reviewers regarding an AI system's performance, outputs, or behavior. rationale: importance: A Human Feedback Loop is critical for ensuring AI systems operate effectively and safely by incorporating human judgment and expertise into continuous improvement processes. goals: - "Governance Support: Provides data for AI governance bodies to monitor impact and make decisions" statements: - id: AIR-DET-011.1 title: Designing the Feedback Mechanism text: Implementing an effective human feedback loop involves careful design of the mechanism. recommendations: - "Define Intended Use and KPIs:\nObjectives: Clearly document how feedback data will be utilized, such as for prompt fine-tuning, RAG document updates,model/data drift detection, or more advanced uses like Reinforcement Learning from Human Feedback (RLHF).\nKPI Alignment: Design feedback questions and metrics to align with the solution's key performance indicators (KPIs). For example, if accuracy is a KPI, feedback might involve users or SMEs annotating if an answer was correct." - id: AIR-DET-011.2 title: Types of Feedback and Collection Methods text: Implementing an effective human feedback loop involves clear collection processes. recommendations: - "Quantitative Feedback:\nDescription: Involves collecting structured responses that can be easily aggregated and measured, such as numerical ratings (e.g., \"Rate this response on a scale of 1-5 for helpfulness\"), categorical choices (e.g., \"Was this answer: Correct/Incorrect/Partially Correct\"), or binary responses (e.g., thumbs up/down).\nUse Cases: Effective for tracking trends, measuring against KPIs, and quickly identifying areas of high or low performance." see-also: - AIR-DET-015 - AIR-DET-004 - AIR-PREV-005 - id: AIR-DET-004 group: DET title: Example Detective Control 004 objective: Placeholder control for testing references. rationale: importance: Placeholder control for testing references. goals: - "Placeholder goal for testing" - id: AIR-DET-015 group: DET title: Example Detective Control 015 objective: Placeholder control for testing references. rationale: importance: Placeholder control for testing references. goals: - "Placeholder goal for testing" - id: AIR-PREV-005 group: PREV title: Example Preventive Control 005 objective: Placeholder control for testing references. rationale: importance: Placeholder control for testing references. goals: - "Placeholder goal for testing"