AI Fairness & Bias Mitigation

Define Protected Characteristics: Explicitly list the sensitive attributes considered for bias analysis. This must be based on applicable EU non-discrimination law and the specific context of the AI system's use (e.g., demographic groups, lighting conditions, or rare classes acting as proxies for class imbalance).

Select Fairness Metrics: Standardize the use of specific, mathematically defined fairness metrics appropriate for the use case. Acceptable metrics include, but are not limited to:

Demographic Parity
Equalized Odds
Equal Opportunity
Per-Class F1-Score
Accuracy Disparity

Establish Quantifiable Thresholds: Define pre-determined, quantifiable thresholds for the chosen metrics that the system MUST meet before deployment (e.g., "The F1-score for any individual class must not fall below 0.70"). These must be set before model testing begins.

Generative AI & LLM Benchmarking: For Large Language Models (LLMs) and text-generation systems where traditional mathematical fairness metrics do not directly apply, you must leverage established LLM benchmarking frameworks. Refer to and implement the tests outlined in the GRIP on LLMs repository to evaluate the model against known societal biases, toxicity, and stereotyping datasets.

What is the standard for Fairness & Bias Mitigation in AI?

When and for whom is this standard applicable?

What is required?

1. Fairness Framework

2. Bias Analysis and Mitigation Strategies

3. Fairness Testing & Counterfactuals

4. Continuous Bias Monitoring

What to avoid?

Considerations

Further reading