Blackbox External Model Access | ✓ |
Capabilities demonstration | ✓ |
Capabilities description | ✓ |
Centralized model documentation | ✓ |
Evaluation of capabilities | ✓ |
External model access protocol | - |
External reproducibility of capabilities evaluation | ✓ |
External reproducibility of intentional harm evaluation | - |
External reproducibility of mitigations evaluation | - |
External reproducibility of trustworthiness evaluation | - |
External reproducibility of unintentional harm evaluation | - |
Full external model access | - |
Inference compute evaluation | - |
Inference duration evaluation | - |
Input modality | ✓ |
Intentional harm evaluation | - |
Limitations demonstration | - |
Limitations description | ✓ |
Mitigations demonstration | - |
Mitigations description | ✓ |
Mitigations evaluation | ✓ |
Model architecture | ✓ |
Asset license | - |
Model components | - |
Model size | - |
Output modality | ✓ |
Risks demonstration | - |
Risks description | ✓ |
Third party capabilities evaluation | - |
Third party evaluation of limitations | ✓ |
Third party mitigations evaluation | - |
Third party risks evaluation | ✓ |
Trustworthiness evaluation | - |
Unintentional harm evaluation | - |