Blackbox external model access | ✓ |
Capabilities demonstration | ✓ |
Capabilities description | ✓ |
Centralized model documentation | ✓ |
Evaluation of capabilities | ✓ |
External model access protocol | ✓ |
External reproducibility of capabilities evaluation | ✓ |
External reproducibility of intentional harm evaluation | - |
External reproducibility of mitigations evaluation | - |
External reproducibility of trustworthiness evaluation | - |
External reproducibility of unintentional harm evaluation | - |
Full external model access | ✓ |
Inference compute evaluation | - |
Inference duration evaluation | ✓ |
Input modality | ✓ |
Intentional harm evaluation | - |
Limitations demonstration | - |
Limitations description | ✓ |
Mitigations demonstration | - |
Mitigations description | - |
Mitigations evaluation | - |
Model architecture | ✓ |
Asset license | ✓ |
Model components | ✓ |
Model size | ✓ |
Output modality | ✓ |
Risks demonstration | - |
Risks description | - |
Third party capabilities evaluation | - |
Third party evaluation of limitations | ✓ |
Third party mitigations evaluation | - |
Third party risks evaluation | - |
Trustworthiness evaluation | - |
Unintentional harm evaluation | - |