Popular repositories
- inspect_ai Public
Inspect: A framework for large language model evaluations
- control-arena Public
ControlArena is a suite of realistic settings, mimicking complex deployment environments, for running control evaluations. This is an alpha release; we welcome feedback.
- as-evaluation-standard Public template
A repository that holds templates, examples, and tests to help external parties submit tasks to AISI that conform with the Autonomous Systems Team's Task Standard
- inspect_k8s_sandbox Public
A Kubernetes sandbox environment for use with inspect_ai
Repositories
- control-arena Public
ControlArena is a suite of realistic settings, mimicking complex deployment environments, for running control evaluations. This is an alpha release; we welcome feedback.
- dsit-dvs-register Public
- dsit-dvs-register-admin Public
- beis-spl-eligibility-tool Public
Eligibility tool for Shared Parental Leave (SPL) and Statutory Shared Parental Pay (ShPP)
- beis-spl-planner Public
Planner tool for Shared Parental Leave (SPL) and Statutory Shared Parental Pay (ShPP)