List Benchmarks

Unified benchmark endpoint that aggregates scores from multiple benchmark sources (Artificial Analysis, Design Arena). Filter by source to reproduce the exact shapes from the legacy per-source endpoints, or use task_type to find models suited for specific workloads. Authenticate with any valid OpenRouter API key. Rate-limited to 30 requests/minute per key and 500 requests/day per account.

Authentication

AuthorizationBearer
API key as bearer token in Authorization header

Query parameters

sourceenumOptional
Benchmark source to query. Determines the shape of the returned items. When omitted, returns results from all sources.
Allowed values:
task_typeenumOptional
Filter results by task type. For Artificial Analysis, maps to the corresponding index. For Design Arena, maps to the matching category.
Allowed values:
arenaenumOptional

Design Arena only: arena to query. Defaults to models when source is design-arena.

Allowed values:
categorystringOptional

Design Arena only: category within the arena (e.g. codecategories, uicomponent, gamedev, 3d, dataviz, image, video, svg). When omitted, returns all categories.

max_resultsintegerOptional>=1
Maximum number of items to return. When omitted, all matching results are returned.

Response

Benchmark results filtered by the specified source and optional task type.
datalist of objects
metaobject

Errors

400
Bad Request Error
401
Unauthorized Error
429
Too Many Requests Error
500
Internal Server Error