Saltar al contenido principal
POST
/
api
/
v1
/
llm
/
evals
/
runs
Python (SDK)
from meetkai_mka1 import SDK, models


with SDK(
    bearer_auth="<YOUR_BEARER_TOKEN_HERE>",
) as sdk:

    res = sdk.llm.evals.create_run(suite_id="eval_suite_aa87e2b1112a455b8deabed784372198", models=[
        "auto",
    ], judge_model="auto", embedding_model="auto", generation=models.EvalGenerationConfig(
        temperature=0,
        max_gen_toks=512,
        until=[
            "<|endoftext|>",
        ],
        do_sample=False,
        chat_template_kwargs={
            "enable_thinking": False,
        },
        timeout_seconds=120,
        max_retries=2,
        max_empty_retries=1,
    ), generation_concurrency=4, grader_concurrency=2, max_workflow_sample_activities=5000, metadata={
        "purpose": "mvp",
    })

    # Handle response
    print(res)
{
  "id": "eval_run_aa87e2b1112a455b8deabed784372198",
  "object": "eval.run",
  "suite_id": "eval_suite_aa87e2b1112a455b8deabed784372198",
  "suite_version": 1,
  "suite_version_id": "eval_sver_aa87e2b1112a455b8deabed784372198",
  "status": "in_progress",
  "models": [
    "auto"
  ],
  "task_ids": null,
  "judge_model": "auto",
  "embedding_model": "auto",
  "generation": {
    "temperature": 0,
    "max_output_tokens": 512
  },
  "request_counts": {
    "total": 100,
    "completed": 10,
    "failed": 0
  },
  "metrics": null,
  "error": null,
  "artifact_file_ids": [],
  "metadata": {
    "purpose": "mvp"
  },
  "created_at": 1704067200,
  "started_at": 1704067210,
  "completed_at": null,
  "cancelled_at": null,
  "failed_at": null
}

Autorizaciones

Authorization
string
header
requerido

Gateway auth: send Authorization: Bearer <mka1-api-key>. For multi-user server-side integrations, you can also send X-On-Behalf-Of: <external-user-id>.

Encabezados

X-On-Behalf-Of
string

Optional external end-user identifier forwarded by the API gateway.

Cuerpo

application/json
suite_id
string
requerido
models
string[]
requerido
Required array length: 1 - 20 elements
Minimum string length: 1
suite_version
integer
Rango requerido: 1 <= x <= 9007199254740991
task_ids
string[]
Minimum array length: 1
judge_model
string
embedding_model
string
generation
object
generation_concurrency
integer
Rango requerido: 1 <= x <= 256
concurrency
integer
Rango requerido: 1 <= x <= 256
grader_concurrency
integer
Rango requerido: 1 <= x <= 256
max_samples_per_task
integer
Rango requerido: 1 <= x <= 9007199254740991
max_workflow_sample_activities
integer

Reservas máximas de actividad por etapa de muestra por ejecución de flujo de trabajo temporal antes de continuar como nuevo.

Rango requerido: 100 <= x <= 50000
metadata
object

Respuesta

200 - application/json

Está bien

id
string
requerido
object
any
requerido
suite_id
string
requerido
suite_version
integer
requerido
Rango requerido: -9007199254740991 <= x <= 9007199254740991
suite_version_id
string
requerido
status
enum<string>
requerido
Opciones disponibles:
queued,
in_progress,
finalizing,
completed,
failed,
cancelling,
cancelled
models
string[]
requerido
task_ids
string[] | null
requerido
judge_model
string | null
requerido
embedding_model
string | null
requerido
generation
object
requerido
request_counts
object
requerido
metrics
object
requerido
error
object
requerido
artifact_file_ids
string[]
requerido
metadata
object
requerido
created_at
integer
requerido
Rango requerido: -9007199254740991 <= x <= 9007199254740991
started_at
integer | null
requerido
Rango requerido: -9007199254740991 <= x <= 9007199254740991
completed_at
integer | null
requerido
Rango requerido: -9007199254740991 <= x <= 9007199254740991
cancelled_at
integer | null
requerido
Rango requerido: -9007199254740991 <= x <= 9007199254740991
failed_at
integer | null
requerido
Rango requerido: -9007199254740991 <= x <= 9007199254740991