> ## Documentation Index
> Fetch the complete documentation index at: https://docs.mka1.com/llms.txt
> Use this file to discover all available pages before exploring further.

# Retry an eval run

> Queues a failed or cancelled eval run to retry in place. Completed samples keep their results, generated-but-unscored cancelled samples resume at scoring, unfinished samples are requeued, and runs with no persisted samples are prepared from scratch with the same run ID.


## OpenAPI

````yaml https://apigw.mka1.com/speakeasy.json post /api/v1/llm/evals/runs/{run_id}/retry
openapi: 3.1.1
info:
  title: MKA1 API
  version: 1.1.0
  description: >-
    The MKA1 API is a RESTful API that provides access to the MKA1 platform.
    Learn how to get started with the API and the TypeScript SDK
    [here](https://mka1.apidocumentation.com/guides/getting-started).
  license:
    name: Proprietary
servers:
  - url: https://apigw.mka1.com
    description: MKA1 API Gateway
  - url: /
    description: Relative server URL (configurable via SDK constructor)
security: []
tags:
  - name: Resource Authorization
    description: >-
      Manage permissions for LLM resources. Create resources, grant/revoke
      permissions, and delete resources. Only resource owners can grant, revoke,
      or delete permissions.
    x-displayName: Resource Authorization
  - name: Embeddings
    description: >-
      Text embedding API endpoints for generating vector representations of
      text. Create semantic embeddings for search, clustering, and similarity
      matching using various embedding models.
    x-displayName: Embeddings
  - name: Feedback
    description: >-
      User feedback API for rating and commenting on chat completions. Collect
      thumbs up/down ratings and detailed feedback to improve model responses
      and track user satisfaction.
    x-displayName: Feedback
  - name: Images
    description: >-
      Image generation API endpoints for creating images from text descriptions.
      Generate images with control over size, quality, and style.
    x-displayName: Images
  - name: MCP Vault
    description: >-
      MCP vault API for storing user-owned MCP server configurations and
      encrypted credentials. Agents reference vault IDs so secrets are resolved
      only at tool execution time.
    x-displayName: MCP Vault
  - name: Speech
    description: >-
      Speech API endpoints for audio processing. Convert text to
      natural-sounding speech (TTS) or transcribe speech to text (STT) in
      different languages.
    x-displayName: Speech
  - name: Usage
    description: >-
      Usage tracking and analytics API for monitoring token consumption, request
      counts, and cost analysis. View detailed statistics per user, model, and
      time period.
    x-displayName: Usage
  - name: Extract
    description: >-
      Structured data extraction API for extracting information from files.
      Define JSON schemas to extract structured data from images, PDFs, and
      documents. Supports reusable schema templates.
    x-displayName: Extract
  - name: Text Classification
    description: >-
      Text classification API for categorizing text into predefined labels. Use
      AI models to classify text content for sentiment analysis, topic
      categorization, and content moderation.
    x-displayName: Text Classification
  - name: Responses
    description: >-
      Agent-powered responses API for creating AI agents with autonomous tool
      usage. Build conversational assistants that can use web search, file
      operations, image generation, code execution, computer use simulation, and
      MCP integrations. Supports background processing, streaming, and real-time
      status tracking.
    x-displayName: Responses
  - name: Files
    description: >-
      File management API for uploading, storing, and managing files with
      automatic expiration and S3 integration. Upload files that can be used
      with Assistants, Vector Stores, and other features. Files are stored in S3
      with metadata tracked in PostgreSQL. Supports automatic cleanup of expired
      files.
    x-displayName: Files
  - name: Vector Stores
    description: >-
      Vector store API for storing and searching documents using embeddings.
      Create vector stores, upload files with automatic chunking and embedding
      generation, and perform semantic search. Files are processed
      asynchronously using Temporal workflows for durability. Supports automatic
      cleanup of expired stores and LanceDB for efficient vector storage.
    x-displayName: Vector Stores
  - name: Conversations
    description: >-
      Conversation management API for storing and retrieving conversation state
      across Response API calls. Create conversations, add items (user messages,
      assistant messages, system messages), and maintain conversation history.
      Supports metadata tracking and multi-turn dialogue state management.
    x-displayName: Conversations
  - name: Guardrails
    description: >-
      AI safety guardrails API for configuring content moderation and security
      policies. Set up ban word lists, prompt injection detection, and system
      prompt leakage prevention. Guardrails apply to all requests from an
      account and can be tested before deployment.
    x-displayName: Guardrails
  - name: Models
    description: >-
      Model listing API for discovering available models. Returns model IDs,
      ownership, and metadata for all registered models in the gateway.
    x-displayName: Models
  - name: Skills
    description: >-
      Skills API for managing versioned bundles of instructions and files
      following the Agent Skills standard. Create, version, and download
      reusable skill packages that include SKILL.md manifests for agent
      environments.
    x-displayName: Skills
  - name: Chat Completions
    description: >-
      **Deprecated: Use the Responses API (`/api/v1/llm/responses`) instead.**
      Chat completion endpoints with support for streaming, tool calls, and
      multiple providers.
    x-deprecated: true
    x-displayName: Chat Completions
  - name: Batches
    x-displayName: Batches
  - name: Evals
    x-displayName: Evals
  - name: Fine-Tuning
    x-displayName: Fine-Tuning
  - name: Memory Stores
    x-displayName: Memory Stores
  - name: Prompts
    x-displayName: Prompts
  - name: Tables
    description: Manage table schemas, data operations, search, and indices.
    x-displayName: Tables
  - name: Text Store
    description: >-
      Manage text stores with hybrid (vector + full-text) search and grouped
      text sets.
    x-displayName: Text Store
  - name: GraphRAG
    description: >-
      Construct and query lightweight knowledge graphs backed by Redis and
      LanceDB.
    x-displayName: GraphRAG
  - name: API Key
    x-displayName: API Key
  - name: Sessions
    description: Create, inspect, access, and terminate sandbox sessions.
    x-displayName: Sessions
  - name: Browser
    description: >-
      Connect to browser sessions through the gateway port proxy. Browser
      sessions expose a Chrome DevTools Protocol endpoint on port 9222.
    x-displayName: Browser
  - name: Execution
    description: Run shell commands and code inside an existing sandbox session.
    x-displayName: Execution
  - name: Workspace
    description: >-
      Inspect the workspace manifest, transfer files or archives, and download
      generated artifacts.
    x-displayName: Workspace
  - name: Sandbox Usage
    x-displayName: Sandbox Usage
  - name: Agents
    description: Create and manage reusable agent definitions.
    x-displayName: Agents
  - name: Agent Versions
    description: Inspect an agent's configuration history and roll back to a prior version.
    x-displayName: Agent Versions
  - name: Agent Runs
    description: Execute saved agents and inspect persisted run results.
    x-displayName: Agent Runs
  - name: Agent Schedules
    description: Create and manage scheduled or recurring saved agent runs.
    x-displayName: Agent Schedules
  - name: schema-5_other
    x-displayName: other
paths:
  /api/v1/llm/evals/runs/{run_id}/retry:
    post:
      tags:
        - Evals
      summary: Retry an eval run
      description: >-
        Queues a failed or cancelled eval run to retry in place. Completed
        samples keep their results, generated-but-unscored cancelled samples
        resume at scoring, unfinished samples are requeued, and runs with no
        persisted samples are prepared from scratch with the same run ID.
      operationId: retryFailedEvalRun
      parameters:
        - name: run_id
          in: path
          required: true
          schema:
            type: string
          example: eval_run_aa87e2b1112a455b8deabed784372198
        - name: X-On-Behalf-Of
          in: header
          required: false
          schema:
            type: string
          description: Optional external end-user identifier forwarded by the API gateway.
      responses:
        '200':
          description: OK
          content:
            application/json:
              schema:
                type: object
                properties:
                  id:
                    type: string
                  object:
                    const: eval.run
                  suite_id:
                    type: string
                  suite_version:
                    type: integer
                    minimum: -9007199254740991
                    maximum: 9007199254740991
                  suite_version_id:
                    type: string
                  status:
                    $ref: '#/components/schemas/EvalRunStatus'
                  models:
                    type: array
                    items:
                      type: string
                  task_ids:
                    anyOf:
                      - type: array
                        items:
                          type: string
                      - type: 'null'
                  judge_model:
                    anyOf:
                      - type: string
                      - type: 'null'
                  embedding_model:
                    anyOf:
                      - type: string
                      - type: 'null'
                  generation:
                    type: object
                    properties:
                      instructions:
                        type: string
                      temperature:
                        type: number
                        minimum: 0
                        maximum: 2
                      top_p:
                        type: number
                        minimum: 0
                        maximum: 1
                      max_output_tokens:
                        type: integer
                        minimum: 1
                        maximum: 9007199254740991
                      max_gen_toks:
                        type: integer
                        minimum: 1
                        maximum: 9007199254740991
                        description: lm-eval alias for max_output_tokens.
                      stop:
                        anyOf:
                          - type: string
                          - type: array
                            minItems: 1
                            items:
                              type: string
                      until:
                        type: array
                        minItems: 1
                        items:
                          type: string
                        description: lm-eval generate_until stop sequences.
                      max_tool_calls:
                        type: integer
                        minimum: 1
                        maximum: 9007199254740991
                      reasoning: {}
                      text: {}
                      tools:
                        type: array
                        items:
                          anyOf:
                            - {}
                            - type: 'null'
                      tool_choice: {}
                      parallel_tool_calls:
                        type: boolean
                      truncation:
                        enum:
                          - auto
                          - disabled
                      service_tier:
                        enum:
                          - auto
                          - default
                          - flex
                          - priority
                      presence_penalty:
                        type: number
                        minimum: -2
                        maximum: 2
                      frequency_penalty:
                        type: number
                        minimum: -2
                        maximum: 2
                      top_k:
                        type: integer
                        minimum: 0
                        maximum: 9007199254740991
                      min_p:
                        type: number
                        minimum: 0
                        maximum: 1
                      repetition_penalty:
                        type: number
                        minimum: 0
                      do_sample:
                        type: boolean
                      extra_body:
                        type: object
                        propertyNames:
                          type: string
                        additionalProperties: {}
                      chat_template_kwargs:
                        type: object
                        propertyNames:
                          type: string
                        additionalProperties: {}
                      prefill_think:
                        anyOf:
                          - type: boolean
                          - type: string
                      use_cache:
                        type: boolean
                      timeout_seconds:
                        type: integer
                        minimum: 1
                        maximum: 3600
                      max_retries:
                        type: integer
                        minimum: 0
                        maximum: 10
                      max_empty_retries:
                        type: integer
                        minimum: 0
                        maximum: 10
                    additionalProperties: {}
                  request_counts:
                    type: object
                    properties:
                      total:
                        type: integer
                        minimum: -9007199254740991
                        maximum: 9007199254740991
                      completed:
                        type: integer
                        minimum: -9007199254740991
                        maximum: 9007199254740991
                      failed:
                        type: integer
                        minimum: -9007199254740991
                        maximum: 9007199254740991
                    required:
                      - total
                      - completed
                      - failed
                  metrics:
                    anyOf:
                      - type: object
                        propertyNames:
                          type: string
                        additionalProperties: {}
                      - type: 'null'
                  error:
                    anyOf:
                      - type: object
                        propertyNames:
                          type: string
                        additionalProperties: {}
                      - type: 'null'
                  artifact_file_ids:
                    type: array
                    items:
                      type: string
                  metadata:
                    type: object
                    propertyNames:
                      type: string
                      maxLength: 64
                    additionalProperties:
                      type: string
                      maxLength: 512
                  created_at:
                    type: integer
                    minimum: -9007199254740991
                    maximum: 9007199254740991
                  started_at:
                    anyOf:
                      - type: integer
                        minimum: -9007199254740991
                        maximum: 9007199254740991
                      - type: 'null'
                  completed_at:
                    anyOf:
                      - type: integer
                        minimum: -9007199254740991
                        maximum: 9007199254740991
                      - type: 'null'
                  cancelled_at:
                    anyOf:
                      - type: integer
                        minimum: -9007199254740991
                        maximum: 9007199254740991
                      - type: 'null'
                  failed_at:
                    anyOf:
                      - type: integer
                        minimum: -9007199254740991
                        maximum: 9007199254740991
                      - type: 'null'
                required:
                  - id
                  - object
                  - suite_id
                  - suite_version
                  - suite_version_id
                  - status
                  - models
                  - task_ids
                  - judge_model
                  - embedding_model
                  - generation
                  - request_counts
                  - metrics
                  - error
                  - artifact_file_ids
                  - metadata
                  - created_at
                  - started_at
                  - completed_at
                  - cancelled_at
                  - failed_at
              example:
                id: eval_run_aa87e2b1112a455b8deabed784372198
                object: eval.run
                suite_id: eval_suite_aa87e2b1112a455b8deabed784372198
                suite_version: 1
                suite_version_id: eval_sver_aa87e2b1112a455b8deabed784372198
                status: in_progress
                models:
                  - auto
                task_ids: null
                judge_model: auto
                embedding_model: auto
                generation:
                  temperature: 0
                  max_output_tokens: 512
                request_counts:
                  total: 100
                  completed: 10
                  failed: 0
                metrics: null
                error: null
                artifact_file_ids: []
                metadata:
                  purpose: mvp
                created_at: 1704067200
                started_at: 1704067210
                completed_at: null
                cancelled_at: null
                failed_at: null
      security:
        - bearerAuth: []
      x-codeSamples:
        - lang: python
          label: Python (SDK)
          source: |-
            from meetkai_mka1 import SDK


            with SDK(
                bearer_auth="<YOUR_BEARER_TOKEN_HERE>",
            ) as sdk:

                res = sdk.llm.evals.retry_failed_run(run_id="eval_run_aa87e2b1112a455b8deabed784372198")

                # Handle response
                print(res)
        - lang: typescript
          label: Typescript (SDK)
          source: |-
            import { SDK } from "@meetkai/mka1";

            const sdk = new SDK({
              bearerAuth: "<YOUR_BEARER_TOKEN_HERE>",
            });

            async function run() {
              const result = await sdk.llm.evals.retryFailedRun({
                runId: "eval_run_aa87e2b1112a455b8deabed784372198",
              });

              console.log(result);
            }

            run();
        - lang: csharp
          label: CSharp (SDK)
          source: >-
            using MeetKai.MKA1;

            using MeetKai.MKA1.Types.Components;


            var sdk = new SDK(bearerAuth: "<YOUR_BEARER_TOKEN_HERE>");


            var res = await sdk.Llm.Evals.RetryFailedRunAsync(runId:
            "eval_run_aa87e2b1112a455b8deabed784372198");


            // handle response
components:
  schemas:
    EvalRunStatus:
      enum:
        - queued
        - in_progress
        - finalizing
        - completed
        - failed
        - cancelling
        - cancelled
  securitySchemes:
    bearerAuth:
      type: http
      scheme: bearer
      bearerFormat: API Key
      description: >-
        Gateway auth: send `Authorization: Bearer <mka1-api-key>`. For
        multi-user server-side integrations, you can also send `X-On-Behalf-Of:
        <external-user-id>`.

````