> ## Documentation Index
> Fetch the complete documentation index at: https://docs.mka1.com/llms.txt
> Use this file to discover all available pages before exploring further.

# Import historical eval results from Hugging Face

> Imports simplified historical samples.jsonl-style eval results from a Hugging Face dataset repository into a completed eval suite run shape. Aggregates are recomputed from samples and include canonical score metrics for leaderboards.


## OpenAPI

````yaml https://apigw.mka1.com/speakeasy.json post /api/v1/llm/evals/imports/huggingface-results
openapi: 3.1.1
info:
  title: MKA1 API
  version: 1.1.0
  description: >-
    The MKA1 API is a RESTful API that provides access to the MKA1 platform.
    Learn how to get started with the API and the TypeScript SDK
    [here](https://mka1.apidocumentation.com/guides/getting-started).
  license:
    name: Proprietary
servers:
  - url: https://apigw.mka1.com
    description: MKA1 API Gateway
  - url: /
    description: Relative server URL (configurable via SDK constructor)
security: []
tags:
  - name: Resource Authorization
    description: >-
      Manage permissions for LLM resources. Create resources, grant/revoke
      permissions, and delete resources. Only resource owners can grant, revoke,
      or delete permissions.
    x-displayName: Resource Authorization
  - name: Embeddings
    description: >-
      Text embedding API endpoints for generating vector representations of
      text. Create semantic embeddings for search, clustering, and similarity
      matching using various embedding models.
    x-displayName: Embeddings
  - name: Feedback
    description: >-
      User feedback API for rating and commenting on chat completions. Collect
      thumbs up/down ratings and detailed feedback to improve model responses
      and track user satisfaction.
    x-displayName: Feedback
  - name: Images
    description: >-
      Image generation API endpoints for creating images from text descriptions.
      Generate images with control over size, quality, and style.
    x-displayName: Images
  - name: MCP Vault
    description: >-
      MCP vault API for storing user-owned MCP server configurations and
      encrypted credentials. Agents reference vault IDs so secrets are resolved
      only at tool execution time.
    x-displayName: MCP Vault
  - name: Speech
    description: >-
      Speech API endpoints for audio processing. Convert text to
      natural-sounding speech (TTS) or transcribe speech to text (STT) in
      different languages.
    x-displayName: Speech
  - name: Usage
    description: >-
      Usage tracking and analytics API for monitoring token consumption, request
      counts, and cost analysis. View detailed statistics per user, model, and
      time period.
    x-displayName: Usage
  - name: Extract
    description: >-
      Structured data extraction API for extracting information from files.
      Define JSON schemas to extract structured data from images, PDFs, and
      documents. Supports reusable schema templates.
    x-displayName: Extract
  - name: Text Classification
    description: >-
      Text classification API for categorizing text into predefined labels. Use
      AI models to classify text content for sentiment analysis, topic
      categorization, and content moderation.
    x-displayName: Text Classification
  - name: Responses
    description: >-
      Agent-powered responses API for creating AI agents with autonomous tool
      usage. Build conversational assistants that can use web search, file
      operations, image generation, code execution, computer use simulation, and
      MCP integrations. Supports background processing, streaming, and real-time
      status tracking.
    x-displayName: Responses
  - name: Files
    description: >-
      File management API for uploading, storing, and managing files with
      automatic expiration and S3 integration. Upload files that can be used
      with Assistants, Vector Stores, and other features. Files are stored in S3
      with metadata tracked in PostgreSQL. Supports automatic cleanup of expired
      files.
    x-displayName: Files
  - name: Vector Stores
    description: >-
      Vector store API for storing and searching documents using embeddings.
      Create vector stores, upload files with automatic chunking and embedding
      generation, and perform semantic search. Files are processed
      asynchronously using Temporal workflows for durability. Supports automatic
      cleanup of expired stores and LanceDB for efficient vector storage.
    x-displayName: Vector Stores
  - name: Conversations
    description: >-
      Conversation management API for storing and retrieving conversation state
      across Response API calls. Create conversations, add items (user messages,
      assistant messages, system messages), and maintain conversation history.
      Supports metadata tracking and multi-turn dialogue state management.
    x-displayName: Conversations
  - name: Guardrails
    description: >-
      AI safety guardrails API for configuring content moderation and security
      policies. Set up ban word lists, prompt injection detection, and system
      prompt leakage prevention. Guardrails apply to all requests from an
      account and can be tested before deployment.
    x-displayName: Guardrails
  - name: Models
    description: >-
      Model listing API for discovering available models. Returns model IDs,
      ownership, and metadata for all registered models in the gateway.
    x-displayName: Models
  - name: Skills
    description: >-
      Skills API for managing versioned bundles of instructions and files
      following the Agent Skills standard. Create, version, and download
      reusable skill packages that include SKILL.md manifests for agent
      environments.
    x-displayName: Skills
  - name: Chat Completions
    description: >-
      **Deprecated: Use the Responses API (`/api/v1/llm/responses`) instead.**
      Chat completion endpoints with support for streaming, tool calls, and
      multiple providers.
    x-deprecated: true
    x-displayName: Chat Completions
  - name: Batches
    x-displayName: Batches
  - name: Evals
    x-displayName: Evals
  - name: Fine-Tuning
    x-displayName: Fine-Tuning
  - name: Memory Stores
    x-displayName: Memory Stores
  - name: Prompts
    x-displayName: Prompts
  - name: Tables
    description: Manage table schemas, data operations, search, and indices.
    x-displayName: Tables
  - name: Text Store
    description: >-
      Manage text stores with hybrid (vector + full-text) search and grouped
      text sets.
    x-displayName: Text Store
  - name: GraphRAG
    description: >-
      Construct and query lightweight knowledge graphs backed by Redis and
      LanceDB.
    x-displayName: GraphRAG
  - name: API Key
    x-displayName: API Key
  - name: Sessions
    description: Create, inspect, access, and terminate sandbox sessions.
    x-displayName: Sessions
  - name: Browser
    description: >-
      Connect to browser sessions through the gateway port proxy. Browser
      sessions expose a Chrome DevTools Protocol endpoint on port 9222.
    x-displayName: Browser
  - name: Execution
    description: Run shell commands and code inside an existing sandbox session.
    x-displayName: Execution
  - name: Workspace
    description: >-
      Inspect the workspace manifest, transfer files or archives, and download
      generated artifacts.
    x-displayName: Workspace
  - name: Sandbox Usage
    x-displayName: Sandbox Usage
  - name: Agents
    description: Create and manage reusable agent definitions.
    x-displayName: Agents
  - name: Agent Versions
    description: Inspect an agent's configuration history and roll back to a prior version.
    x-displayName: Agent Versions
  - name: Agent Runs
    description: Execute saved agents and inspect persisted run results.
    x-displayName: Agent Runs
  - name: Agent Schedules
    description: Create and manage scheduled or recurring saved agent runs.
    x-displayName: Agent Schedules
  - name: schema-5_other
    x-displayName: other
paths:
  /api/v1/llm/evals/imports/huggingface-results:
    post:
      tags:
        - Evals
      summary: Import historical eval results from Hugging Face
      description: >-
        Imports simplified historical samples.jsonl-style eval results from a
        Hugging Face dataset repository into a completed eval suite run shape.
        Aggregates are recomputed from samples and include canonical score
        metrics for leaderboards.
      operationId: importHistoricalEvalResults
      parameters:
        - name: X-HuggingFace-Token
          in: header
          required: true
          description: >-
            Caller-owned Hugging Face token with read access to the source
            dataset repository. Gateway does not use its server-side Hugging
            Face token for historical imports.
          schema:
            type: string
          example: hf_your_read_token
        - name: X-On-Behalf-Of
          in: header
          required: false
          schema:
            type: string
          description: Optional external end-user identifier forwarded by the API gateway.
      requestBody:
        required: true
        content:
          application/json:
            schema:
              $ref: '#/components/schemas/ImportHistoricalEvalResultsRequest'
            example:
              source:
                type: huggingface
                repo: meetkai/lm-eval-harness-result
                revision: main
                path_prefix: qwen/ur/
              suite_name: MKA1 historical eval results
              create_missing_tasks: true
              metadata:
                owner: eval-team
      responses:
        '200':
          description: OK
          content:
            application/json:
              schema:
                $ref: '#/components/schemas/EvalHistoricalImportObject'
              example:
                object: eval.import
                source:
                  type: huggingface
                  repo: meetkai/lm-eval-harness-result
                  revision: main
                suite_id: eval_suite_aa87e2b1112a455b8deabed784372198
                suite_version: 1
                created_suite: true
                created_suite_version: true
                dry_run: false
                discovered_files: 30
                imported_runs: 30
                skipped_runs: 0
                imported_samples: 8870
      security:
        - bearerAuth: []
      x-codeSamples:
        - lang: python
          label: Python (SDK)
          source: |-
            from meetkai_mka1 import SDK


            with SDK(
                bearer_auth="<YOUR_BEARER_TOKEN_HERE>",
            ) as sdk:

                res = sdk.llm.evals.import_historical_results(x_hugging_face_token="hf_your_read_token", source={
                    "path_prefix": "qwen/ur/",
                }, suite_name="MKA1 historical eval results", create_missing_tasks=True, dry_run=False, metadata={
                    "owner": "eval-team",
                })

                # Handle response
                print(res)
        - lang: typescript
          label: Typescript (SDK)
          source: |-
            import { SDK } from "@meetkai/mka1";

            const sdk = new SDK({
              bearerAuth: "<YOUR_BEARER_TOKEN_HERE>",
            });

            async function run() {
              const result = await sdk.llm.evals.importHistoricalResults({
                xHuggingFaceToken: "hf_your_read_token",
                importHistoricalEvalResultsRequest: {
                  source: {
                    pathPrefix: "qwen/ur/",
                  },
                  suiteName: "MKA1 historical eval results",
                  metadata: {
                    "owner": "eval-team",
                  },
                },
              });

              console.log(result);
            }

            run();
        - lang: csharp
          label: CSharp (SDK)
          source: |-
            using MeetKai.MKA1;
            using MeetKai.MKA1.Types.Components;
            using System.Collections.Generic;

            var sdk = new SDK(bearerAuth: "<YOUR_BEARER_TOKEN_HERE>");

            var res = await sdk.Llm.Evals.ImportHistoricalResultsAsync(
                xHuggingFaceToken: "hf_your_read_token",
                body: new MeetKai.MKA1.Types.Components.ImportHistoricalEvalResultsRequest() {
                    Source = new ImportHistoricalEvalResultsRequestSource() {
                        PathPrefix = "qwen/ur/",
                    },
                    SuiteName = "MKA1 historical eval results",
                    Metadata = new Dictionary<string, string>() {
                        { "owner", "eval-team" },
                    },
                }
            );

            // handle response
components:
  schemas:
    ImportHistoricalEvalResultsRequest:
      type: object
      properties:
        source:
          type: object
          properties:
            type:
              const: huggingface
              default: huggingface
            repo:
              type: string
              pattern: ^[A-Za-z0-9][A-Za-z0-9_.-]*\/[A-Za-z0-9][A-Za-z0-9_.-]*$
              default: meetkai/lm-eval-harness-result
            revision:
              type: string
              minLength: 1
              default: main
            path_prefix:
              type: string
              minLength: 1
            files:
              type: array
              minItems: 1
              maxItems: 100
              items:
                type: string
                minLength: 1
          default:
            type: huggingface
            repo: meetkai/lm-eval-harness-result
            revision: main
        suite_id:
          type: string
          description: >-
            Existing suite to import into. Missing tasks are appended as a new
            version by default.
        suite_name:
          type: string
          minLength: 1
          maxLength: 255
          default: Historical lm-eval harness results
        suite_description:
          type: string
          maxLength: 10000
        create_missing_tasks:
          type: boolean
          default: true
        dry_run:
          type: boolean
          default: false
        limit_files:
          type: integer
          minimum: 1
          maximum: 100
          description: >-
            Maximum number of result files to import from Hugging Face, capped
            at 100. Repository listing imports default to 50; explicit
            source.files imports are not capped unless this is set.
        metadata:
          type: object
          propertyNames:
            type: string
            maxLength: 64
          additionalProperties:
            type: string
            maxLength: 512
          default: {}
    EvalHistoricalImportObject:
      type: object
      properties:
        object:
          const: eval.import
        source:
          type: object
          properties:
            type:
              const: huggingface
            repo:
              type: string
            revision:
              type: string
          required:
            - type
            - repo
            - revision
        suite_id:
          type: string
        suite_version:
          type: integer
          minimum: -9007199254740991
          maximum: 9007199254740991
        created_suite:
          type: boolean
        created_suite_version:
          type: boolean
        dry_run:
          type: boolean
        discovered_files:
          type: integer
          minimum: 0
          maximum: 9007199254740991
        imported_runs:
          type: integer
          minimum: 0
          maximum: 9007199254740991
        skipped_runs:
          type: integer
          minimum: 0
          maximum: 9007199254740991
        imported_samples:
          type: integer
          minimum: 0
          maximum: 9007199254740991
      required:
        - object
        - source
        - suite_id
        - suite_version
        - created_suite
        - created_suite_version
        - dry_run
        - discovered_files
        - imported_runs
        - skipped_runs
        - imported_samples
  securitySchemes:
    bearerAuth:
      type: http
      scheme: bearer
      bearerFormat: API Key
      description: >-
        Gateway auth: send `Authorization: Bearer <mka1-api-key>`. For
        multi-user server-side integrations, you can also send `X-On-Behalf-Of:
        <external-user-id>`.

````