# Data Types & Constraints

## Schema Overview
The Supervised AI Testing API uses a strictly typed JSON schema for all request and response bodies. To ensure consistency across the Supervised AI platform, all data must conform to the following primitive types and structured object formats.
## Base Data Types
The following primitive types are used throughout the API:
| Type | Description | Example |
| :--- | :--- | :--- |
| UUID | A universally unique identifier (v4). | "f47ac10b-58cc-4372-a567-0e02b2c3d479" |
| String | UTF-8 encoded text. | "Model Evaluation Alpha" |
| Float | 64-bit floating-point number. | 0.985 |
| Integer | 32-bit signed integer. | 500 |
| Boolean | Logical true or false. | true |
| ISO-8601 | Date and time in UTC. | "2023-10-27T10:00:00Z" |
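As a client-side sanity check, the two format-sensitive primitives above (UUID v4 and ISO-8601 UTC timestamps) can be validated with the Python standard library. This is an illustrative sketch, not part of the API; the helper names are assumptions:

```python
import uuid
from datetime import datetime

def is_valid_uuid4(value: str) -> bool:
    """Check that a string is a well-formed version-4 UUID, per the UUID type above."""
    try:
        return uuid.UUID(value).version == 4
    except ValueError:
        return False

def is_valid_utc_timestamp(value: str) -> bool:
    """Check that a string is an ISO-8601 UTC timestamp like '2023-10-27T10:00:00Z'."""
    try:
        # fromisoformat() before Python 3.11 does not accept a trailing 'Z'.
        parsed = datetime.fromisoformat(value.replace("Z", "+00:00"))
        return parsed.utcoffset() is not None and parsed.utcoffset().total_seconds() == 0
    except ValueError:
        return False
```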
## Core Objects & Constraints

### Test Case Object
The TestCase object represents a single unit of evaluation data.
| Field | Type | Required | Description | Constraints |
| :--- | :--- | :--- | :--- | :--- |
| id | UUID | Yes | Unique identifier for the test case. | Unique within the dataset. |
| input | String / Object | Yes | The data passed to the model. | Max size 2MB. |
| expected_output | String / Object | Yes | The ground truth for evaluation. | Must match model output format. |
| tags | Array<String> | No | Metadata for filtering test cases. | Max 10 tags per case. |
| priority | Integer | No | Execution weight. | Value between 1 and 5. |
Example:
```json
{
  "id": "550e8400-e29b-41d4-a716-446655440000",
  "input": "Summarize the latest quarterly report.",
  "expected_output": "The report shows a 15% increase in revenue...",
  "tags": ["production", "regression"],
  "priority": 3
}
```
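The field constraints in the table above can be enforced before submission to avoid a rejected request. The following is a minimal client-side sketch; the function name and error messages are illustrative, not part of the API:

```python
import json

MAX_INPUT_BYTES = 2 * 1024 * 1024  # input field limited to 2MB
MAX_TAGS = 10                      # max 10 tags per case

def validate_test_case(case: dict) -> list:
    """Return a list of constraint violations for a TestCase payload."""
    errors = []
    for field in ("id", "input", "expected_output"):
        if field not in case:
            errors.append("missing required field: " + field)
    payload = case.get("input", "")
    raw = payload if isinstance(payload, str) else json.dumps(payload)
    if len(raw.encode("utf-8")) > MAX_INPUT_BYTES:
        errors.append("input exceeds 2MB")
    if len(case.get("tags", [])) > MAX_TAGS:
        errors.append("too many tags (max 10 per case)")
    priority = case.get("priority")
    if priority is not None and not 1 <= priority <= 5:
        errors.append("priority must be between 1 and 5")
    return errors
```

An empty list means the payload satisfies every constraint in the table.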
### Evaluation Metric Constraint
Used to define the success criteria for a testing run.
| Field | Type | Required | Description | Constraints |
| :--- | :--- | :--- | :--- | :--- |
| metric_name | Enum | Yes | The evaluation metric to apply. | See Metric Types. |
| threshold | Float | Yes | The minimum passing score. | 0.0 to 1.0. |
| comparator | String | Yes | Comparison operator. | GT, GTE, LT, LTE. |
Example:
```json
{
  "metric_name": "ROUGE_L",
  "threshold": 0.85,
  "comparator": "GTE"
}
```
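A constraint like the one above is satisfied when the comparator holds between the computed metric score and the threshold. A minimal sketch of that check (the function name is illustrative):

```python
# Map each supported comparator string to its comparison function.
COMPARATORS = {
    "GT":  lambda score, threshold: score > threshold,
    "GTE": lambda score, threshold: score >= threshold,
    "LT":  lambda score, threshold: score < threshold,
    "LTE": lambda score, threshold: score <= threshold,
}

def constraint_passes(score: float, constraint: dict) -> bool:
    """Return True if a metric score satisfies an Evaluation Metric Constraint."""
    compare = COMPARATORS[constraint["comparator"]]
    return compare(score, constraint["threshold"])
```

With the example constraint above, a ROUGE_L score of 0.85 or higher passes; anything lower fails.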
## Enumerations

### Metric Types

The `metric_name` field must be one of the following supported values:

- `ACCURACY`: Direct match between prediction and ground truth.
- `F1_SCORE`: Harmonic mean of precision and recall.
- `BLEU`: Evaluates candidate text against reference text.
- `ROUGE_L`: Measures longest common subsequence for summarization tasks.
- `LATENCY`: Response time in milliseconds (Constraint: `LT` or `LTE` recommended).
### Test Status

The status of a testing job or individual test case.

- `PENDING`: Queued for execution.
- `RUNNING`: Currently being processed by the evaluation engine.
- `PASSED`: Met or exceeded all metric constraints.
- `FAILED`: Failed one or more metric constraints.
- `ERROR`: System or model failure during execution.
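Clients typically mirror these statuses, for example to decide when to stop polling a job. A sketch under the assumption that `PASSED`, `FAILED`, and `ERROR` are terminal (the enum and helper names are illustrative):

```python
from enum import Enum

class TestStatus(Enum):
    PENDING = "PENDING"
    RUNNING = "RUNNING"
    PASSED = "PASSED"
    FAILED = "FAILED"
    ERROR = "ERROR"

# Terminal states: once reached, the job will not transition further.
TERMINAL_STATES = {TestStatus.PASSED, TestStatus.FAILED, TestStatus.ERROR}

def is_terminal(status: str) -> bool:
    """True if a status string from the API names a finished job."""
    return TestStatus(status) in TERMINAL_STATES
```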
## Global Validation Rules
- **Strict Typing**: Passing a string representation of a number where a `Float` or `Integer` is expected will result in a `400 Bad Request`.
- **Character Limits**: All `String` fields, unless otherwise specified, have a default maximum length of 4096 characters.
- **Null Values**: Optional fields should be omitted rather than passed as `null` unless the specific endpoint documentation permits nullable values.
- **Array Limits**: Collection objects (like `tags` or `constraints`) are limited to a maximum of 50 items per request to ensure performance.
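These global rules can likewise be checked before a request leaves the client. A sketch of the null, string-length, and array-limit checks on a flat payload (function and constant names are illustrative; nested objects are not handled here):

```python
DEFAULT_MAX_STRING = 4096  # default String field limit
MAX_ARRAY_ITEMS = 50       # max items per collection field

def check_global_rules(payload: dict) -> list:
    """Flag top-level payload fields that violate the global validation rules."""
    problems = []
    for key, value in payload.items():
        if value is None:
            # Optional fields should be omitted, not sent as null.
            problems.append(key + ": omit optional fields instead of sending null")
        elif isinstance(value, str) and len(value) > DEFAULT_MAX_STRING:
            problems.append(key + ": exceeds default 4096-character limit")
        elif isinstance(value, list) and len(value) > MAX_ARRAY_ITEMS:
            problems.append(key + ": more than 50 items")
    return problems
```

Strict typing (string-vs-number mismatches) is schema-dependent and is left to the server's `400 Bad Request` response in this sketch.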