Overview

Legacy Evaluations, Reports, and Datasets are deprecated for new workflows. Use Tables for new evaluation, dataset, report, backtesting, and batch workflows. See Migrate from Evaluations and Datasets.

Datasets are PromptLayer’s versioned system of record for evaluation inputs and historical examples. Use them when you want a reusable set of test cases for evaluations, backtests, regression checks, or batch workflows. A Dataset Group is the container for one dataset over time. Inside that group, you edit a draft dataset, save numbered versions, and reuse those versions across evaluations. You can create data by uploading a CSV or JSON file, building from request history, adding individual rows from observability traces, or turning evaluation outputs back into a dataset for the next iteration.
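
The relationship between a group, its draft, and its numbered versions can be sketched in a few lines of Python. This is a conceptual model only: the `DatasetGroup` class, its methods, and the example rows are hypothetical names chosen for illustration and are not part of PromptLayer's SDK or API.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Tuple

Row = Dict[str, str]


@dataclass
class DatasetGroup:
    """Hypothetical in-memory model of a dataset group (not PromptLayer's SDK)."""

    name: str
    draft: List[Row] = field(default_factory=list)
    versions: List[Tuple[Row, ...]] = field(default_factory=list)

    def add_row(self, row: Row) -> None:
        # Edits only ever touch the draft, never a saved version.
        self.draft.append(dict(row))

    def save_version(self) -> int:
        # Snapshot a copy of the draft so later draft edits do not
        # change saved versions. Version numbers start at 1.
        self.versions.append(tuple(dict(r) for r in self.draft))
        return len(self.versions)

    def get_version(self, number: int) -> Tuple[Row, ...]:
        if not 1 <= number <= len(self.versions):
            raise KeyError(f"no version {number} in group {self.name!r}")
        return self.versions[number - 1]


group = DatasetGroup("support-tickets")
group.add_row({"question": "How do I reset my password?", "expected": "reset link"})
v1 = group.save_version()
group.add_row({"question": "Where is my invoice?", "expected": "billing page"})
v2 = group.save_version()
```

In this sketch an evaluation that was attached to version 1 keeps seeing one row even after the draft grows, which is the property that makes a saved version safe to reuse.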

What you can do

Create datasets from files, request history, or evaluation results.
Add individual rows programmatically from observability traces via the API.
Edit draft datasets before publishing a version, including adding rows, renaming or deleting columns, updating values, and converting columns to JSON.
Organize dataset groups with folders, tags, changelog history, and archive controls.
Reuse a specific dataset version in an evaluation blueprint, or update a blueprint to a different dataset later.
Export datasets to CSV and use report outputs to seed the next round of testing.
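
As a rough illustration of the draft edits and CSV export listed above, the following sketch applies them to plain Python rows using only the standard library. The helper names and the column names (`input`, `meta`, `notes`) are assumptions made for this example; they do not correspond to PromptLayer functions, and the product may handle edge cases differently.

```python
import csv
import io
import json
from typing import Any, Dict, List

Row = Dict[str, Any]


def rename_column(rows: List[Row], old: str, new: str) -> List[Row]:
    # Swap the key name while preserving column order.
    return [{(new if k == old else k): v for k, v in row.items()} for row in rows]


def delete_column(rows: List[Row], name: str) -> List[Row]:
    return [{k: v for k, v in row.items() if k != name} for row in rows]


def column_to_json(rows: List[Row], name: str) -> List[Row]:
    # Parse string cells as JSON; leave cells that are missing or
    # not valid JSON untouched.
    out = []
    for row in rows:
        new_row = dict(row)
        try:
            new_row[name] = json.loads(row[name])
        except (TypeError, ValueError, KeyError):
            pass
        out.append(new_row)
    return out


def to_csv(rows: List[Row]) -> str:
    if not rows:
        return ""
    buffer = io.StringIO()
    writer = csv.DictWriter(buffer, fieldnames=list(rows[0].keys()))
    writer.writeheader()
    for row in rows:
        # Serialize structured cells back to JSON text for the CSV.
        writer.writerow(
            {k: json.dumps(v) if isinstance(v, (dict, list)) else v for k, v in row.items()}
        )
    return buffer.getvalue()


draft = [{"input": "hi", "meta": '{"lang": "en"}', "notes": "tmp"}]
draft = rename_column(draft, "input", "question")
draft = delete_column(draft, "notes")
draft = column_to_json(draft, "meta")
csv_text = to_csv(draft)
```

Each helper returns a new list instead of mutating its input, so the rows you started from stay available if an edit needs to be undone.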

How it fits together

1. Create a dataset group and populate a draft.
2. Edit the draft and save a version.
3. Attach that version to an evaluation blueprint.
4. Run full batches, review results and history, and feed outputs into the next dataset iteration.
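
One way to picture the last step, feeding outputs into the next iteration, is to keep the low-scoring results as rows for the next draft. The sketch below assumes a hypothetical result shape (`question`, `output`, `score`) and a hypothetical threshold of 0.5; real evaluation results will have whatever columns your blueprint defines, and which outputs are worth carrying forward is a judgment call.

```python
from typing import Dict, List, Union

Cell = Union[str, float]


def seed_next_draft(
    results: List[Dict[str, Cell]], threshold: float = 0.5
) -> List[Dict[str, Cell]]:
    """Keep results scoring below the threshold as rows for the next draft."""
    next_rows = []
    for result in results:
        if float(result["score"]) < threshold:
            next_rows.append(
                {
                    "question": result["question"],
                    "previous_output": result["output"],
                }
            )
    return next_rows


results = [
    {"question": "How do I reset my password?", "output": "Use the reset link.", "score": 0.9},
    {"question": "Where is my invoice?", "output": "I don't know.", "score": 0.1},
]
next_rows = seed_next_draft(results)
```

Here only the invoice question is carried forward, giving the next dataset version a regression case for the answer that scored poorly.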

Next steps

Evaluations

Learn how datasets power evaluation blueprints, scoring, and batch runs.

Programmatic Evals

Build datasets and evaluations from code or CI workflows.

Create from File

Upload a CSV or JSON file to create a new dataset version.

Create from History

Build a dataset version from filtered request history.

Add Trace to Dataset

Add an observability trace or span subtree as a dataset row via the API.
