What you can do
- Create datasets from files, request history, or evaluation results.
- Edit draft datasets before publishing a version, including adding rows, renaming or deleting columns, updating values, and converting columns to JSON.
- Organize dataset groups with folders, tags, changelog history, and archive controls.
- Reuse a specific dataset version in an evaluation blueprint, or update a blueprint to a different dataset later.
- Export datasets to CSV and use report outputs to seed the next round of testing.
How it fits together
- Create a dataset group and populate a draft.
- Edit the draft and save a version.
- Attach that version to an evaluation blueprint.
- Run full batches, review results and history, and feed outputs into the next dataset iteration.
Next steps
Evaluations
Learn how datasets power evaluation blueprints, scoring, and batch runs.
Programmatic Evals
Build datasets and evaluations from code or CI workflows.
Create from File
Upload a CSV or JSON file to create a new dataset version.
Create from History
Build a dataset version from filtered request history.

