> ## Documentation Index
> Fetch the complete documentation index at: https://docs.helicone.ai/llms.txt
> Use this file to discover all available pages before exploring further.

# Datasets & Fine-Tuning

When building production AI applications, you need to improve model performance on specific tasks beyond what general-purpose models provide. Datasets & Fine-Tuning help you curate high-quality training data from your real production traffic and fine-tune models for better accuracy, consistency, and domain-specific performance.

## Why use Datasets & Fine-Tuning

* **Production-ready datasets**: Transform your actual LLM requests into high-quality training data with scoring and filtering
* **Seamless fine-tuning integration**: Export to JSONL or connect directly to fine-tuning platforms like OpenPipe
* **Iterative improvement**: Use real performance data to continuously refine your datasets and models

<Frame caption="Dataset curation interface showing request filtering and scoring for fine-tuning preparation">
  <img src="https://mintcdn.com/helicone/psm-vDV7pnoZSp6H/images/features/fine-tuning/dataset2.webp?fit=max&auto=format&n=psm-vDV7pnoZSp6H&q=85&s=551d97d0ef6c8b87a3d3fae7261e3989" alt="Helicone dataset curation interface with request filtering, scoring, and dataset management tools" width="2324" height="1268" data-path="images/features/fine-tuning/dataset2.webp" />
</Frame>

## Quick Start

<Steps>
  <Step title="Score Your Requests">
    Review your existing LLM requests in the Helicone dashboard and assign quality scores based on accuracy and relevance. You can score manually or use [automated scoring](/features/advanced-usage/scores) to identify your best examples.

    <img src="https://mintcdn.com/helicone/tEQUFyBH7IjDxuEd/images/datasets/scores.webp?fit=max&auto=format&n=tEQUFyBH7IjDxuEd&q=85&s=180e89cf2dec6b109f6bc26ed4274b19" alt="Score your requests" width="1278" height="770" data-path="images/datasets/scores.webp" />
  </Step>

  <Step title="Filter Requests">
    Use Helicone's filtering system to find high-quality requests based on scores, dates, models, or custom properties.

    <img src="https://mintcdn.com/helicone/tEQUFyBH7IjDxuEd/images/datasets/filters.webp?fit=max&auto=format&n=tEQUFyBH7IjDxuEd&q=85&s=a83618a1afd919bf3739f36aa1fbe635" alt="Filter your requests" width="1588" height="466" data-path="images/datasets/filters.webp" />
  </Step>

  <Step title="Select for Dataset">
    Choose the filtered requests you want to include and add them to a new or existing dataset.

    <video autoPlay controls loop muted playsInline className="w-full aspect-video rounded-xl" src="https://marketing-assets-helicone.s3.us-west-2.amazonaws.com/datasets-create.mp4" />
  </Step>

  <Step title="Curate Dataset">
    Review, organize, and refine your dataset by removing poor examples, balancing categories, and ensuring consistency.

    <video autoPlay controls loop muted playsInline className="w-full aspect-video rounded-xl" src="https://marketing-assets-helicone.s3.us-west-2.amazonaws.com/datasets-curate.mp4" />
  </Step>

  <Step title="Export">
    Export your curated dataset in JSONL format for use with fine-tuning platforms like OpenAI, Anthropic, or OpenPipe.

    <video autoPlay controls loop muted playsInline className="w-full aspect-video rounded-xl" src="https://marketing-assets-helicone.s3.us-west-2.amazonaws.com/datasets-download.mp4" />
  </Step>
</Steps>

## Export Formats

Helicone supports multiple export formats for different fine-tuning platforms:

* **OpenAI JSONL**: Compatible with OpenAI's fine-tuning API
* **Anthropic Format**: For Claude model fine-tuning
* **Generic JSONL**: Works with most platforms
* **CSV**: For data analysis and custom workflows

## Related Features

<CardGroup cols={2}>
  <Card title="Scores" icon="star" href="/features/advanced-usage/scores">
    Score LLM responses to identify high-quality training data
  </Card>

  <Card title="User Feedback" icon="thumbs-up" href="/features/advanced-usage/feedback">
    Collect user feedback to improve dataset quality
  </Card>

  <Card title="Prompt Management" icon="file-text" href="/features/advanced-usage/prompts">
    Version and manage prompts used in your training data
  </Card>

  <Card title="Sessions" icon="layers" href="/features/sessions">
    Group related requests for better dataset organization
  </Card>
</CardGroup>
