When building production AI applications, you need to improve model performance on specific tasks beyond what general-purpose models provide. Datasets & Fine-Tuning help you curate high-quality training data from your real production traffic and fine-tune models for better accuracy, consistency, and domain-specific performance.
Review your existing LLM requests in the Helicone dashboard and assign quality scores based on accuracy and relevance. You can score manually or use automated scoring to identify your best examples.
2
Filter Requests
Use Helicone’s filtering system to find high-quality requests based on scores, dates, models, or custom properties.
3
Select for Dataset
Choose the filtered requests you want to include and add them to a new or existing dataset.
4
Curate Dataset
Review, organize, and refine your dataset by removing poor examples, balancing categories, and ensuring consistency.
5
Export
Export your curated dataset in JSONL format for use with fine-tuning platforms like OpenAI, Anthropic, or OpenPipe.