Helicone's dataset curation interface for preparing fine-tuning data
Curate your dataset
Advanced filtering options
Export for fine-tuning
Create fine-tuning job
| Setting | Type | Description | Default | Example |
|---|---|---|---|---|
| minScore | number | Minimum score for including requests | 0 | 0.8 |
| maxTokens | number | Maximum tokens per example | ∞ | 4096 |
| format | string | Export format for platform | "openai" | "anthropic" |
| includeMetadata | boolean | Include Helicone metadata | false | true |
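
To make the effect of these settings concrete, here is a minimal local sketch of how `minScore`, `maxTokens`, and `includeMetadata` might be applied to a list of logged requests. It is not Helicone's implementation: the record shape (`prompt`, `completion`, `score`, `tokens`, `metadata`) and the names `ExportSettings` and `select_examples` are assumptions made for illustration, and the score threshold is treated as inclusive.

```python
from dataclasses import dataclass


@dataclass
class ExportSettings:
    # Hypothetical container; fields mirror the table, defaults match its "Default" column.
    min_score: float = 0.0
    max_tokens: float = float("inf")
    format: str = "openai"
    include_metadata: bool = False


def select_examples(requests, settings):
    """Keep requests that meet the score floor and stay within the token ceiling."""
    kept = []
    for req in requests:
        if req["score"] < settings.min_score:
            continue  # below minScore: drop
        if req["tokens"] > settings.max_tokens:
            continue  # longer than maxTokens: drop
        example = {"prompt": req["prompt"], "completion": req["completion"]}
        if settings.include_metadata:
            # Assumed field name; real exports may structure metadata differently.
            example["metadata"] = req.get("metadata", {})
        kept.append(example)
    return kept
```

With `ExportSettings(min_score=0.8, max_tokens=4096)`, a request scored 0.5 or one with 5,000 tokens would be excluded, while the defaults keep every request.
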
| Setting | Type | Description | Default | Example |
|---|---|---|---|---|
| validationSplit | number | Fraction of examples (0 to 1) held out for the validation set | 0.2 | 0.15 |
| deduplication | boolean | Remove duplicate examples | true | false |
| sampling | object | Sampling configuration | {} | {"method": "random", "size": 1000} |
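
The following sketch shows one way the three settings above could combine when preparing a dataset locally. It is an illustration under stated assumptions, not Helicone's behavior: duplicates are detected by exact match on the serialized example, only the `"random"` sampling method from the example column is handled, and `prepare_dataset` and its `seed` parameter are hypothetical.

```python
import json
import random


def prepare_dataset(examples, validation_split=0.2, deduplication=True,
                    sampling=None, seed=0):
    """Return (train, validation) lists after optional dedup and sampling."""
    rng = random.Random(seed)  # seeded so the split is reproducible
    items = list(examples)

    if deduplication:
        # Assumption: two examples are duplicates when their JSON forms are identical.
        seen = set()
        unique = []
        for ex in items:
            key = json.dumps(ex, sort_keys=True)
            if key not in seen:
                seen.add(key)
                unique.append(ex)
        items = unique

    if sampling and sampling.get("method") == "random":
        # Never request more items than are available.
        size = min(sampling.get("size", len(items)), len(items))
        items = rng.sample(items, size)

    rng.shuffle(items)
    n_val = round(len(items) * validation_split)
    return items[n_val:], items[:n_val]
```

For instance, 10 unique examples with the default `validation_split=0.2` yield 8 training and 2 validation examples. Deduplicating before splitting keeps identical examples from landing in both sets.
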
Validation Split
Deduplication Strategies