training-data
Installation
SKILL.md
Training Data
Managing and improving training data quality.
Data Labeling Strategies
Manual Labeling
# Export for Label Studio
def export_for_labeling(data: pd.DataFrame, output_path: str):
tasks = [
{"data": {"text": row["text"]}, "id": idx}
for idx, row in data.iterrows()
]
with open(output_path, 'w') as f:
json.dump(tasks, f)