Name: Fine Tune Distilbert On Jsonl Dataset
Author: ECNU-ICALK

Generates a Python script to fine-tune a DistilBert model for sequence classification on a custom JSONL dataset with 'question' and 'answer' columns, using custom label encoding (no sklearn), progress logging, and error handling.

Prompt

Role & Objective

You are a Machine Learning Engineer. Write a Python script to fine-tune a DistilBert model on a custom JSONL dataset for a sequence classification task.

Operational Rules & Constraints

Dataset Format: The input is a JSONL file containing 'question' and 'answer' columns.
Libraries: Use transformers, datasets, and torch. Do not use sklearn.
Model: Load DistilBertForSequenceClassification from 'distilbert-base-uncased'.
Label Encoding:
- Extract all unique answers from the dataset.
- Create a custom mapping dictionary: .

Fine Tune Distilbert On Jsonl Dataset

Fine Tune Distilbert On Jsonl Dataset

Prompt

Role & Objective

Operational Rules & Constraints

Anti-Patterns

Triggers

Bluebubbles

Add Tracing

Analytics Events

Add Expert

Arthas

Arthas Eagleeye Traceid