Investigate datasets from HuggingFace, CSV, or JSON files to understand their structure, fields, and data quality. Trigger whenever you need to explore or inspect a dataset yourself without using pre-written scripts.
This workflow helps you explore and understand datasets used in evaluations. It covers HuggingFace datasets, CSV files, and JSON/JSONL files.
For detailed information on Inspect's dataset types (datasets.Dataset vs inspect_ai.dataset.Dataset), the hf_dataset() pipeline, caching behaviour, and test utilities, see references/inspect-dataset-patterns.md.
Evals typically define:
- DATASET_PATH: HuggingFace repo path (e.g., "qiaojin/PubMedQA")
- DATASET_REVISION: Optional git revision/tag for reproducibility
- record_to_sample(): Function converting raw records to Sample objects

You will need datasets, pandas, and inspect_ai installed.

Look for these patterns in the evaluation code:
# HuggingFace dataset
DATASET_PATH = "org/dataset-name"
DATASET_REVISION = "v1.0" # optional
hf_dataset(path=DATASET_PATH, name="subset", split="train", ...)
# CSV dataset
csv_dataset("path/to/file.csv", ...)
load_csv_dataset("https://example.com/file.csv", eval_name="myeval", ...)
# JSON/JSONL dataset
json_dataset("path/to/file.json", ...)
load_json_dataset("https://example.com/file.jsonl", eval_name="myeval", ...)
For investigation, load the raw data directly (not through Inspect's sample_fields transformation). Use standard datasets.load_dataset() for HuggingFace, pd.read_csv() for CSV, or pd.read_json() for JSON/JSONL. For gated datasets, ensure HF_TOKEN is set or run huggingface-cli login.
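A minimal offline sketch of the raw loaders (the inline data and field names here are made up; for HuggingFace, datasets.load_dataset() takes a repo path the same way):

```python
import io
import pandas as pd

# For HuggingFace (network required), the raw-data equivalent is:
#   from datasets import load_dataset
#   ds = load_dataset("qiaojin/PubMedQA", "pqa_labeled", split="train")

# CSV: inline data stands in for a real file path or URL
csv_data = io.StringIO("question,answer\nWhat is 2+2?,4\nCapital of France?,Paris\n")
df_csv = pd.read_csv(csv_data)

# JSONL: one JSON object per line; lines=True is required for JSONL
jsonl_data = io.StringIO('{"question": "Q1", "target": "A"}\n{"question": "Q2", "target": "B"}\n')
df_jsonl = pd.read_json(jsonl_data, lines=True)

print(df_csv.shape)               # (rows, columns)
print(df_jsonl.columns.tolist())  # field names
```

Loading raw like this lets you compare against what record_to_sample expects, field by field.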
Use standard pandas/datasets methods to explore:
- Field types: ds.features (HF) or df.dtypes (pandas)
- Size and columns: len(ds), ds.column_names (HF) or df.info(), df.columns (pandas)
- First records: ds[:3] (HF) or df.head() (pandas)
- Missing values: None, empty strings, empty lists
- Distributions: value_counts() for categorical columns, length stats for text fields

For converting an Inspect Dataset (which has no .to_pandas()) to a DataFrame, see references/inspect-dataset-patterns.md.
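A sketch of these checks against a toy DataFrame (the question/label fields are placeholders, not a real dataset's schema):

```python
import pandas as pd

# Toy frame standing in for a loaded dataset
df = pd.DataFrame({
    "question": ["What is 2+2?", "Capital of France?", None],
    "label": ["yes", "no", "yes"],
})

print(df.dtypes)                            # field types
print(len(df), df.columns.tolist())         # size and columns
print(df.head())                            # first records
print(df.isna().sum())                      # missing values per column
print(df["label"].value_counts())           # categorical distribution
print(df["question"].str.len().describe())  # text length stats
```

The same checks on a HuggingFace dataset use ds.features, ds.column_names, and ds[:3].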
Look at the record_to_sample function to understand how raw data maps to Inspect samples. Key questions:
- Which fields become the input? Are they combined/formatted?
- What is the target format? (letter, text, JSON, etc.)
- Are there choices for multiple choice?
- What ends up in metadata?

See references/inspect-dataset-patterns.md for the pattern to load through Inspect's hf_dataset() and verify sample conversion works correctly.
# View HF dataset info without downloading
uv run python -c "from datasets import load_dataset_builder; b = load_dataset_builder('org/name'); print(b.info)"
# List available configs/subsets
uv run python -c "from datasets import get_dataset_config_names; print(get_dataset_config_names('org/name'))"
# List available splits
uv run python -c "from datasets import get_dataset_split_names; print(get_dataset_split_names('org/name'))"
For cache locations (HuggingFace native, Inspect AI, Inspect Evals), force re-download commands, and test utilities, see references/inspect-dataset-patterns.md.
- Gated datasets: run huggingface-cli login or set HF_TOKEN
- Flaky downloads: the hf_dataset wrapper in inspect_evals.utils.huggingface has built-in retry with backoff
- Large datasets: use streaming=True or split="train[:1000]" for sampling