Name: Skill: source retrieval
Author: Dingxingdi

Skill: source retrieval

Use this skill when the user wants SQL training data where the hard part is first finding the right database or table collection before writing the query. Trigger it for requests like “make examples where the agent has to figure out which dataset matters”, “the data should not already be handed to the model”, “it should need to look through many tables first”, or “make open-domain SQL questions.” Example trigger: “The question should not tell the model which database to use.” Example trigger: “Make examples where the agent has to search several candidate tables first.” Example trigger: “I want SQL tasks that feel like finding the right data source is half the job.”

Dingxingdi0 estrellas8 abr 2026

Ocupación
Categorías: Bases de Datos SQL

1. Capability Definition & Real Case

Professional Definition: The ability to identify which database, table bundle, or evidence-bearing source should be queried before any SQL is written, especially when the user question is asked in an open-domain or multi-database setting.
Dimension Hierarchy: Environment Grounding->Retrieval and Alignment->source retrieval

Real Case

[Case 1]

Initial Environment: A large open collection of tabular sources is available, but the relevant table is not preselected. The agent starts only with the natural-language question and a retrieval interface over many candidate tables.
Real Question: What is the highest eligible free rate for K-12 students in the schools in Alameda County?
Real Answer: The agent must first retrieve the correct school-related table for Alameda County before the final SQL can be written and executed.
Why this demonstrates the capability: This task is not hard because of exotic SQL syntax alone. It tests whether the agent can localize the correct source table from a larger corpus, avoid semantically nearby but irrelevant tables, and only then translate the question into executable SQL. A SQL agent that skips this retrieval step will often produce syntactically plausible but evidentially ungrounded queries.

Skill: source retrieval

Dingxingdi0 estrellas8 abr 2026

Ocupación
Categorías: Bases de Datos SQL

1. Capability Definition & Real Case

Professional Definition: The ability to identify which database, table bundle, or evidence-bearing source should be queried before any SQL is written, especially when the user question is asked in an open-domain or multi-database setting.

Dimension Hierarchy: Environment Grounding->Retrieval and Alignment->source retrieval

Real Case

[Case 1]

Initial Environment: A large open collection of tabular sources is available, but the relevant table is not preselected. The agent starts only with the natural-language question and a retrieval interface over many candidate tables.

Real Question: What is the highest eligible free rate for K-12 students in the schools in Alameda County?

Real Answer: The agent must first retrieve the correct school-related table for Alameda County before the final SQL can be written and executed.

Why this demonstrates the capability: This task is not hard because of exotic SQL syntax alone. It tests whether the agent can localize the correct source table from a larger corpus, avoid semantically nearby but irrelevant tables, and only then translate the question into executable SQL. A SQL agent that skips this retrieval step will often produce syntactically plausible but evidentially ungrounded queries.

Skill: source retrieval

1. Capability Definition & Real Case

Real Case

Skill: source retrieval

1. Capability Definition & Real Case

Real Case

Pipeline Execution Instructions

Postgres Patterns

Postgres Patterns

Database Migrations

Postgres Patterns

Postgres Patterns

Jpa Patterns