Name: Categorizing Files
Author: austinogilvie

搵技能.../

Categorizing Files | Skills Pool

File Type	Category	Reasoning
SQL DDL (CREATE TABLE)	Docs	Documents database structure
SQL DML (INSERT/SELECT)	Data	Contains or queries data
.duckdb, .sqlite files	Data	Database storage
schema.json, openapi.yaml	Docs	Specification/contract files
Shell scripts (.sh)	Scripts	Executable automation
requirements.txt	Config	Dependency configuration

function categorize(filePath, content):

  # PHASE 1: Filename + directory rules
  category = byLocationOrExtension(filePath)
  if category != UNKNOWN: return (category, "High")

  # PHASE 2: Frontmatter refinement
  fm = extractYAMLFrontmatter(content)
  if fm indicates config: return (CONFIG, "Medium")
  if fm indicates ai_tooling: return (AI_TOOLING, "Medium")

  # PHASE 3: Content structure analysis
  if looksLikeTest(content): return (TESTS, "Medium")
  if looksLikeScript(content): return (SCRIPTS, "Medium")
  if looksLikeSource(content): return (SOURCE_CODE, "Medium")
  if looksLikeDocs(content): return (DOCS, "Medium")
  if looksLikeData(content): return (DATA, "Medium")

  # PHASE 4: Keyword detection (fallback)
  kw_category = detectByKeywords(content)
  if kw_category: return (kw_category, "Low")

  return (OTHER, "Low")

### [Filename]

- **Category**: [Config | Tests | Docs | Scripts | Source Code | Data | AI Tooling | Other]
- **Confidence**: [High | Medium | Low]
- **Reasoning**: [Why this category was chosen]
- **Recommended Location**: [Suggested directory if misfiled, or "Correct" if well-placed]

### src/utils/helpers.py
- **Category**: Source Code
- **Confidence**: High
- **Reasoning**: Located in `src/` directory; `.py` extension; module structure
- **Recommended Location**: Correct

### customers.csv
- **Category**: Data
- **Confidence**: High
- **Reasoning**: CSV extension; tabular structure detected
- **Recommended Location**: `data/customers.csv`

### notes.txt
- **Category**: Other
- **Confidence**: Low
- **Reasoning**: Prose content; no structural markers; could be Docs if formalized
- **Recommended Location**: Manual review needed — consider `docs/` if documentation

Config (5):
  - .gitignore
  - pyproject.toml
  - docker-compose.yml

Source Code (12):
  - src/main.py
  - src/utils/helpers.py

Tests (4):
  - tests/test_main.py
  - tests/conftest.py

Docs (2):
  - README.md
  - docs/API.md

AI Tooling (1):
  - .claude/skills/categorizing-files/SKILL.md

Other (1):
  - notes.txt (Low confidence — review needed)

Excluded 2 directories: 1 via Layer 1 (always-exclude), 1 via .gitignore

# Basic usage
python scripts/categorize.py [path]

# Enable content analysis (Phases 2-4)
python scripts/categorize.py --analyze-content [path]

# Include .gitignore-excluded files (bypass Layers 2-3)
python scripts/categorize.py --include-ignored [path]

# Include ALL files including node_modules (use with caution)
python scripts/categorize.py --include-all [path]

python scripts/categorize.py myfile.py
# Output: myfile.py: Source Code (High)

python scripts/categorize.py .
# Output: Grouped list by category with exclusion summary

Category	Description
Config	Configuration files for tools, environments, and build systems
Tests	Test files, fixtures, and testing utilities
Docs	Documentation, READMEs, and guides
Scripts	Standalone executable scripts and automation
Source Code	Core application/library source files
Data	Data files, datasets, and static assets
AI Tooling	AI/ML configs, prompts, and agent definitions
Other	Files that don't fit other categories (fallback)

Category	Description
Config	Configuration files for tools, environments, and build systems
Tests	Test files, fixtures, and testing utilities
Docs	Documentation, READMEs, and guides
Scripts	Standalone executable scripts and automation
Source Code	Core application/library source files
Data	Data files, datasets, and static assets
AI Tooling	AI/ML configs, prompts, and agent definitions
Other	Files that don't fit other categories (fallback)

Categorizing Files

File Categorization

Categories

Category Mapping Rules

Categorizing Files

File Categorization

Categories

Category Mapping Rules

Directory Exclusion

Categorization Priority

Categorization Algorithm

Pattern Matching Details

Output Format

Content Analysis Caveats

Examples

Example 1: Clear Categorization

Example 2: Misfiled Resource

Example 3: Ambiguous File

Example 4: Project Inventory

Optional: Automated Script

Coding Agent (bash-first)

Fix

Commit

Init

Github Copilot Upgrader

Rebuilding Flutter Tool