Skill-Datei

Databricks SA Knowledge Base

Name: Databricks SA Knowledge Base
Author: slysik

Databricks Solutions Architect knowledge base for Data Warehousing (DW spike), FinServ vertical, and design & architecture interview prep. Use when: designing Databricks lakehouse architectures, answering Medallion/Delta Lake/Unity Catalog questions, comparing Databricks vs Snowflake/Teradata, generating Mermaid architecture diagrams, handling DW migration scenarios, modeling star schema or Data Vault on Delta Lake, explaining Liquid Clustering / Photon / Serverless SQL / LakeFlow / Lakebridge, or coaching SA interview answers. Keywords: Databricks, lakehouse, Delta Lake, medallion, bronze silver gold, Unity Catalog, DW, data warehouse, FinServ, financial services, Teradata migration, Snowflake comparison, SCD, slowly changing dimension, Liquid Clustering, Photon, Serverless SQL, LakeFlow, Lakebridge, SA interview, solutions architect.

slysik0 Sterne10.03.2026

Beruf
Kategorien: Data Engineering

Skill-Inhalt

Purpose

This skill provides deep Databricks Data Warehousing knowledge for Steve Lysik's SA interview preparation. It contains architecture patterns, feature reference, competitive intelligence, and scenario-based learning material.

Spike: Data Warehousing (DW)
Vertical: Financial Services (FinServ)
Interview type: Design & Architecture

Quick Reference: DW Feature Status (2025–2026)

Feature	Status	Key Talking Point
Serverless SQL Warehouse	GA	40% perf gain in 2025; 2–6s startup; IWM + PQE
Liquid Clustering	GA (DBR 15.2+)	Replaces partitioning + Z-ORDER; `CLUSTER BY AUTO`
Predictive Optimization	GA (auto-enabled)	Auto OPTIMIZE/VACUUM/ANALYZE on UC managed tables

Verwandte Skills

Databricks SA Knowledge Base | Skills Pool

SOURCES → INGESTION → MEDALLION (Bronze/Silver/Gold) → GOVERNANCE → CONSUMPTION

Dimension	Databricks	Snowflake
Price/Performance	2.8x faster, 3.6x less cost (TPC-DS-like)	Simpler pricing model
Streaming	Native Structured Streaming, DLT, Auto Loader	Snowpipe simpler for light CDC
AI/ML	Native MLflow, Feature Store, Model Serving	Snowpark growing but less mature
Open Formats	Delta Lake + Iceberg; zero vendor lock-in	Iceberg support added recently
Governance	Unity Catalog (multi-cloud, multi-language)	Strong SQL-level policies
BI Concurrency	Improving rapidly with IWM	Historical strength
Ease of Use	More powerful, steeper curve	Simpler for SQL-only teams

File	When to Load
references/dw-architecture.md	Detailed DW patterns, SCD code, Liquid Clustering syntax
references/competitive.md	Full competitive battle cards and objection handling
references/discovery-framework.md	Complete discovery question bank by category

File	Scenario
scenarios/finserv.yaml	FinServ bank — regulatory reporting, real-time risk
scenarios/dw-migration.yaml	500TB Teradata migration to Databricks
scenarios/wegmans.yaml	Retail — demand forecasting, real-time inventory

Script	Purpose
scripts/gen-arch.sh	Generate architecture diagram from a scenario YAML
scripts/open-live.sh	Open live-arch.md in browser for real-time preview

Databricks SA Knowledge Base

Purpose

Quick Reference: DW Feature Status (2025–2026)

Databricks SA Knowledge Base

Purpose

Quick Reference: DW Feature Status (2025–2026)

Architecture Patterns

The 5-Layer Template (Use Every Time)

Medallion Layer Guidance

Star Schema on Databricks

Data Vault on Databricks

Migration Framework (Teradata/Netezza → Databricks)

Competitive Intelligence

Databricks vs Snowflake

Databricks vs Synapse / Fabric

Reference Files

Scenario Files

Scripts

Clickhouse Io

Clickhouse Io

Claude Devfleet

Clickhouse Io

Ai First Engineering

Postgres Patterns