Medallion Lakehouse Starter
A production-shaped Bronze / Silver / Gold lakehouse for Microsoft Fabric — deployed in your tenant in under a day, ready to extend with your own data.
The skeleton we use to start nearly every Fabric engagement, packaged so your team can use it without us. Three Lakehouses laid out for Bronze / Silver / Gold, a metadata-driven Spark notebook framework that handles 80% of pipelines you'll ever build, deployment pipelines wired for dev/test/prod, and a Direct Lake semantic model template ready to plug into Power BI.
Designed so a senior engineer can deploy it, configure their first source, and see data flow Bronze → Silver → Gold → Power BI in their first afternoon.
Who this is for
This is for engineering teams who:
- Are starting fresh on Microsoft Fabric and don't want to reinvent the medallion pattern from scratch.
- Already have Fabric capacity provisioned and at least one source they want to land into a lakehouse.
- Want to ship the first production data product in two weeks, not two quarters.
- Prefer to learn by working through a battle-tested reference rather than from a blog post.
It's not for teams who need a no-code or self-service tool. This is engineering source — Python notebooks, JSON config, Spark transformations. Bring an engineer.
What's in the box
A complete, source-controlled Fabric deployment package. Open the repo, fill in three config files, run the deployment script.
| File | Type | What it does |
|---|---|---|
| infrastructure/deploy.ps1 | PS | Idempotent PowerShell script that provisions the three Lakehouses, the workspace, and Git integration via Fabric REST APIs. |
| infrastructure/workspace_config.json | JSON | Workspace + capacity assignments + role assignments. Edit this file, run deploy, done. |
| notebooks/00_framework_setup.ipynb | Python | Bootstraps the metadata tables and configuration schema in the Bronze lakehouse. |
| notebooks/10_ingest_bronze.ipynb | Python | Metadata-driven ingest. Reads a control table; for each source, lands raw data idempotently with file partitioning and arrival metadata. |
| notebooks/20_transform_silver.ipynb | Python | Bronze → Silver with Delta MERGE patterns, deduplication, type conformance, slowly-changing-dimension Type 1 + Type 2 templates. |
| notebooks/30_serve_gold.ipynb | Python | Silver → Gold star schema patterns with surrogate keys, late-arriving dimension handling, and built-in row-count + freshness checks. |
| notebooks/90_optimize.ipynb | Python | Scheduled OPTIMIZE + VACUUM + Z-ORDER with metadata-driven hot-table selection. |
| pipelines/orchestrator.json | JSON | Fabric pipeline that runs the notebooks in order, with retry, timeout, and failure alerting baked in. |
| warehouse/control_tables.sql | T-SQL | Schema for the metadata control tables: source_registry, load_history, data_quality_results. |
| semantic-model/template.bim | TMDL | Direct Lake semantic model template with date dimension, conformed measures, RLS placeholder, BPA-clean. |
| powerbi/starter-report.pbix | PBIX | Two-page Power BI report wired to the semantic model: a working "first dashboard." |
| tests/test_bronze.py | Python | pytest suite covering ingest idempotency, schema evolution, and partition correctness. |
| docs/RUNBOOK.md | MD | The Wednesday-morning operations runbook: alerts, on-call playbook, common failure modes and fixes. |
| docs/ARCHITECTURE.md | MD | Annotated reference architecture with the design decisions explained. |
| docs/CAPACITY_SIZING.xlsx | XLSX | Capacity sizing model: input your workload, output an F-SKU recommendation. |
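To make "metadata-driven" and "idempotent" concrete, here is a minimal plain-Python sketch of the idea behind the Bronze ingest notebook. It is not the shipped notebook code: the real notebook runs on Spark, and the SourceEntry fields, the fetch_batch callable, and the in-memory bronze and load_history structures are all hypothetical stand-ins chosen for illustration. Only the table names source_registry and load_history come from the package listing above.

```python
from dataclasses import dataclass
from datetime import datetime, timezone


@dataclass(frozen=True)
class SourceEntry:
    """One hypothetical row of the source_registry control table."""
    source_name: str
    source_type: str  # e.g. "azure_sql_cdc", "files", "rest" (assumed values)
    enabled: bool = True


def ingest_bronze(registry, fetch_batch, bronze, load_history, batch_id):
    """Land one batch per enabled source, skipping batches already recorded.

    registry     -- iterable of SourceEntry rows
    fetch_batch  -- callable(SourceEntry) -> list of dict records (stand-in reader)
    bronze       -- dict mapping (source_name, batch_id) -> landed records
    load_history -- set of (source_name, batch_id) keys that have already landed
    """
    landed = []
    for entry in registry:
        if not entry.enabled:
            continue
        key = (entry.source_name, batch_id)
        if key in load_history:
            continue  # idempotency: re-running the same batch is a no-op
        arrived_at = datetime.now(timezone.utc).isoformat()
        # Attach arrival metadata to every raw record before landing it.
        bronze[key] = [
            {**rec, "_source": entry.source_name,
             "_batch_id": batch_id, "_arrived_at": arrived_at}
            for rec in fetch_batch(entry)
        ]
        load_history.add(key)
        landed.append(entry.source_name)
    return landed


registry = [
    SourceEntry("orders", "azure_sql_cdc"),
    SourceEntry("legacy_exports", "files", enabled=False),
]
bronze, load_history = {}, set()
fetch = lambda entry: [{"id": 1}, {"id": 2}]

first_run = ingest_bronze(registry, fetch, bronze, load_history, "2024-01-01")
second_run = ingest_bronze(registry, fetch, bronze, load_history, "2024-01-01")
print(first_run, second_run)
```

Onboarding a new source in this model means adding a registry row rather than writing a new pipeline, and the second call lands nothing because the batch key is already in the load history.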
How the architecture works
Three lakehouses, one workspace per environment, one orchestrator pipeline, one semantic model.
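As a rough illustration of what "one orchestrator pipeline" does, the sketch below runs the three medallion stages in order, retries a failing stage, and stops before downstream stages if a stage keeps failing. The shipped orchestrator is a Fabric pipeline defined in JSON, not Python; run_notebook, max_attempts, and backoff_seconds are hypothetical names used only for this example, and timeout and alerting are left out.

```python
import time

# Stage names taken from the notebook filenames in the package listing.
STAGES = ["10_ingest_bronze", "20_transform_silver", "30_serve_gold"]


def run_pipeline(run_notebook, stages=STAGES, max_attempts=3, backoff_seconds=0.0):
    """Run each stage in order and return a list of (stage, status, attempts).

    run_notebook is a hypothetical callable that raises on failure.
    A stage that fails on every attempt halts the pipeline, so later
    stages never run against inputs that were not refreshed.
    """
    results = []
    for stage in stages:
        for attempt in range(1, max_attempts + 1):
            try:
                run_notebook(stage)
                results.append((stage, "succeeded", attempt))
                break
            except Exception:
                if attempt == max_attempts:
                    results.append((stage, "failed", attempt))
                    return results
                # Linear backoff before the next attempt.
                time.sleep(backoff_seconds * attempt)
    return results


calls = {}


def flaky_notebook(stage):
    """Stand-in runner: the Silver stage fails once, then succeeds."""
    calls[stage] = calls.get(stage, 0) + 1
    if stage == "20_transform_silver" and calls[stage] == 1:
        raise RuntimeError("transient failure")


for row in run_pipeline(flaky_notebook):
    print(row)
```

Stopping at the first exhausted stage is a design choice in this sketch: Gold is only rebuilt when Silver has actually been refreshed.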
From download to production data flow
- Day 1 morning. Download the repo. Open workspace_config.json, fill in your tenant + capacity + workspace names. Run deploy.ps1. Three lakehouses appear in your tenant; Git integration kicks in.
- Day 1 afternoon. Connect your first source. Add a row to the source_registry control table. Run the Bronze ingest notebook. Verify data lands.
- Day 2. Configure the Silver transformation for that source. Copy a template, adjust the column mappings, run. Watch data flow Bronze → Silver.
- Day 3. Define the Gold star schema. Build your first fact + 2-3 dimensions using the templates. Wire the semantic model to your Gold tables.
- Day 4. Open the starter Power BI report, swap in your model, publish. Your first dashboard is live on real data.
- Week 2. Add 2-3 more sources following the same pattern. Configure the orchestrator pipeline schedule. Hand over the runbook to your operations team.
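The Day 2 step ("copy a template, adjust the column mappings, run") can be pictured with the small sketch below, which renames and casts columns and then keeps the latest row per key. It is an illustration under assumptions, not the shipped template: the package describes Delta MERGE on Spark, whereas this is plain Python, and the COLUMN_MAP entries, the to_silver function, and the _arrived_at ordering column are hypothetical.

```python
from datetime import date

# Hypothetical mapping: bronze column -> (silver column, converter).
COLUMN_MAP = {
    "CustID": ("customer_id", int),
    "CustName": ("customer_name", str.strip),
    "SignupDt": ("signup_date", date.fromisoformat),
}


def to_silver(bronze_rows, column_map, key, order_by):
    """Rename and cast columns, then keep only the latest row per key.

    key      -- silver column that identifies a record
    order_by -- bronze column used to decide which duplicate is newest
    """
    latest = {}
    for row in bronze_rows:
        # Type conformance: apply the converter while renaming each column.
        conformed = {new: cast(row[old]) for old, (new, cast) in column_map.items()}
        conformed[order_by] = row[order_by]
        k = conformed[key]
        # Deduplication: a later arrival replaces an earlier one.
        if k not in latest or conformed[order_by] > latest[k][order_by]:
            latest[k] = conformed
    return sorted(latest.values(), key=lambda r: r[key])


bronze_rows = [
    {"CustID": "7", "CustName": " Ada ", "SignupDt": "2024-01-05",
     "_arrived_at": "2024-01-06T00:00:00"},
    {"CustID": "7", "CustName": "Ada L.", "SignupDt": "2024-01-05",
     "_arrived_at": "2024-01-07T00:00:00"},
    {"CustID": "3", "CustName": "Bob", "SignupDt": "2023-12-31",
     "_arrived_at": "2024-01-06T00:00:00"},
]

for row in to_silver(bronze_rows, COLUMN_MAP, key="customer_id", order_by="_arrived_at"):
    print(row)
```

In this shape, adapting the template to a new source is mostly a matter of editing the mapping dictionary, which is the sense in which the step above says "adjust the column mappings".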
Pricing
One-time purchase. Includes 12 months of free updates. Cancel any time — your repo stays yours.
Individual
- Full repo + docs
- 12 months of updates
- Email support (5-day response)
Team
- Everything in Individual
- Commercial license, client projects
- Priority email support (2-day)
- 30-minute onboarding call
Site
- Everything in Team
- Unlimited org-wide use
- 2-hour implementation workshop
- Lifetime updates
What's NOT included
Honesty matters more than upsell. This accelerator gets you a Fabric medallion lakehouse foundation. It does not include:
- The data itself. You bring your sources.
- Source-specific connectors beyond the templates (we ship Azure SQL CDC, files, and REST patterns; bring your own for SAP, Workday, etc.).
- Custom semantic-model business logic. The template has the structure; you fill in the measures.
- White-glove implementation. For that, our End-to-End Solution Build service uses this accelerator as the foundation.
Frequently asked questions
Do I need a Fabric capacity already?
What's the minimum F-SKU I need?
Can I use it on multiple client projects?
Do updates require re-deploying?
What if I find a bug?
Can I extend it?
Refund policy?
License terms
Individual: Single named developer. Personal projects + commercial work where you are the sole consumer. May not be used to deliver to clients.
Team: Up to 10 named developers within one organization. May be used on unlimited internal projects and on client-facing engagements as part of your services. May not be resold standalone.
Site: Unlimited developers within one organization. Same commercial-use rights as Team. Lifetime updates included.
All tiers: you may modify the source freely. You may not redistribute the original or modified source as a competing accelerator product.
Start with the right foundation
Most Fabric projects spend their first month rebuilding the medallion pattern. Buy ours, skip the month.
Pairs well with
Real-Time Anomaly Detection
Add streaming sources and Activator alerts on top of your medallion foundation.
From $799
Finance Data Agent (AI)
Put a conversational layer on top of your Gold semantic model.
From $699
Power BI Governance Pack (Governance)
Workspace taxonomy and deployment pipelines for your BI estate.
From $599