22-25 April 2026

Build a Lakehouse in a Day with Metadata & Open-Source Tools

Proposed session for SQLBits 2026

TL;DR

In this workshop, participants will gain an in-depth understanding of how CF.Cumulus integrates Azure Data Factory, Azure Databricks, Azure Synapse Analytics, Microsoft Fabric, and other resources to streamline the delivery of data insights.

Session Details

Unlock the power and speed of a metadata-driven Lakehouse architecture.

In a fast-paced, data-driven world, the ability to deliver a robust data platform swiftly and efficiently is key to maintaining a competitive edge. Join us for an immersive, full-day, hands-on workshop in which we guide you through building a metadata-driven Lakehouse using the open-source framework CF.Cumulus, which leverages and abstracts Microsoft cloud-native technologies to ease delivery challenges.

During this workshop, participants will gain an in-depth understanding of how CF.Cumulus integrates Azure Data Factory, Azure Databricks, Azure Synapse Analytics, Microsoft Fabric, and other resources to streamline the delivery of data insights.

Our expert instructors will share practical insights on overcoming common data challenges, including fragmented data ingestion, change data capture, and orchestration scalability, using our proven best practices.

Attendees will learn how to utilise metadata, open standards, and seamless cloud integration to accelerate time-to-insight with minimal technical debt, while keeping costs under control and maintaining operational resilience. This workshop is ideal for data engineers and data leaders who want to improve their cloud data platform delivery and unlock the potential to build a Lakehouse in a day using a metadata-driven approach.

3 things you'll get out of this session

• Demonstrate how to build a metadata-driven Lakehouse using Microsoft cloud-native technologies.
• Showcase strategies to overcome data challenges such as fragmented data ingestion, change data capture, and orchestration scalability.
• Highlight the use of automation, open standards, and seamless cloud integration to achieve fast, efficient data platform delivery with minimal technical debt.