From STTM to Snowflake SQL: Building a Metadata-Driven Data Engineering Copilot

#dataengineering #ai #snowflake #etl

Most data engineering teams do not struggle because they lack smart people.

They struggle because too much of the delivery process is still repetitive.

A source-to-target mapping document comes in.

Then someone has to manually create:

target table DDL
transformation SQL
data dictionary
technical specification
data quality rules
reconciliation checks
test cases

For one or two tables, this is manageable.

For a real enterprise program with many tables, changing requirements, multiple source systems, and repeated delivery cycles, this becomes a major productivity problem.

That is the problem I am exploring with Data Engineering Copilot.

Website: https://dataengineeringcopilot.com

The idea

The idea is simple:


text
Upload STTM
   ↓
Parse metadata
   ↓
Normalize into a canonical metadata model
   ↓
Generate engineering artifacts

Top comments (1)

Amit Kumar Singh • Jun 14

🚀 Project Update

A lot has happened since publishing this article.

DE Copilot has evolved from generating Snowflake SQL to a broader metadata intelligence platform.

Recent additions:

✅ Canonical Metadata Model

✅ ER Diagram Generation

✅ Snowflake DDL Generation

✅ Snowflake SQL Generation

✅ Data Dictionary Generation

✅ Technical Specification Generation

✅ Data Quality Rule Generation

✅ AI-Powered Metadata Analysis

✅ Downloadable ZIP Project Packages

✅ Relationship-Aware SQL Generation using Lookup & Join Metadata

Current Architecture:

STTM → Metadata Discovery Engine → Canonical Metadata Model → Artifact Factory

The more I work on this, the more I believe the Canonical Metadata Model is the key abstraction layer. Once metadata is normalized, generating downstream engineering artifacts becomes much easier.

Build metadata once. Generate everywhere.

Website:
dataengineeringcopilot.com