Perfil de habilidad

Data Lake Architecture

Medallion architecture, data zones, partitioning, compaction, storage organization

Data Engineering Data Lakehouse

Roles

2

donde aparece esta habilidad

Niveles

5

ruta de crecimiento estructurada

Requisitos obligatorios

10

los otros 0 opcionales

Dominio

Data Engineering

skills.group

Data Lakehouse

Última actualización

17/3/2026

Cómo usar

Selecciona tu nivel actual y compara las expectativas.

Qué se espera en cada nivel

La tabla muestra cómo crece la profundidad desde Junior hasta Principal.

Rol Obligatorio Descripción
Analytics Engineer Obligatorio Understands data lake zone architecture (raw, curated, consumption). Queries data from curated layers using SQL and dbt models. Follows team conventions for partitioning, file formats, and naming standards.
Data Engineer Obligatorio Understands medallion architecture principles (bronze/silver/gold). Ingests data into raw landing zones using batch loaders and schema registries. Follows established patterns for Parquet/ORC file layout and catalog metadata.
Rol Obligatorio Descripción
Analytics Engineer Obligatorio Independently builds analytics data products on top of data lake layers using dbt and Spark SQL. Optimizes query performance through intelligent partitioning and Z-ordering. Ensures data quality with Great Expectations checks at zone boundaries.
Data Engineer Obligatorio Independently designs ETL pipelines across data lake zones with schema evolution support. Optimizes storage costs using lifecycle policies, compaction, and tiered storage. Implements data quality gates between medallion layers with automated validation.
Rol Obligatorio Descripción
Analytics Engineer Obligatorio Architects data systems with Data Lake Architecture. Optimizes for big data. Implements data governance and quality frameworks.
Data Engineer Obligatorio Designs data architecture with Data Lake Architecture. Optimizes for big data. Implements data governance and quality frameworks.
Rol Obligatorio Descripción
Analytics Engineer Obligatorio Defines the data lake architecture for the analytics platform: medallion approach (bronze/silver/gold), storage format selection (Parquet, Delta, Iceberg). Implements partitioning and retention standards for cost optimization.
Data Engineer Obligatorio Defines data lake standards: zone architecture (bronze/silver/gold), file formats, partition strategies. Implements access control and data classification. Coordinates between data producers and consumers.
Rol Obligatorio Descripción
Analytics Engineer Obligatorio Architects the enterprise lakehouse: Delta Lake/Iceberg as open table format, integration with dbt for transformations, unified governance. Defines the strategy for combining data lake and warehouse for different analytical workloads.
Data Engineer Obligatorio Designs data lakehouse architecture: unified storage layer, query engine federation (Trino/Spark), governance framework. Defines when lakehouse vs traditional DWH vs data mesh.

Comunidad

👁 Seguir ✏️ Sugerir cambio Inicia sesión para sugerir cambios
📋 Propuestas
Aún no hay propuestas para Data Lake Architecture
Cargando comentarios...