Skill profile

Apache Airflow

DAGs, operators, sensors, XComs, dynamic task mapping, KubernetesPodOperator

Roles: 4 (roles that include this skill)
Levels: 5 (structured growth path)
Mandatory requirements: 20 (the remaining 0 are optional)
Domain: Data Engineering
Group: Data Orchestration
Last updated: 2026/3/17

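How to use

Pick your current level and compare yourself against the expectations. The entries below show what you need to master to move up a level.

Expectations by level

The tables below show how the expected depth of this skill changes from junior to principal.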

Level 1 of 5 (junior)
Role | Requirement | Description
Analytics Engineer | Required | Understands basic Airflow concepts: DAGs, operators, and task dependencies. Follows established DAG templates to build simple transformation pipelines. Uses dbt + Airflow integration patterns defined by the team.
BI Analyst | Required | Understands basic Airflow DAG structure and scheduling concepts. Monitors scheduled report refresh pipelines and identifies failures. Follows team guidelines for triggering dashboard data updates through the Airflow UI.
Data Analyst | Required | Understands basic Airflow concepts and DAG scheduling. Monitors data pipeline runs that feed analytical datasets. Follows team documentation to trigger ad-hoc DAG runs for data refresh and extraction tasks.
Data Engineer | Required | Creates Airflow DAGs with PythonOperator, BashOperator, and task dependencies. Understands execution date, catchup, and schedule interval. Monitors runs in the Airflow UI. Debugs failed tasks through logs.
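The scheduling terms in the Data Engineer row (execution date, catchup, schedule interval) can be illustrated with a small plain-Python model. This is only a sketch and not Airflow's scheduler: the function names `completed_intervals` and `runs_to_schedule` are hypothetical, and a fixed-length interval is assumed. It mirrors the behaviour of interval-based schedules in Airflow 2.x, where a run is created only after its data interval has ended, the execution (logical) date is the start of that interval, and with catchup disabled only the most recent completed interval gets a run.

```python
from datetime import datetime, timedelta


def completed_intervals(start_date, interval, now):
    """Return (logical_date, interval_end) for every interval that has fully elapsed.

    Hypothetical helper: models the rule that a run for [start, end) is only
    created once `end` has passed, with logical_date equal to `start`.
    """
    runs = []
    cursor = start_date
    while cursor + interval <= now:
        runs.append((cursor, cursor + interval))
        cursor += interval
    return runs


def runs_to_schedule(start_date, interval, now, catchup):
    """Model of the catchup flag: all missed intervals, or only the latest one."""
    runs = completed_intervals(start_date, interval, now)
    if catchup:
        return runs       # backfill every completed interval since start_date
    return runs[-1:]      # only the most recent completed interval (may be empty)
```

Calling `runs_to_schedule(datetime(2024, 1, 1), timedelta(days=1), datetime(2024, 1, 4, 12), catchup=True)` yields three runs in this model, because the interval starting on 4 January has not finished yet.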
Level 2 of 5
Role | Requirement | Description
Analytics Engineer | Required | Independently builds Airflow DAGs for ELT pipelines with dbt operators and data quality checks. Configures retry policies, SLAs, and alerting for transformation jobs. Optimizes task parallelism and resource pools.
BI Analyst | Required | Independently configures Airflow DAGs for scheduled report generation and dashboard data refresh. Implements data quality sensors to validate source data before BI layer updates. Troubleshoots pipeline failures affecting reporting.
Data Analyst | Required | Independently builds Airflow DAGs for automated data extraction and cohort preparation pipelines. Implements data validation tasks with Great Expectations integration. Configures scheduling for recurring analytical data refreshes.
Data Engineer | Required | Designs Airflow DAGs using dynamic task generation, XCom for passing data between tasks, and TaskGroups for organization. Uses sensors and hooks for external system integration. Configures connections and variables.
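Several rows at this level mention sensors. The idea behind a sensor is a poll loop: check a condition, wait for a poke interval, and fail once a timeout is reached. The sketch below models that loop in plain Python. It is not Airflow's sensor API; `wait_for` and its parameters are hypothetical names chosen to echo the `poke_interval` and `timeout` settings, and the `sleep` and `clock` arguments exist only so the loop can be exercised without real waiting.

```python
import time


def wait_for(condition, poke_interval=60.0, timeout=3600.0,
             sleep=time.sleep, clock=time.monotonic):
    """Poll `condition` until it returns True or `timeout` seconds have elapsed.

    Hypothetical model of a sensor's poke loop. `sleep` and `clock` are
    injectable so that tests can substitute a fake clock.
    """
    deadline = clock() + timeout
    while True:
        if condition():
            return True                      # upstream data is ready
        if clock() >= deadline:
            raise TimeoutError("condition not met before timeout")
        sleep(poke_interval)                 # wait before the next poke
```

In a real DAG the condition would typically check something external, such as whether a file or partition has landed; here it is any zero-argument callable.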
Level 3 of 5
Role | Requirement | Description
Analytics Engineer | Required | Architects Airflow-based data systems. Optimizes pipelines for large data volumes. Implements data governance and data quality frameworks.
BI Analyst | Required | Designs Airflow-based data pipeline architecture for an enterprise BI platform. Implements complex dependency graphs across multiple data sources with SLA monitoring. Mentors the team on DAG design patterns for reporting workflows.
Data Analyst | Required | Designs Airflow pipeline architecture for complex analytical workflows with cross-dataset dependencies. Implements data lineage tracking and audit logging. Optimizes DAG performance for large-scale analytical data processing.
Data Engineer | Required | Designs Airflow architecture: KubernetesExecutor for dynamic scaling, custom operators and hooks, and the DAG factory pattern for generating DAGs. Optimizes performance through pool management, priority weights, and concurrency settings.
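The DAG factory pattern named in the Data Engineer row can be sketched without Airflow installed: one function turns a small config entry into a pipeline definition, and a loop produces one definition per entry. Everything here is illustrative. `DagSpec`, `make_dag_spec`, the `SOURCES` mapping, and the extract/transform/load step names are hypothetical. In an actual deployment the factory would return Airflow `DAG` objects and bind them to module-level names so the scheduler can discover them.

```python
from dataclasses import dataclass
from typing import Tuple


@dataclass(frozen=True)
class DagSpec:
    """Plain stand-in for a DAG: an id, a schedule, and a linear task chain."""
    dag_id: str
    schedule: str
    tasks: Tuple[str, ...]  # ordered; each task runs after the previous one


def make_dag_spec(source: str, schedule: str) -> DagSpec:
    """Factory: build one pipeline definition from one config entry."""
    steps = ("extract", "transform", "load")  # hypothetical step names
    return DagSpec(
        dag_id=f"ingest_{source}",
        schedule=schedule,
        tasks=tuple(f"{step}_{source}" for step in steps),
    )


# Hypothetical config: one entry per source system.
SOURCES = {"orders": "@hourly", "customers": "@daily"}

# One spec per config entry, keyed by dag_id.
DAG_SPECS = {
    spec.dag_id: spec
    for spec in (make_dag_spec(name, sched) for name, sched in SOURCES.items())
}
```

The design point is that adding a new source means adding one config entry, not copying a DAG file.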
Level 4 of 5
Role | Requirement | Description
Analytics Engineer | Required | Defines orchestration strategy for the analytics pipeline: Airflow for coordinating dbt runs, sensors for upstream data dependencies. Implements DAG design standards: idempotency, retry policies, SLA monitoring for analytical models.
BI Analyst | Required | Defines BI data pipeline strategy and Airflow platform standards. Establishes DAG development guidelines, code review practices, and deployment workflows for the reporting team. Coordinates data freshness SLAs with stakeholders.
Data Analyst | Required | Defines analytical data pipeline strategy and Airflow governance standards. Establishes DAG naming conventions, testing requirements, and monitoring practices. Drives adoption of self-service pipeline creation among analyst teams.
Data Engineer | Required | Defines Airflow standards: DAG structure, naming conventions, testing requirements, deployment workflow. Chooses between Airflow and alternatives (Dagster, Prefect) depending on the scenario.
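Standards such as naming conventions and testing requirements are easier to enforce when they are expressed as an automated check. The sketch below shows one way that might look. The rules themselves are assumptions made up for illustration and do not come from this profile: a `<team>__<pipeline>` snake_case DAG id, at least one retry, and a non-empty owner. `check_dag_standards` is a hypothetical helper, not part of Airflow.

```python
import re

# Hypothetical convention: "<team>__<pipeline>", each part lowercase snake_case.
_PART = r"[a-z][a-z0-9]*(?:_[a-z0-9]+)*"
DAG_ID_PATTERN = re.compile(rf"^{_PART}__{_PART}$")


def check_dag_standards(dag_id: str, retries: int, owner: str) -> list:
    """Return a list of human-readable violations (empty if the DAG complies)."""
    violations = []
    if not DAG_ID_PATTERN.match(dag_id):
        violations.append(f"dag_id {dag_id!r} does not match <team>__<pipeline>")
    if retries < 1:
        violations.append("retries must be at least 1")
    if not owner.strip():
        violations.append("owner must be set")
    return violations
```

A check like this could run in CI over every DAG definition, so that a non-compliant DAG fails review before deployment.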
Level 5 of 5 (principal)
Role | Requirement | Description
Analytics Engineer | Required | Architects enterprise analytics platform orchestration: Airflow/Dagster for multi-project dbt, event-driven triggers, cross-team dependency management. Defines migration strategy to managed orchestration (dbt Cloud, Dagster Cloud).
BI Analyst | Required | Defines enterprise data orchestration strategy spanning Airflow, dbt, and BI tools. Evaluates orchestration platforms and migration paths. Shapes organizational data delivery standards and cross-team pipeline governance.
Data Analyst | Required | Defines enterprise analytical data orchestration strategy. Shapes organizational standards for pipeline reliability and data delivery guarantees. Evaluates next-generation orchestration tools and drives platform evolution decisions.
Data Engineer | Required | Designs orchestration strategy: Airflow for batch, event-driven processing for real-time workloads, and hybrid patterns. Defines multi-team governance, shared infrastructure, and cost allocation.
