技能档案

ML Pipelines

Kubeflow, Vertex AI, SageMaker Pipelines, reproducibility, training automation

Machine Learning & AI MLOps

角色数

6

包含此技能的角色

级别数

5

结构化成长路径

必要要求

21

其余 9 个可选

领域

Machine Learning & AI

skills.group

MLOps

最后更新

2026/3/17

如何使用

选择当前级别并对比期望。下方卡片显示晋升所需掌握的内容。

各级别期望

表格展示从初级到首席的技能深度变化。点击行查看详情。

角色 必要性 描述
Computer Vision Engineer Understands ML pipeline stages for computer vision: data loading, augmentation, training, and evaluation. Runs existing pipelines and tracks experiment metrics using MLflow or Weights & Biases.
Data Scientist 必要 Creates basic ML pipelines via scikit-learn Pipeline: preprocessing, feature engineering, model training. Understands pipeline importance for reproducibility and data leakage prevention. Saves pipelines as single artifacts for deployment.
LLM Engineer Knows ML pipeline basics: data loading, preprocessing, training, evaluation. Builds simple pipelines for LLM fine-tuning data preparation using Hugging Face Datasets.
ML Engineer 必要 Understands ML pipeline concept: data → features → training → evaluation → deployment. Writes simple pipeline scripts. Uses Airflow DAG for basic ML workflow.
MLOps Engineer Understands ML pipeline concepts: data validation, model training, and deployment stages. Configures pipeline runners (Airflow, Kubeflow) and monitors pipeline health metrics.
NLP Engineer 必要 Knows ML pipeline basics for NLP: data collection, preprocessing, feature extraction, training, evaluation. Uses scikit-learn Pipeline and spaCy for building simple NLP pipelines.
角色 必要性 描述
Computer Vision Engineer Builds end-to-end ML pipelines for vision models: dataset versioning, distributed training, hyperparameter tuning, and model registry integration. Implements data quality checks for image datasets.
Data Scientist 必要 Designs production ML pipelines using Airflow, Prefect, or Dagster. Automates the full ML cycle: data ingestion, validation, feature engineering, training, evaluation, deployment. Configures scheduling and retry logic.
LLM Engineer Independently builds production ML pipelines for LLM: data ingestion, cleaning, tokenization, training, evaluation. Uses Airflow or Prefect for orchestration, ensures idempotency.
ML Engineer 必要 Designs ML pipelines with Kubeflow/Airflow. Configures parameterized pipelines for different models. Automates retraining with data quality checks. Implements pipeline testing.
MLOps Engineer Implements automated ML pipelines with feature stores, model versioning, and A/B testing infrastructure. Builds CI/CD for model training with automated retraining triggers on data drift detection.
NLP Engineer 必要 Independently designs ML pipelines for NLP tasks: data versioning, text feature engineering, hyperparameter tuning, model selection. Automates via Airflow or Prefect.
角色 必要性 描述
Computer Vision Engineer 必要 Designs scalable ML pipeline architectures for large-scale vision systems: multi-GPU training orchestration, model distillation workflows, and production inference pipelines with latency optimization.
Data Scientist 必要 Designs scalable ML pipelines for enterprise: Kubeflow, Vertex AI Pipelines, SageMaker Pipelines. Implements continuous training, automated model promotion. Optimizes pipeline performance through caching, parallel execution, and incremental processing.
LLM Engineer Designs complex ML pipelines for LLM platforms: multi-stage data processing, continuous training, automated retraining triggers. Optimizes pipeline throughput and reliability.
ML Engineer 必要 Designs ML pipeline architecture. Optimizes pipeline execution (caching, parallel steps). Configures CI/CD for pipeline deployment. Ensures reproducibility.
MLOps Engineer 必要 Designs enterprise ML platform with pipeline orchestration, model governance, and reproducibility guarantees. Implements canary deployments for models, automated rollback on performance degradation, and cost-optimized training infrastructure.
NLP Engineer 必要 Designs production ML pipelines for NLP systems. Implements CI/CD for models, automatic retraining on drift detection, A/B testing of NLP models with automatic promotion.
角色 必要性 描述
Computer Vision Engineer 必要 Defines ML Pipelines strategy at the team/product level. Establishes standards and best practices. Conducts reviews.
Data Scientist 必要 Defines ML pipeline infrastructure strategy for the data science team. Establishes pipeline development, testing, and monitoring standards. Coordinates unification of pipeline approaches across projects and teams.
LLM Engineer Defines ML pipeline standards for the LLM team. Establishes guidelines for pipeline architecture, monitoring, error handling. Coordinates pipeline infrastructure for training and inference.
ML Engineer 必要 Defines ML pipeline strategy. Standardizes pipeline components. Designs pipeline templating for faster development.
MLOps Engineer 必要 Defines the ML Pipelines strategy at team/product level. Establishes standards and best practices. Conducts reviews.
NLP Engineer 必要 Defines ML pipeline standards for the NLP team. Establishes MLOps best practices, defines model lifecycle management processes, and ensures reproducibility of all NLP experiments.
角色 必要性 描述
Computer Vision Engineer 必要 Defines ML Pipelines strategy at the organizational level. Establishes enterprise approaches. Mentors leads and architects.
Data Scientist 必要 Shapes ML orchestration platform strategy at organizational level. Defines enterprise requirements: scalability, governance, cost management. Designs unified ML pipeline platform for all data science teams.
LLM Engineer Shapes enterprise ML pipeline platform. Defines approaches to unified pipeline frameworks, cross-team pipeline components, and SLA and cost management for data/training/serving pipelines.
ML Engineer 必要 Defines ML pipeline platform strategy. Evaluates pipeline orchestrators. Designs enterprise ML pipeline architecture.
MLOps Engineer 必要 Defines the ML Pipelines strategy at the organizational level. Establishes enterprise approaches. Mentors leads and architects.
NLP Engineer 必要 Shapes enterprise MLOps strategy for the NLP platform. Defines ML pipeline standards, model governance, and infrastructure for scaling NLP ML operations at organizational level.

社区

👁 关注 ✏️ 建议修改 登录以建议修改
📋 提案
暂无提案 ML Pipelines
正在加载评论...