Skill profile

LLM Deployment

This skill defines expectations across roles and levels.


Roles: where this skill appears

Levels: structured development path

Mandatory requirements: the other 5 are optional

Machine Learning & AI

LLM & Generative AI

22.2.2026

Select your current level and compare the expectations.

What is expected at each level

The table shows how the depth grows from Junior to Principal.

Level	Role	Required	Description
Junior	LLM Engineer		Knows LLM deployment basics: REST API endpoints, model loading, basic serving. Deploys a simple inference server on vLLM or text-generation-inference under mentor guidance.
Middle	LLM Engineer		Independently deploys LLMs to production: configures vLLM with continuous batching, quantization (GPTQ/AWQ), and health checks. Implements monitoring of latency, throughput, and error rates.
Senior	LLM Engineer		Designs production LLM serving infrastructure: multi-model serving, A/B testing, canary deployments, auto-scaling. Optimizes latency (p50/p95/p99) and throughput under high load.
Lead / Staff	LLM Engineer		Defines the LLM deployment strategy for the team. Establishes SLAs for inference services, monitoring standards, and rollback and incident response processes for LLM production systems.
Principal	LLM Engineer		Shapes the enterprise LLM serving platform. Defines approaches to multi-model inference at scale, cost optimization, capacity planning, and disaster recovery for critical LLM services.

Junior: 1 requirement

LLM Engineer

Knows LLM deployment basics: REST API endpoints, model loading, basic serving. Deploys a simple inference server on vLLM or text-generation-inference under mentor guidance.
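As a rough illustration of the "REST API endpoint" part of this level, the sketch below builds a completion request for an inference server. It assumes the server exposes an OpenAI-compatible `/v1/completions` route, as vLLM's API server does; the base URL and the model name `example-model` are placeholders, not values from this profile.

```python
import json
import urllib.request


def build_completion_request(base_url, model, prompt, max_tokens=64):
    """Build (but do not send) a POST request for a text completion."""
    payload = {"model": model, "prompt": prompt, "max_tokens": max_tokens}
    return urllib.request.Request(
        url=base_url.rstrip("/") + "/v1/completions",
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def send(request, timeout_s=30.0):
    """Send the request and return the decoded JSON response body."""
    with urllib.request.urlopen(request, timeout=timeout_s) as response:
        return json.load(response)


# Hypothetical local server and model name; adjust to the actual deployment.
request = build_completion_request("http://localhost:8000", "example-model", "Hello")
```

Calling `send(request)` only works against a running server, so building and sending are kept separate and the request can be inspected offline.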

Middle: 1 requirement

LLM Engineer

Independently deploys LLMs to production: configures vLLM with continuous batching, quantization (GPTQ/AWQ), and health checks. Implements monitoring of latency, throughput, and error rates.
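The monitoring part of this level can be sketched without any serving framework. The example below assumes request records have already been collected over a fixed time window; the `RequestRecord` fields and the `summarize` helper are illustrative names, not part of vLLM or any monitoring tool.

```python
from dataclasses import dataclass


@dataclass
class RequestRecord:
    latency_s: float     # wall-clock time for the request
    output_tokens: int   # tokens generated (0 for failed requests)
    ok: bool             # False if the request ended in an error


def summarize(records, window_s):
    """Aggregate throughput, error rate, and mean latency for one window."""
    if not records or window_s <= 0:
        raise ValueError("need at least one record and a positive window")
    succeeded = [r for r in records if r.ok]
    mean_latency = (
        sum(r.latency_s for r in succeeded) / len(succeeded) if succeeded else None
    )
    return {
        "requests_per_s": len(records) / window_s,
        "tokens_per_s": sum(r.output_tokens for r in succeeded) / window_s,
        "error_rate": (len(records) - len(succeeded)) / len(records),
        "mean_latency_s": mean_latency,  # successful requests only
    }


# Made-up sample data for a 10-second window.
sample = [
    RequestRecord(0.5, 100, True),
    RequestRecord(1.5, 300, True),
    RequestRecord(0.2, 0, False),
    RequestRecord(1.0, 200, True),
]
summary = summarize(sample, window_s=10)
```

In a real deployment these numbers would usually come from the serving stack's own metrics rather than from hand-collected records; the sketch only shows what the three quantities mean.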

Senior: 1 requirement

LLM Engineer

Designs production LLM serving infrastructure: multi-model serving, A/B testing, canary deployments, auto-scaling. Optimizes latency (p50/p95/p99) and throughput under high load.
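The p50/p95/p99 figures mentioned here are latency percentiles. A minimal sketch using the nearest-rank definition follows; other definitions (for example, interpolating ones) give slightly different values, and the function name and sample data are illustrative only.

```python
def percentile(values, p):
    """Nearest-rank percentile: smallest value with at least p% of samples at or below it."""
    if not values:
        raise ValueError("values must not be empty")
    if not isinstance(p, int) or not 1 <= p <= 100:
        raise ValueError("p must be an integer between 1 and 100")
    ordered = sorted(values)
    # Integer ceiling of p * n / 100, which avoids floating-point rounding issues.
    rank = (p * len(ordered) + 99) // 100
    return ordered[rank - 1]


# Made-up latencies in milliseconds: 1, 2, ..., 100.
latencies_ms = list(range(1, 101))
report = {f"p{p}": percentile(latencies_ms, p) for p in (50, 95, 99)}
```

Tail percentiles matter because a mean hides slow outliers: a service can have a low average latency while its p99 is many times higher.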

Lead / Staff: 1 requirement

LLM Engineer

Defines the LLM deployment strategy for the team. Establishes SLAs for inference services, monitoring standards, and rollback and incident response processes for LLM production systems.
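One common way to make an availability SLA operational is an error budget: the amount of downtime the target still allows within a window. The sketch below assumes a time-based availability target over a 30-day window; the 99.9% target and the function names are hypothetical examples, not values taken from this profile.

```python
def allowed_downtime_minutes(slo_target, window_days=30):
    """Minutes of downtime permitted by an availability target over the window."""
    if not 0.0 < slo_target < 1.0:
        raise ValueError("slo_target must be strictly between 0 and 1")
    return (1.0 - slo_target) * window_days * 24 * 60


def budget_remaining(slo_target, observed_bad_minutes, window_days=30):
    """Fraction of the error budget still unspent; negative means the target is breached."""
    budget = allowed_downtime_minutes(slo_target, window_days)
    return (budget - observed_bad_minutes) / budget


# Hypothetical target: 99.9% availability allows about 43.2 minutes per 30 days.
budget = allowed_downtime_minutes(0.999)
remaining = budget_remaining(0.999, observed_bad_minutes=21.6)
```

A team could tie rollback and release policy to this number, for example by pausing risky rollouts once the remaining budget drops below an agreed threshold.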

Principal: 1 requirement

LLM Engineer

Shapes the enterprise LLM serving platform. Defines approaches to multi-model inference at scale, cost optimization, capacity planning, and disaster recovery for critical LLM services.
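Capacity planning and cost optimization often start from a back-of-the-envelope estimate like the one sketched below. All inputs (peak load, per-replica throughput, utilization target, spare replicas, GPU price) are hypothetical assumptions; real figures would come from load tests and provider pricing.

```python
import math


def replicas_needed(peak_rps, per_replica_rps, target_utilization=0.75, spare_replicas=1):
    """Replicas required to serve peak load below the utilization target, plus spares."""
    if peak_rps < 0 or per_replica_rps <= 0:
        raise ValueError("peak_rps must be >= 0 and per_replica_rps must be > 0")
    if not 0.0 < target_utilization <= 1.0:
        raise ValueError("target_utilization must be in (0, 1]")
    usable_rps = per_replica_rps * target_utilization
    return math.ceil(peak_rps / usable_rps) + spare_replicas


def monthly_gpu_cost(replicas, gpus_per_replica, gpu_hour_price, hours=720):
    """Cost of running all replicas continuously for the given number of hours."""
    return replicas * gpus_per_replica * gpu_hour_price * hours


# Hypothetical inputs: 120 req/s at peak, 8 req/s per replica, 1 GPU each at 2.50 per hour.
replicas = replicas_needed(peak_rps=120, per_replica_rps=8)
cost = monthly_gpu_cost(replicas, gpus_per_replica=1, gpu_hour_price=2.5)
```

The spare replica is a simple stand-in for redundancy; a disaster recovery plan for critical services would also need to consider separate regions or clusters, which this estimate does not model.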
