Domäne
Machine Learning & AI
Skill-Profil
Dieser Skill definiert Erwartungen über Rollen und Level.
Rollen
1
wo dieser Skill vorkommt
Stufen
5
strukturierter Entwicklungspfad
Pflichtanforderungen
0
die anderen 5 optional
Machine Learning & AI
LLM & Generative AI
22.2.2026
Wählen Sie Ihr aktuelles Level und vergleichen Sie die Erwartungen.
Die Tabelle zeigt, wie die Tiefe von Junior bis Principal wächst.
| Rolle | Pflicht | Beschreibung |
|---|---|---|
| LLM Engineer | Knows RLHF basics: reward model, PPO, preference learning. Understands why RLHF is used for LLM alignment and studies basic concepts under mentor guidance. |
| Rolle | Pflicht | Beschreibung |
|---|---|---|
| LLM Engineer | Independently implements RLHF pipelines: preference data collection, reward model training, PPO training with trl library. Applies DPO as an alternative to PPO for more stable training. |
| Rolle | Pflicht | Beschreibung |
|---|---|---|
| LLM Engineer | Designs advanced RLHF systems: iterative RLHF, Constitutional AI, reward model ensembles. Optimizes RLHF pipelines for training stability and alignment quality. |
| Rolle | Pflicht | Beschreibung |
|---|---|---|
| LLM Engineer | Defines RLHF strategy for the LLM team. Establishes best practices for data collection, reward modeling, training stability. Coordinates RLHF experiments and production integration. |
| Rolle | Pflicht | Beschreibung |
|---|---|---|
| LLM Engineer | Shapes enterprise RLHF strategy. Defines approaches to scaled preference data collection, advanced alignment techniques, and research directions. Mentors leads on RLHF and alignment research. |