Domäne
Machine Learning & AI
Skill-Profil
Dieser Skill definiert Erwartungen über Rollen und Level.
Rollen
1
wo dieser Skill vorkommt
Stufen
5
strukturierter Entwicklungspfad
Pflichtanforderungen
0
die anderen 5 optional
Machine Learning & AI
Natural Language Processing
22.2.2026
Wählen Sie Ihr aktuelles Level und vergleichen Sie die Erwartungen.
Die Tabelle zeigt, wie die Tiefe von Junior bis Principal wächst.
| Rolle | Pflicht | Beschreibung |
|---|---|---|
| LLM Engineer | Knows tokenization basics: BPE, WordPiece, SentencePiece. Understands how tokenizer affects LLM quality and cost. Uses pre-trained tokenizers from Hugging Face for basic tasks. |
| Rolle | Pflicht | Beschreibung |
|---|---|---|
| LLM Engineer | Independently works with LLM tokenization: analyzes token distribution, optimizes input length, handles special tokens. Trains custom tokenizers on domain-specific corpora. |
| Rolle | Pflicht | Beschreibung |
|---|---|---|
| LLM Engineer | Designs tokenization strategies for LLM: multi-language tokenizer training, vocabulary extension, tokenizer-aware data preprocessing. Optimizes fertility rate and coverage for target domains. |
| Rolle | Pflicht | Beschreibung |
|---|---|---|
| LLM Engineer | Defines tokenization standards for the LLM team. Establishes guidelines for tokenizer selection and training, tokenization quality evaluation, and integration with training and inference pipelines. |
| Rolle | Pflicht | Beschreibung |
|---|---|---|
| LLM Engineer | Shapes enterprise tokenization strategy. Defines approaches to unified tokenizer management, multi-language coverage, tokenizer versioning, and evaluation at organizational scale. |