领域
Observability & Monitoring
技能档案
Log aggregation, LogQL, labels, retention, multi-tenancy, Grafana integration
角色数
2
包含此技能的角色
级别数
5
结构化成长路径
必要要求
6
其余 4 个可选
Observability & Monitoring
Logging
2026/3/17
选择当前级别并对比期望。下方卡片显示晋升所需掌握的内容。
表格展示从初级到首席的技能深度变化。点击行查看详情。
| 角色 | 必要性 | 描述 |
|---|---|---|
| Platform Engineer | Queries logs in Grafana Loki using basic LogQL syntax. Navigates Grafana dashboards to view application log streams. Understands label-based log filtering and basic log aggregation concepts. | |
| Site Reliability Engineer (SRE) | Uses Grafana Loki to search and filter logs during incident investigation. Understands log retention policies and storage concepts. Follows runbooks that reference Loki queries for common troubleshooting scenarios. |
| 角色 | 必要性 | 描述 |
|---|---|---|
| Platform Engineer | Configures Loki ingestion pipelines with Promtail and structured metadata extraction. Builds Grafana dashboards combining Loki logs with Prometheus metrics for correlated observability. Sets up log-based alerting rules for platform health monitoring. | |
| Site Reliability Engineer (SRE) | Configures Loki for multi-tenant log aggregation across services. Creates advanced LogQL queries with metric extraction for SLI tracking. Builds alerting rules on log patterns and participates in on-call rotation using log-based diagnostics. |
| 角色 | 必要性 | 描述 |
|---|---|---|
| Platform Engineer | 必要 | Architects Loki deployment topology for high-throughput multi-cluster log aggregation. Designs log pipeline standards including labeling conventions, retention policies, and cost optimization. Integrates Loki into the platform observability stack alongside tracing and metrics. |
| Site Reliability Engineer (SRE) | 必要 | Designs the organization-wide logging strategy with Loki as the centralized log platform. Defines SLI/SLO based on log-derived metrics and automates error-budget alerting. Leads post-mortems leveraging Loki correlation with distributed traces and APM data. |
| 角色 | 必要性 | 描述 |
|---|---|---|
| Platform Engineer | 必要 | Adopts Grafana Loki as cost-effective logging solution for the platform: multi-tenant configuration, retention policies. Designs label strategy for optimal query performance. Integrates with Grafana for unified observability (logs + metrics + traces in single UI). |
| Site Reliability Engineer (SRE) | 必要 | Defines Loki standards: label strategy (low cardinality), retention policies, query patterns. Implements Loki for cost-effective log aggregation. Compares Loki vs ELK by scenarios. |
| 角色 | 必要性 | 描述 |
|---|---|---|
| Platform Engineer | 必要 | Defines logging strategy: Loki vs ELK vs managed solutions for various platform use cases. Designs Loki at scale: microservices mode, S3 backend, caching. Shapes vision for cost-efficient observability data platform with tiered storage. |
| Site Reliability Engineer (SRE) | 必要 | Designs log aggregation strategy: Loki for Kubernetes-native logging, multi-tenant setup, long-term storage. Defines when Loki vs ELK vs managed (Datadog/Splunk). |