CPU Telemetry & Observability Development Lead
Full-time • US / Canada / UK / India or Hybrid • SW + Tech Lead
Nuvacore is building ground-up CPU silicon for next-generation compute workloads. We are seeking an experienced technical leader to head our CPU Telemetry and Observability organization, responsible for the architecture, development, and deployment of telemetry systems enabling deep visibility into next-generation processor platforms.
This role leads a team developing infrastructure that captures, aggregates, and analyzes on-chip telemetry, performance counters, debug signals, and runtime system metrics across CPU, memory, and platform subsystems. These capabilities are critical for silicon bring-up, performance optimization, reliability monitoring, and large-scale production observability.
You will work at the intersection of CPU architecture, silicon design, firmware, operating systems, cloud infrastructure, and data analytics, enabling engineering teams to extract actionable insights from complex hardware systems.
Technical Leadership
- Define and drive the software architecture for next-generation CPU telemetry and observability infrastructure and toolset.
- Lead development of telemetry systems enabling real-time visibility into CPU microarchitecture behavior, memory performance, and platform interactions.
- Oversee instrumentation and telemetry frameworks including:
  - On-chip performance counters
  - Hardware telemetry registers
  - Trace and event collection systems
  - Runtime system telemetry infrastructure
- Enable first-silicon bring-up and platform validation through advanced telemetry capture and analysis capabilities.
- Diagnose complex performance, power, and reliability issues using large-scale telemetry data.
Telemetry Architecture & Data Infrastructure
- Design scalable telemetry pipelines supporting collection, aggregation, storage, and analysis of high-volume system telemetry.
- Define instrumentation strategies for CPU, SoC, and platform subsystems.
- Ensure telemetry infrastructure supports debugging, performance tuning, and reliability monitoring across development and production environments.
Team Leadership
- Build, lead, and mentor a high-performance telemetry and observability engineering team.
- Establish development processes, technical roadmaps, and delivery milestones.
- Collaborate with architecture, firmware, OS, validation, and data platform teams globally.
Cross-Functional Collaboration
- Partner with CPU architecture, silicon design, firmware, kernel, validation, and cloud infrastructure teams to enable end-to-end platform observability.
- Provide telemetry capabilities that support performance analysis, power management, reliability engineering, and production monitoring.