Principal Product Manager

New

Skills

Disaster Recovery Incident Management Observability Platform Development Product Management Risk Mitigation SLAs SLIs SLOs Telemetry

Join our team as a Principal Product Manager, focusing on platform development. This role is crucial in driving growth, ensuring fault tolerance, and building capacity buffers within our infrastructure.

Key Responsibilities
  • Set platform KPIs including SLOs, SLAs, and SLIs.
  • Own incident management, disaster recovery, and business continuity roadmaps.
  • Drive observability and telemetry for real-time system health monitoring.
  • Collaborate with engineering teams to identify and mitigate systemic risks.
  • Embed reliability as a core product requirement.
Required Skills & Qualifications
  • 10+ years of product management experience with 3-5 years in senior or principal roles.
  • Experience in platform/infrastructure or developer-facing roles within fast-growing consumer tech.
  • Ability to focus on key objectives amidst distractions.
  • Familiarity with SLOs, SLAs, SLIs, error budgets, on-call duties, and incident management.
  • Strong ability to align large, matrixed engineering organizations.
  • Excellent written and verbal communication skills.

No forms. Your profile is generated instantly.

Job Type: Remote

Salary: Not Disclosed

Experience: Entry

Duration: Months

Share this job:

Similar Jobs

Senior Software Engineer

Posted 21 days ago

Deliver infrastructure projects end-to-end.

Build platform primitives for deployment and debugging.

AI-assisted Development AWS Azure Disaster Recovery

IT System Administrator

Posted 11 days ago

Provide IT Service Desk support for corporate devices and applications.

Configure and enforce security policies across services.

Access Control Management Antivirus Administration Backup Management Disaster Recovery

Customer Engineer Role

Posted 17 days ago

Be the technical front door for post-sales accounts.

Diagnose and resolve runtime issues effectively.

Container Orchestration Grafana Incident Management Kubernetes

Junior Technical Support Engineer

Posted 26 days ago

Focus on incident response in IT and contact center environments.

Manage incidents and coordinate resolution across multiple teams.

AI Tools Communication Skills Crisis Handling Dynatrace

Junior Data Engineer Role

Posted 25 days ago

Monitor and maintain ETL pipelines.

Provide incident support and resolution.

Azure Data Factory Azure Monitor Data Processing ETL Processes

Technical Support Engineer

Posted 25 days ago

Provide 2-level technical support.

Ensure compliance with SLAs.

Automation Tools Communication Skills Customer Relationship Management Incident Management

Junior Technical Support Engineer

Posted 25 days ago

Provide operational support in IT environments.

Manage incidents and coordinate resolutions.

Cloud Call Center Platforms Crisis Handling Dynatrace Genesys Engage

Senior Lead Engineer

Posted 19 days ago

Lead and manage a team of engineers.

Own the roadmap for storage durability and performance.

C++ Consensus Distributed Systems Go

Head of Information Security

Posted 21 days ago

Enhance Sendbird's information security program.

Monitor controls across audit frameworks.

Cloud Security Compliance GDPR HIPAA

Service Delivery Manager

Posted 20 days ago

Serve as primary contact for Five9 Managed Services.

Lead Day2 service delivery programs using PMP and Agile.

Agile practices Governance and administration Incident Management ITIL process design

Incident Commander Role

Posted 6 days ago

Lead IT Incident and Problem Management.

Act as Incident Commander during major outages.

Automation and Monitoring Business Continuity Planning Configuration Management Database (CMDB) Disaster Recovery Planning

Platform Resilience Engineer

Posted 12 days ago

Lead automation efforts for infrastructure reliability.

Develop tools and software for platform growth.

Automation Cloud Technologies Distributed Systems Incident Management

Backend Software Engineer

Posted 12 days ago

Design and develop backend APIs.

Collaborate with cross-functional teams.

Backend API Development Caching Systems Code Review Practices Collaborative Development

Premium Support Engineer

Posted 12 days ago

Provide high-priority support to Premium customers.

Diagnose and resolve complex technical issues.

Automation Collaboration Tools Customer Communication Debugging

Technical Program Manager

Posted 12 days ago

Drive program planning and SDLC adoption.

Lead execution reviews on product initiatives.

Agile Methodologies Incident Management JIRA OKRs (Objectives and Key Results)

Incident Management Program Lead

Posted 11 days ago

Lead the incident management program.

Develop AI-powered incident tools.

AI Tools Cross-functional Collaboration Databricks Data Fluency

Technical Account Manager

Posted 11 days ago

Optimize customers' technical health.

Serve as primary technical interface.

API Integrations Communication Skills Cross-Functional Collaboration Customer Success

Senior Site Reliability Engineer

Posted 11 days ago

Improve reliability of Block's platform.

Utilize AI tooling for better observability.

AI Tooling Alerting CI/CD Pipelines Deployment Automation

Software Engineering Manager

Posted 10 days ago

Lead Furnishing Engineering team.

Build and operate credit reporting pipelines.

Agile Methodologies Backend System Design Consumer-Scale Product Development Credit Reporting

Infrastructure Software Engineer

Posted 10 days ago

Design and develop backend systems.

Create distributed data systems.

Async Execution C++ ClickHouse Concurrency

Automated Reliability Engineer

Posted 10 days ago

Design automated reliability systems.

Improve incident management and on-call health.

Data Analysis Datadog Grafana Incident Management

Customer Success Engineering Manager

Posted 7 days ago

Lead a team of Customer Success Engineers.

Manage 24x7 support operations and escalations.

AI/ML B2B SaaS Cloud Computing Customer Success

Software Development Director

Posted 6 days ago

Set technical direction for gateway/datapath.

Lead and grow the engineering organization.

Firewalls Gateway/Datapath Systems Incident Management L4-L7 Proxies

Site Reliability Engineer

Posted 5 days ago

Design observability platforms with Grafana and Prometheus.

Establish metrics standards and SLOs.

Alerting Distributed Systems Grafana Incident Management

SOC Engineer Level 2

Posted 5 days ago

Monitor and analyze security incidents.

Lead incident response efforts.

AWS or Azure Security Firewalls Forensic Analysis IDS/IPS

Helpdesk Technician

Posted 4 days ago

Provide technical support via calls and emails.

Create incidents in ServiceNow for all service requests.

Communication Skills Customer Service Hardware Support Incident Management

Command Center Specialist

Posted 4 days ago

Own and evolve Command Center processes.

Strengthen escalation framework for incidents.

Change Management Cross-Functional Coordination Dashboard Reporting Data Center Operations

IT Specialist Position

Posted 3 days ago

Provide hands-on IT support for US users.

Administer and improve ITIL 4 processes.

Change Management Freshservice Google Workspace Incident Management

Senior Site Reliability Engineer

Posted 3 days ago

Improve reliability and performance of systems.

Define and implement SLIs, SLOs, and error budgets.

Automation Cloud Computing (AWS GCP Azure)

ServiceNow Supervisor Role

New

Oversee ServiceNow platform operations.

Manage and lead IT teams effectively.

Change Management Governance Frameworks Healthcare Compliance Incident Management

Chaos Engineer Lead

Posted 37 days ago

Lead the chaos engineering strategy at Goodnotes.

Design and execute fault injection experiments.

Automation Chaos Engineering Engineer Observability

Senior Engagement Manager UK

Posted 37 days ago

Manage remote client engagements

Drive adoption of Grafana platform

Communication Skills Customer success Grafana Observability

Staff API Engineer Role

Posted 37 days ago

Design and scale high-performance GraphQL APIs

Champion developer-friendly API architecture and standards

API Design GraphQL Observability Python

SRE Engineer UK

Posted 37 days ago

Improve system availability and reliability, Enhance incident response and postmortems, Collaborate

on monitoring solutions, Optimize observability tooling, Mentor engineering

Observability Software Development

Platform Engineering Manager Role

Posted 37 days ago

Lead and mentor engineering teams

Scale and manage cloud infrastructure

AWS CI/CD Cloud Security Infrastructure as code

Senior Software Engineer

Posted 37 days ago

- Lead complex projects in a fast-paced, agile environment - Design and develop high-quality,

le features - Collaborate with cross-functional teams - Stay current with emerging technologies -

Algorithms Data Structures GraphQL Java

Lead Software Engineer

Posted 37 days ago

Lead technical strategy for high-availability systems

Migrate to TypeScript/Node.js across teams

Ai Tools API Design Distributed systems Java

Infrastructure Staff Engineer

Posted 37 days ago

Build and enhance infrastructure capabilities

Maintain and optimize system reliability

Computer science Content management systems Engineer Javascript

Trust & Safety Engineer

Posted 37 days ago

Develop anti-abuse tools for Wikimedia projects

Design privacy-respecting detection systems

Content management systems Content Moderation Css Drupal

Engineering Manager - Integration Quality

Posted 37 days ago

Lead and manage a team focused on improving integration quality and reliability

Make product and technical decisions autonomously while seeking alignment on larger decisions

Ai Ai Tools Apis ChatGPT

Remote Junior Tech Success Manager

Posted 37 days ago

- Provide exceptional customer support - Enhance customer experiences - Drive company growth -

inate onboarding processes - Resolve technical

Cloud Computing Customer success Cybersecurity Observability

Neon Blue Data Platform

Posted 37 days ago

Design and build React-based SaaS platform, Implement scalable data systems, Collaborate with

functional teams, Own projects end-to-end, Solve novel data

AWS Node.js Observability Postgres

RavenTek Remote Job Opportunities

Posted 37 days ago

Enhance client missions

Improve organizational performance

Cloud Computing Cybersecurity Data Analytics Data Management

Principal Engineer - Multi-Tenant Scale

Posted 37 days ago

Design and evolve GitLab’s multi-tenant platform into distributed systems

Provide technical leadership across infrastructure and development areas

Architecture Cloud Cloud Computing Distributed systems

Principal Engineer - Group Tenant Scale

Posted 37 days ago

Design and evolve GitLab’s multi-tenant platform

Provide technical leadership across infrastructure and development areas

Access control API Design Architecture Cloud

Node.js Developer - Specialist

Posted 37 days ago

Offering exponential growth and support for personal development

Encouraging a remote work environment with flexibility and well-being benefits

Git Node.js Observability Restful Api

Visa Junior Software Engineer

Posted 37 days ago

Ensure system scalability and reliability

Maintain data integrity across distributed systems

Cloud Computing Devops Distributed systems Golang