Site Reliability Engineer

New

Skills

Alerting, Dashboards, Grafana, Infrastructure as Code, Kubernetes, Linux, Networking Fundamentals, Prometheus, RBAC, Terraform

We are seeking a skilled Site Reliability & DevOps Engineer to design, build, and operate observability platforms based on Grafana and Prometheus. In this role, you will define and maintain metrics standards, dashboards, alerts, and SLOs, improve signal quality, and support incident response.

Key Responsibilities
  • Design, build, and operate observability platforms using Grafana and Prometheus.
  • Define and maintain metrics standards, dashboards, alerts, and SLOs.
  • Improve signal quality by reducing alert noise and tuning thresholds.
  • Support incident response with actionable telemetry and conduct post-incident analysis.
  • Instrument services and automate observability using infrastructure as code.
  • Collaborate with platform, infrastructure, and application teams.

Required Skills & Qualifications
  • Strong experience with Prometheus, including scraping, federation, and alerting.
  • Experience with Grafana dashboards, alerting, templating, and RBAC.
  • Solid grasp of Linux and networking fundamentals.
  • Experience with observability stacks in Kubernetes environments.
  • Infrastructure as code experience, preferably with Terraform.
  • Familiarity with incident management and on-call practices.

Job Type: Remote

Salary: Not Disclosed

Experience: Entry

Duration: Months
