MaintainX Logo

MaintainX

Site Reliability Engineer

Posted 13 Days Ago
Be an Early Applicant
Easy Apply
In-Office
2 Locations
Mid level
Easy Apply
In-Office
2 Locations
Mid level
The Site Reliability Engineer will enhance reliability, observability, and developer autonomy while collaborating with product and platform engineering teams and mentoring developers on reliability practices.
The summary above was generated by AI

MaintainX is the world's leading Asset and Work Intelligence platform for industrial and frontline environments. We are a modern, IoT-enabled, cloud-based tool for reliability, safety, and operations of physical equipment and facilities. MaintainX powers operational excellence for 12,000 businesses, including Duracell, Univar Solutions Inc., Titan America, McDonald's, Brenntag, Cintas, Xylem, and Shell.

We recently completed a $150 million Series D funding round, bringing our total funding to $254 million and valuing the company at $2.5 billion.

We’re looking for a Site Reliability Engineer (SRE) to help advance MaintainX’s reliability, observability, and developer autonomy as we scale our platform.

In this role, you’ll partner closely with product and platform engineering teams to improve the stability, resilience, and operational readiness of our services. You’ll work alongside teams to design for reliability from the start, establish clear ownership and standards, and build shared tooling that enables teams to operate their services with confidence.

You’ll also contribute to company-wide initiatives that define how MaintainX approaches reliability engineering, including observability standards, incident response practices, and service health metrics, helping the organization adopt proven industry practices at scale.

This role is well-suited for an engineer who enjoys working across teams, influencing technical direction through strong engineering practices, and turning reliability principles into practical, scalable systems.

What You'll Do:

  • Assess service maturity and provide insights to development teams
  • Partner with development teams to implement observability best practices
  • Enable development teams to become autonomous with their service deployment, support, and infrastructure
  • Mentor developers on reliability practices, focusing on making them self-sufficient
  • Act as the bridge, ear and eyes of the Platform Division teams to drive tooling and practice adoption across development teams

About You:

  • Deep understanding of observability practices in a distributed system environment and how it influences system design and team behaviour
  • Practical experience with SRE concepts (SLOs, error budgets, incident management)
  • 3–5+ years in software engineering, SRE, DevOps, or production engineering roles with experience operating production systems
  • Proficient in cloud-native platforms and infrastructure-as-code concepts and tools
  • Working knowledge of at least one programming language (TypeScript/Node.js is a plus)
  • Excellent communication and collaboration abilities across technical and non-technical teams
  • Ability to translate complex reliability concepts into actionable guidance
  • You enjoy enabling teams to succeed independently and measuring success by reduced dependency on you

What’s in it For You:

  • Competitive salary and meaningful equity opportunities.
  • Healthcare, dental, and vision coverage.
  • 401(k) / RRSP enrollment program.
  • Take what you need PTO.
  • A Work Culture where:
    • You’ll work alongside folks across the globe that reflect the MaintainX values, Smart Humble Optimist.
    • We believe in meritocracy, where ideas and effort are publicly celebrated.

About Us:

Our mission is to deliver one platform for maintenance, repair & operations teams to keep the physical world running. We believe the greatest asset in any organization is the people. That’s why we built an intuitive, mobile-first solution to help boost productivity and collaboration across teams and locations.

MaintainX is committed to creating a diverse environment. All qualified applicants will receive consideration for employment without regard to race, colour, religion, gender, gender identity or expression, sexual orientation, national origin, genetics, disability, age, or veteran status.

Top Skills

Cloud-Native Platforms
Infrastructure-As-Code
Node.js
Typescript

Similar Jobs

Yesterday
In-Office or Remote
27 Locations
Senior level
Senior level
Blockchain • Fintech • Payments • Financial Services • Cryptocurrency • Web3
Design, maintain, and secure cloud infrastructure and CI/CD pipelines; automate operations with Go/Python; manage Kubernetes and blockchain nodes; implement disaster recovery; use AI tools for monitoring, anomaly detection, and capacity planning; participate in on-call rotations; mentor team members to improve reliability and performance.
Top Skills: Go,Python,Shell,Terraform,Crossplane,Aws Lambda,Kubernetes,Helm,Ethereum,Solana,Arbitrum,Base,Avalanche,Postgresql,Redis,Opensearch,Apache Airflow,Aws Dms,Snowflake,Github Copilot,Gemini,Chatgpt,Llms,Apm,Rum,Telemetry
5 Days Ago
Easy Apply
Hybrid
Toronto, ON, CAN
Easy Apply
Senior level
Senior level
Cloud • Mobile • Software
Own and improve reliability domains end-to-end, implement SRE practices (SLIs/SLOs, error budgets), design observability, lead multi-team reliability projects, operate AWS/IaC environments, contribute code and automation, participate in on-call and incident response, mentor engineers, and document standards and runbooks to reduce toil and improve operability.
Top Skills: Python,Node.Js,Typescript,Datadog,Prometheus,Grafana,Honeycomb,New Relic,Aws,Terraform,Ecs,Eks,Kubernetes,Pagerduty,Incident.Io,Opsgenie,Llms/Ai-Assisted Tooling
6 Days Ago
Easy Apply
Hybrid
Toronto, ON, CAN
Easy Apply
Mid level
Mid level
Cloud • Mobile • Software
Improve and protect production reliability and performance by implementing SRE practices (SLIs/SLOs, error budgets), building observability, evolving AWS infrastructure with Terraform, contributing automation and code, participating in incident response, and documenting runbooks and standards across teams.
Top Skills: Python,Node.Js,Typescript,Aws,Terraform,Docker,Ecs,Eks,Kubernetes,Datadog,Prometheus,Grafana,Honeycomb,New Relic,Incident.Io,Pagerduty,Opsgenie,Llms

What you need to know about the Montreal Tech Scene

With roots dating back to 1642, Montreal is often recognized for its French-inspired architecture and cobblestone streets lined with traditional shops and cafés. But what truly sets the city apart is how it blends its rich tradition with a modern edge, reflected in its evolving skyline and fast-growing tech industry. According to economic promotion agency Montréal International, the city ranks among the top in North America to invest in artificial intelligence, making it le spot idéal for job seekers who want the best of both worlds.

Key Facts About Montreal Tech

  • Number of Tech Workers: 255,000+ (2024, Tourisme Montréal)
  • Major Tech Employers: SAP, Google, Microsoft, Cisco
  • Key Industries: Artificial intelligence, machine learning, cybersecurity, cloud computing, web development
  • Funding Landscape: $1.47 billion in venture capital funding in 2024 (BetaKit)
  • Notable Investors: CIBC Innovation Banking, BDC Capital, Investissement Québec, Fonds de solidarité FTQ
  • Research Centers and Universities: McGill University, Université de Montréal, Concordia University, Mila Quebec, ÉTS Montréal

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account