Camunda Logo

Camunda

Senior Cloud Infrastructure Engineer - Kubernetes (East Coast-Remote)

Posted 9 Days Ago
Remote
3 Locations
Senior level
Remote
3 Locations
Senior level
Design, maintain, and enhance Kubernetes-based infrastructure and multi-cloud platforms while ensuring operational excellence and collaborating with cross-functional teams.
The summary above was generated by AI

Camunda is the leader in enterprise agentic automation, orchestrating complex business processes, including high-value knowledge work, across agents, people, and systems. By creating production-ready, enterprise-grade agents with built-in governance, Camunda uniquely delivers trusted AI agents for business-critical processes. Over 700 leading innovators like Atlassian, ING, and Vodafone, rely on Camunda to slash time-to-value from months to days, boost operational efficiency, and elevate customer experiences.


As a fully remote, global company, we’re rewriting the rules of modern business. Named GP Bullhound’s 2024 Top 100 Next Unicorn list, certified as a Great Place to Work, and recognized by Flexa for true flexibility, we’re growing fast and looking for top talent to join our team. If you’re excited to do meaningful work and make real impact, keep reading, this role could be the one you’ve been waiting for.

About the role:

At Camunda, we believe in empowering businesses to automate their processes – and that starts with building incredibly reliable platforms. As a Senior Site Reliability Engineer, you’ll be at the heart of this mission! You'll play a crucial role in designing, maintaining, and improving our Kubernetes-based infrastructure and multi-cloud platform. You’ll work alongside talented engineers across teams to ensure Camunda runs smoothly for our customers worldwide, proactively identifying opportunities for improvement and contributing to a culture of continuous learning and operational excellence. This isn't just about keeping things running; it's about shaping the future of how we deliver value through automation.

Curious about the kind of challenges you'll work on at Camunda? Watch this quick 30-minute talk from our engineers to learn more about the new Camunda Exporter and how we’re solving complex problems at scale

What You’ll Be Doing:

  • Architect & Maintain Our Platform: Design, build, and maintain our Kubernetes-based infrastructure and multi-cloud platform, focusing on availability, scalability, fault tolerance. You will be directly involved in expanding Camunda SaaS capabilities by playing an important role in upcoming projects like:

    • Making our service available as a multi-region offering

    • Expanding the availability of our service to new regions and cloud providers

  • Champion Observability: Implement and enhance our monitoring tools to provide clear visibility into the health and performance of our entire stack – for both SREs and developers. You will be directly involved in helping Camunda continue its Observability journey by being an instrumental part of evolving our monitoring and observability practice supporting a multi-cloud, multi-region product.

  • Collaborate & Innovate: Work closely with cross-functional teams (development, product, etc.) to define, improve, and efficiently ship new features. Bring your experience to bear on how we can innovate and automate our processes further. You will be directly involved in developing new capabilities for Camunda SaaS.

  • Be a Trusted Resource: Provide 3rd level support for critical incidents and participate in our on-call rotation, ensuring rapid response and resolution. You will directly assist our customers and partners in providing a world-class SaaS experience.

  • Drive Automation: Identify opportunities to automate manual tasks and improve operational efficiency across the platform. You will help Camunda:

    • Continue to scale operations with automation

    • Evolve operational strategy to uplevel Camunda as a world-class SaaS provider

What You Bring:

  • Must Haves:

    • 5+ years of experience in Site Reliability Engineering (SRE) or a similar role, with a strong focus on cloud infrastructure.

    • Deep understanding and practical experience with Kubernetes and containerization technologies (Docker, etc.).

    • Proficiency in at least one scripting language (Python, Go, Bash) for automation and tooling development.

    • Experience with monitoring and observability tools (Prometheus, Grafana, ELK stack, Datadog, New Relic – or similar).

    Nice to Haves:

    • Experience working in a multi-cloud environment (AWS, Azure, GCP).

    • Familiarity with Infrastructure as Code (IaC) tools like Terraform or CloudFormation.

#LI-SK1 #Li-Remote #USEAST

What We Have to Offer:

Compensation

We offer competitive, fair, and transparent compensation. Salary ranges are location-based, with Standard and Major markets (global tech hubs) reflecting local competition.

The Annual Total Target Cash (base salary + 100% variable target, where applicable) shown below spans from the minimum in a Standard market to the maximum in a Major market. Final offers depend on skills, experience, and location, and we typically hire in the first half of the range to allow room for growth:

  • United States: $149,800 to $247,200

  • Germany: €96,800 to €160,100

  • United Kingdom: £94,100 to £154,700

  • Singapore: S$186,100 to S$279,100

If you’re based elsewhere, you’ll be hired via Remote.com (our global employer partner), and your Talent Acquisition Partner will provide a personalized Total Rewards Calculator after your first interview.

Equity: We also offer equity (where applicable) through our Virtual Stock Option Plan (VSOP).

 

Benefits & Perks

We invest in your wellbeing, growth, and ability to connect, along with perks that support you no matter where you’re based. Our benefits are globally designed and locally delivered where applicable.

  • Remote & Flexible: Work from anywhere with the setup that suits you, home office budget, co-working space support, and flexible time off to recharge when you need it.

  • In Person Connection: We invest in meaningful face time through our Annual Kickoff (Vienna in 2025, Madrid in 2026!), team offsites, and Camundi Connection Budgets, including contributing to meetups while travelling,, and local gatherings with fellow Camundi.

  • Health & Wellbeing: Access locally tailored healthcare, Modern Health for global mental wellbeing, and an annual fitness reimbursement.

  • Financial Security: Retirement and pension plans (often with company contributions), plus life and disability insurance where relevant.

  • Professional Growth: Up to $/€/£1,000 per year for self-driven learning: courses, certifications, books, you decide!

  • More of what we offer globally & in your country can be found here.

”Everyone is welcome at Camunda” it’s a celebrated component of our culture. We strive to create an inclusive environment that empowers our people. At Camunda, we honour diverse cultures and backgrounds and are proud to be an equal opportunity employer. All qualified applicants will receive consideration without regard to gender, race, ethnicity, religion, belief, sexual orientation, age, disability or any other protected characteristics under applicable law. We are looking forward to your application!

Come join us and be part of Camunda’s incredible journey: Make an impact at a pivotal moment in our story!

Top Skills

Bash
CloudFormation
Datadog
Docker
Elk Stack
Go
Grafana
Kubernetes
New Relic
Prometheus
Python
Terraform

Similar Jobs

10 Hours Ago
Remote
CAN
Senior level
Senior level
Computer Vision • Healthtech • Information Technology • Logistics • Machine Learning • Software • Manufacturing
Recruit and assess top talent in Machine Learning through strategic partnership with leadership, evaluating candidates’ technical expertise and contributions while owning the full-cycle recruitment process.
Top Skills: Applied AiDeep LearningMachine Learning
16 Hours Ago
Remote
30 Locations
Senior level
Senior level
Artificial Intelligence • Productivity • Software • Automation
Manage and develop the Data Engineering team to build scalable data systems and APIs. Set architectural vision, ensure data quality, and collaborate across teams to drive business impact.
Top Skills: AirflowAWSDatabricksDbtKafkaPythonTypescript
20 Hours Ago
Easy Apply
Remote
Canada
Easy Apply
Senior level
Senior level
Artificial Intelligence • Enterprise Web • Information Technology • Productivity • Sales • Software • Database
The Senior Social Media Manager will lead organic social efforts, create engaging content, manage campaigns, and analyze performance to enhance brand awareness and engagement.
Top Skills: CanvaFigmaNotionSprout

What you need to know about the Montreal Tech Scene

With roots dating back to 1642, Montreal is often recognized for its French-inspired architecture and cobblestone streets lined with traditional shops and cafés. But what truly sets the city apart is how it blends its rich tradition with a modern edge, reflected in its evolving skyline and fast-growing tech industry. According to economic promotion agency Montréal International, the city ranks among the top in North America to invest in artificial intelligence, making it le spot idéal for job seekers who want the best of both worlds.

Key Facts About Montreal Tech

  • Number of Tech Workers: 255,000+ (2024, Tourisme Montréal)
  • Major Tech Employers: SAP, Google, Microsoft, Cisco
  • Key Industries: Artificial intelligence, machine learning, cybersecurity, cloud computing, web development
  • Funding Landscape: $1.47 billion in venture capital funding in 2024 (BetaKit)
  • Notable Investors: CIBC Innovation Banking, BDC Capital, Investissement Québec, Fonds de solidarité FTQ
  • Research Centers and Universities: McGill University, Université de Montréal, Concordia University, Mila Quebec, ÉTS Montréal

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account