Botpress Logo

Botpress

Site Reliability Engineer

Posted 7 Days Ago
Be an Early Applicant
In-Office
Montréal, QC
Mid level
In-Office
Montréal, QC
Mid level
As a Site Reliability Engineer, you'll ensure platform stability, scalability, and security, focusing on infrastructure reliability and operational excellence.
The summary above was generated by AI

Help bring AI agents to companies worldwide. 

Over the next decade, autonomous agents will redefine how we work.

Botpress allows companies to build and deploy advanced AI agents that move beyond conversation into real business logic. 

Our product works today and at scale, across industries, regions, and limitless use cases.

As the 3rd fastest-growing B2B AI start-up worldwide, we’re at the forefront of the AI revolution, providing the most widely-used platform for sophisticated AI agents. 

The work ahead is ambitious. The opportunity is rare. We take a deliberate approach to growth: product-led, capital-efficient, and highly focused.

If you want to build foundational technology for one of the most meaningful platform shifts in software, we’re looking for top talent to join us.

Key Highlights:

  • Over 1 million AI agents and chatbots deployed
  • 700,000+ platform users
  • Trusted by 35% of Fortune 500 companies
  • 7 years of expertise in AI solutions
About the Role

We’re hiring a Site Reliability Engineer to help ensure the stability, scalability, and security of our platform. You’ll be part of the product team, owning the systems that keep our services resilient and performant under real-world loads.

This is a hands-on engineering role focused on infrastructure reliability and operational excellence. You’ll architect and maintain the cloud systems (e.g. AWS) that power Botpress, with a strong focus on observability, uptime, and automation. 

You’ll collaborate closely with engineers to refine how we ship, monitor, and operate software — always with an eye toward reducing risk and improving speed. Part of this role will include opening up the site to different regions of users. 

Responsibilities
  • Architect and maintain scalable infrastructure
  • Design and optimize CI/CD pipelines to ensure smooth delivery of changes
  • Improve observability through advanced monitoring, logging, and alerting
  • Own incident response and support the engineering team in diagnosing and resolving issues
  • Build systems that increase platform reliability, resiliency, and uptime
  • Enforce security best practices across environments and workflows
  • Manage infrastructure as code using tools like Terraform or Pulumi
  • Document operational procedures, disaster recovery plans, and system runbooks

Requirements
  • 3+ years in SRE, DevOps, or infrastructure engineering roles
  • Deep experience with AWS cloud infrastructure and services (ECS, S3, Lambda, RDS)
  • Comfortable with Linux systems, containerization, and orchestration (e.g. Docker, Kubernetes)
  • Proficient in CI/CD tools, infrastructure-as-code, and automation scripting
  • Familiar with incident management and site reliability principles
  • Experience with observability stacks like Datadog, Grafana, Prometheus, etc.
  • Strong communicator and collaborator across technical teams
  • Calm and systematic under pressure when production issues arise
  • Bonus: Previous experience in a fast-paced startup or SaaS environment
About Botpress

Botpress recently raised its $25 million Series B funding. As a fast-growing start-up, we run a lean and innovative ship that leans on AI for maximum business impact. At Botpress, everyone is an owner, bringing their unique perspective and talents.

Our teams are talented and passionate. We intentionally hire individuals who are eager, passionate, talented, and hungry to learn and grow throughout their career.

You'll be on a team that's not just adapting to the AI revolution, but leading it. Joining our team means changing the future of enterprise AI and building technology that will define the next era of business automation.


Benefits
  • Work at one of Canada’s fastest-growing AI start-ups
  • Work with a talented and passionate team
  • 4 weeks of vacation
  • Paid sick and parental leave
  • Comprehensive health, dental, vision, travel, and life insurance
  • Funding for education and skills improvement
  • Fully-stocked fridge and cupboard – we take snacks seriously 
  • Your own desk – no ‘hot-desk’-style sign-up systems
  • A vibrant office community, including weekly socials

Top Skills

AWS
Datadog
Docker
Grafana
Kubernetes
Linux
Prometheus
Pulumi
Terraform

Similar Jobs

Yesterday
Hybrid
Gatineau, QC, CAN
Mid level
Mid level
Artificial Intelligence • Hardware • Information Technology • Security • Software • Cybersecurity • Big Data Analytics
As a Site Reliability Engineer, you will enhance reliability for public safety products by implementing monitoring solutions, handling incidents, and fostering a strong SRE culture.
Top Skills: Chaos EngineeringCi/Cd PipelinesCloud-Based ApplicationsDevops ToolingInfrastructure As CodeMicroservicesRest ApisSlisSlos
2 Hours Ago
In-Office
Montréal, QC, CAN
Mid level
Mid level
Logistics • Transportation
The Site Reliability Engineer will manage incident response, collaborate with teams for deployments, automate tasks, and improve system performance.
Top Skills: AWSAzureCloud ManagementCloudflareDatadogGCPPagerdutyTerraform
10 Days Ago
Easy Apply
Hybrid
3 Locations
Easy Apply
Senior level
Senior level
Big Data • Cloud • Software • Database
The Senior Site Reliability Engineer will help build, scale, and run applications on MongoDB Atlas, contributing to a supportive culture and employee growth.
Top Skills: AIAWSGCPAzureMongoDB

What you need to know about the Montreal Tech Scene

With roots dating back to 1642, Montreal is often recognized for its French-inspired architecture and cobblestone streets lined with traditional shops and cafés. But what truly sets the city apart is how it blends its rich tradition with a modern edge, reflected in its evolving skyline and fast-growing tech industry. According to economic promotion agency Montréal International, the city ranks among the top in North America to invest in artificial intelligence, making it le spot idéal for job seekers who want the best of both worlds.

Key Facts About Montreal Tech

  • Number of Tech Workers: 255,000+ (2024, Tourisme Montréal)
  • Major Tech Employers: SAP, Google, Microsoft, Cisco
  • Key Industries: Artificial intelligence, machine learning, cybersecurity, cloud computing, web development
  • Funding Landscape: $1.47 billion in venture capital funding in 2024 (BetaKit)
  • Notable Investors: CIBC Innovation Banking, BDC Capital, Investissement Québec, Fonds de solidarité FTQ
  • Research Centers and Universities: McGill University, Université de Montréal, Concordia University, Mila Quebec, ÉTS Montréal

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account