Chelsea Avondale Logo

Chelsea Avondale

Reliability Engineer

Reposted 10 Days Ago
Be an Early Applicant
Remote
Hiring Remotely in Canada
Junior
Remote
Hiring Remotely in Canada
Junior
The Reliability Engineer will ensure the reliability and performance of systems, implement AWS cloud infrastructure, and enhance monitoring and alerting in Python.
The summary above was generated by AI

Chelsea Avondale is the world’s most cutting-edge home insurance group. We have developed sophisticated risk modeling and insurance pricing technologies for home insurance and deploy that technology through our own insurance company.

Our team consists of some of the brightest minds in insurance, software development, finance, and operations. Our group includes our scientific research & engineering division (Skynet Software) and Canadian property & casualty insurance company (Max Insurance).

Together, our group is transforming the Canadian and global insurance landscape.

JOB DESCRIPTION:

Chelsea Avondale is looking for a Reliability Engineer with a background in infrastructure system engineering to support the growth of a secure, dynamic, and scalable IT environment across the group. Our business is going through rapid growth, and it is essential that our systems infrastructure keeps pace.

The Reliability Engineer will play a crucial role in ensuring the reliability, scalability, and performance of our systems, enabling the continuous delivery of our products and services. They will be accountable for ensuring overall availability, as well as enhancing Engineering teams’ capability to design, build and operate robust systems at scale.

This position is ideal for candidates who have an extraordinary sense of responsibility and are not afraid to roll up their sleeves. Our IT environment is not toolkit rich. What we are NOT looking for is someone who wants to take months installing a large number of tools from their preferred toolkit. We take pride in maintaining a fundamental stack of technologies, much of it in Python, and we are looking for someone who shares this mentality.

If you are someone who thrives in a high-performance culture and is eager for work that is both challenging and constantly evolving, this role is perfect for you. We strongly encourage and help our team members to improve and enhance their personal skill sets within our organization. On your journey with us, you will have the ability to learn and grow rapidly, taking on more responsibilities.

RESPONSIBILITIES:

  • Play an integral role in the design, implementation & maintenance of AWS cloud server environments.
  • Design, implement, and maintain robust monitoring and alerting systems in Python to detect and respond to incidents in a timely manner.
  • Collaborate with cross-functional teams to enhance reliability of our systems and services.
  • Design, configure, deploy, and maintain infrastructure on AWS using best practices and industry standards.
  • Conduct post-incident analysis to identify root causes, implement corrective actions, and prevent similar issues in the future.
  • Assist in capacity planning & optimize services to provide scalable, stable, & secure systems.
  • Implement high availability and disaster recovery solutions to provide data redundancy, resilience, and data loss prevention.
  • Assist with the implementation of select network engineering solutions including firewalls, load balancing, VPNs & LANs, where necessary.

PREFERRED EXPERIENCE & SKILLS:

  • Bachelor’s degree in Computer Science, Computer Engineering, Electrical Engineering, or related field.
  • 1+ years of experience as a Reliability Engineer or similar role, with a focus on maintaining high-performance, scalable, and reliable web systems.
  • We also encourage highly motivated new grads to apply.
  • Hands-on experience with AWS cloud environments – instances, CloudWatch, EFS, etc.
  • Proficiency at Python is a must.
  • Experience using NGINX for reverse proxy, load balancing, and caching.
  • Experience with Unix / Windows server configuration, administration, performance tuning and troubleshooting.
  • Working knowledge of web technologies (web servers, DNS, SSL, Browsers).
  • Working knowledge of web development processes (source control, deployment, etc.).
  • Experience load testing, pen testing, and providing security for cloud resources is beneficial.

Top Skills

AWS
Nginx
Python

Similar Jobs

16 Days Ago
Easy Apply
Remote
Canada
Easy Apply
Senior level
Senior level
Cloud • Security • Software • Cybersecurity • Automation
As a Senior Site Reliability Engineer at GitLab, you will automate and manage the lifecycle of GitLab environments, ensuring reliability and scalability while leading incident responses and architectural decisions.
Top Skills: AnsibleAWSElkGCPGoGrafanaKubernetesPrometheusRubyTerraform
20 Days Ago
In-Office or Remote
3 Locations
Senior level
Senior level
Fashion • Retail • Software
The role involves maintaining SQL and NoSQL databases, automating processes, improving monitoring, managing incidents, and ensuring security and compliance in a collaborative environment.
Top Skills: AnsibleAWSAzureBashDatadogDockerGCPGrafanaKubernetesMongoDBNoSQLPostgresPrometheusPythonSQLTerraform
9 Hours Ago
In-Office or Remote
Open Hall, Subd. F, NL, CAN
Senior level
Senior level
Digital Media • Social Media
The Senior Site Reliability Engineer at TextNow will maintain and scale production services, improve reliability, write automation code, and collaborate with development teams for optimal infrastructure performance.
Top Skills: AnsibleAWSBashDockerGoKubernetesLinuxMariadbPuppetPythonRedisRubyTerraform

What you need to know about the Montreal Tech Scene

With roots dating back to 1642, Montreal is often recognized for its French-inspired architecture and cobblestone streets lined with traditional shops and cafés. But what truly sets the city apart is how it blends its rich tradition with a modern edge, reflected in its evolving skyline and fast-growing tech industry. According to economic promotion agency Montréal International, the city ranks among the top in North America to invest in artificial intelligence, making it le spot idéal for job seekers who want the best of both worlds.

Key Facts About Montreal Tech

  • Number of Tech Workers: 255,000+ (2024, Tourisme Montréal)
  • Major Tech Employers: SAP, Google, Microsoft, Cisco
  • Key Industries: Artificial intelligence, machine learning, cybersecurity, cloud computing, web development
  • Funding Landscape: $1.47 billion in venture capital funding in 2024 (BetaKit)
  • Notable Investors: CIBC Innovation Banking, BDC Capital, Investissement Québec, Fonds de solidarité FTQ
  • Research Centers and Universities: McGill University, Université de Montréal, Concordia University, Mila Quebec, ÉTS Montréal

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account