Autodesk Logo

Autodesk

Senior Site Reliability Engineer

Posted 20 Hours Ago
Be an Early Applicant
In-Office
Toronto, ON
Senior level
In-Office
Toronto, ON
Senior level
The Senior Site Reliability Engineer manages AWS infrastructure, ensuring reliability and performance. Responsibilities include architecture, cloud automation, CI/CD processes, and operational support.
The summary above was generated by AI

Job Requisition ID #

25WD92369

Position Overview

We are seeking a highly motivated and experienced Senior Site Reliability Engineer (SRE) to manage critical cloud

infrastructure and site reliability operations for Autodesk's global Product Access journey. This pivotal role focuses on ensuring

the highest reliability, availability, and performance of our AWS-hosted cloud infrastructure. Reporting to the Engineering

Manager, you will be leading design and development of resilient and scalable architecture and innovative solutions for the

platform. You will independently manage and deliver end-to-end solutions while engaging with key stakeholders and partners.

Responsibilities

  • Lead architecture, solution design, development and maintenance of cloud infrastructure for microservices architecture

  • Independently manage requirement analysis, solution design, implementation, and release planning

  • Ensure high adherence to trust and security compliance, guidelines and standards

  • Streamline CI/CD processes, improve system reliability, and ensure infrastructure scalability and security

  • Automate infrastructure deployment, scaling, and management using modern DevOps tools and practices

  • Implement and maintain configuration management and infrastructure as code (IaC) using Terraform

  • Lead Disaster Recovery (DR) strategies, failover exercises, gamedays, and period maintenance activities

  • Contribute to critical vulnerability (CVEs) remediation efforts

  • Promote and document security and best practices across all pillars of DevOps/SRE throughout system design

  • Provide real-time operational support and collaborate across functions to resolve system, infrastructure, and CI/CD issues

  • Participate in on-call rotations, providing critical 24x7 support for production systems

Minimum Qualifications

  • Bachelor’s degree or higher in Computer Science, Engineering, or a related field

  • 5+ years of progressive experience in Site Reliability Engineering, DevOps, or a similar field

  • Proficiency with managing AWS resources and understanding of networking and security protocols

  • Expertise in infrastructure as code (IaC) and cloud automation tools such as Terraform, Serverless, and CloudFormation

  • Expertise in defining and building CI/CD processes with tools like Jenkins, GitHub, and Artifactory

  • Experience with container-based technologies like Docker and AWS ECS

  • Experience with monitoring and logging tools such as Dynatrace, Grafana, DataDog, ELK Stack, and CloudWatch

  • Experience in Linux Systems Administration, scripting, and troubleshooting in a production environment

  • Proficiency in programming languages such as UNIX, Python, Go, Bash, Groovy, and Node.js

  • Technology Stack: Java/SpringBoot, AWS (ECS Fargate, Elastic Cache, Lambda, Kinesis, DynamoDB, VPC, IAM policies, API Gateway, NLB/ALB, Route 53, CloudWatch, Kibana, Open Search), Kafka, GoLang, Node.js, Groovy, Python, Jenkins, GitHub, Jira, ServiceNow, and Splunk.

Preferred Qualifications

  • Knowledge in applying AI and ML solutions for engineering processes and/or DevOps automation

  • Knowledge of standardized observability frameworks such as OpenTelemetry

  • Relevant certifications (e.g., AWS Certified DevOps Engineer, AWS Site Reliability Engineer)

  • Broad knowledge of AWS, Redis, server programming, databases, and cloud architectures

  • Broad knowledge with data streaming pipelines like Kinesis, Firehose, and Kafka

  • Knowledge on core Java and SpringBoot concepts in JVM optimization

  • Knowledge on build tools, e.g. Gradle

  • Strong interpersonal and communication skills to effectively collaborate in an Agile/Scrum-oriented environment

  • Self-directed team player and independent contributor, demonstrating accountability and end-to-end ownership

#LI-AD1

Learn More

About Autodesk

Welcome to Autodesk! Amazing things are created every day with our software – from the greenest buildings and cleanest cars to the smartest factories and biggest hit movies. We help innovators turn their ideas into reality, transforming not only how things are made, but what can be made.

We take great pride in our culture here at Autodesk – it’s at the core of everything we do. Our culture guides the way we work and treat each other, informs how we connect with customers and partners, and defines how we show up in the world.

When you’re an Autodesker, you can do meaningful work that helps build a better world designed and made for all. Ready to shape the world and your future? Join us!

Salary transparency

Salary is one part of Autodesk’s competitive compensation package. Offers are based on the candidate’s experience and geographic location. In addition to base salaries, our compensation package may include annual cash bonuses, commissions for sales roles, stock grants, and a comprehensive benefits package.

Diversity & Belonging
We take pride in cultivating a culture of belonging where everyone can thrive. Learn more here: https://www.autodesk.com/company/diversity-and-belonging

Are you an existing contractor or consultant with Autodesk?

Please search for open jobs and apply internally (not on this external site).

Top Skills

AWS
Bash
CloudFormation
Datadog
Docker
Elk Stack
Git
Go
Grafana
Groovy
Java
Jenkins
Kafka
Node.js
Python
Servicenow
Splunk
Spring Boot
Terraform
Unix

Similar Jobs

6 Days Ago
In-Office
2 Locations
Senior level
Senior level
Transportation • Travel
The Senior Site Reliability Engineer will design and manage infrastructure, automate systems, enhance performance, lead incident response, and mentor colleagues.
Top Skills: AWSCloudFormationCloudwatchConfluent CloudGitlab CiGrafanaKibanaPingdomPrometheusPuppetRundeckTerraform
13 Days Ago
In-Office or Remote
9 Locations
Senior level
Senior level
Blockchain • Internet of Things • Payments • Cryptocurrency • Web3
As a Senior Site Reliability Engineer, you'll build observability platforms, support telemetry types, ensure reliability and security, and collaborate with engineers to deploy services.
Top Skills: AWSCC++Elk StackGithub ActionsGoGrafanaJavaKubernetesPackerPerlPrometheusPythonRubySplunkTerraform
20 Hours Ago
In-Office
Ottawa, ON, CAN
Senior level
Senior level
Information Technology
The Senior Cloud Site Reliability Engineer will ensure the health of Solace Cloud services, manage production incidents, optimize operations, and implement infrastructure tooling across multiple cloud platforms.
Top Skills: AWSAzureCloud FormationDatadogGCPGoGroovyKibanaKubernetesPrometheusPythonTerraform

What you need to know about the Montreal Tech Scene

With roots dating back to 1642, Montreal is often recognized for its French-inspired architecture and cobblestone streets lined with traditional shops and cafés. But what truly sets the city apart is how it blends its rich tradition with a modern edge, reflected in its evolving skyline and fast-growing tech industry. According to economic promotion agency Montréal International, the city ranks among the top in North America to invest in artificial intelligence, making it le spot idéal for job seekers who want the best of both worlds.

Key Facts About Montreal Tech

  • Number of Tech Workers: 255,000+ (2024, Tourisme Montréal)
  • Major Tech Employers: SAP, Google, Microsoft, Cisco
  • Key Industries: Artificial intelligence, machine learning, cybersecurity, cloud computing, web development
  • Funding Landscape: $1.47 billion in venture capital funding in 2024 (BetaKit)
  • Notable Investors: CIBC Innovation Banking, BDC Capital, Investissement Québec, Fonds de solidarité FTQ
  • Research Centers and Universities: McGill University, Université de Montréal, Concordia University, Mila Quebec, ÉTS Montréal

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account