Calix

Staff AI Ops Engineer

Reposted 6 Days Ago
Remote
2 Locations
Senior level
The role involves designing and maintaining infrastructure for machine learning applications, deploying ML pipelines, optimizing resources on GCP, and ensuring system observability.
Calix provides the cloud, software platforms, systems and services required for communications service providers to simplify their businesses, excite their subscribers and grow their value.

Calix is where passionate innovators come together with a shared mission: to reimagine broadband experiences and empower communities like never before. As a true pioneer in broadband technology, we ignite transformation by equipping service providers of all sizes with an unrivaled platform, state-of-the-art cloud technologies, and AI-driven solutions that redefine what’s possible. Every tool and breakthrough we offer is designed to simplify operations and unlock extraordinary subscriber experiences through innovation.

Calix is seeking a highly skilled Staff AI Ops Engineer with hands-on GCP experience to join our cutting-edge AI/ML team. In this role, you will be responsible for building, scaling, and maintaining the infrastructure that powers our machine learning and generative AI applications. You will work closely with data scientists, ML engineers, and software developers to ensure our ML/AI systems are robust, efficient, and production-ready.

This is a remote position that can be based anywhere in the United States or Canada.

Key Responsibilities:

  • Design, implement, and maintain scalable infrastructure for ML and GenAI applications

  • Deploy, operate, and troubleshoot production ML/GenAI pipelines/services

  • Build and optimize CI/CD pipelines for ML model deployment and serving

  • Scale compute resources across CPU/GPU architectures to meet performance requirements

  • Implement container orchestration with Kubernetes

  • Architect and optimize cloud resources on GCP for ML training and inference

  • Set up and maintain runtime frameworks and job management systems (Airflow, Kubeflow, MLflow, etc.)

  • Establish monitoring, logging and alerting for systems observability

  • Optimize system performance and resource utilization for cost efficiency

  • Develop and enforce AIOps best practices across the organization

Qualifications:

  • Bachelor's degree in Computer Science, Information Technology, or a related field (or equivalent experience)

  • 8+ years of overall software engineering experience

  • 3+ years of focused experience in DevOps/AIOps or similar ML infrastructure roles

  • Proficiency in infrastructure as code (IaC) using Terraform

  • Strong experience with containerization and orchestration using Docker and Kubernetes

  • Demonstrated expertise in cloud infrastructure management on GCP

  • Proficiency with workflow management tools such as Airflow and Kubeflow

  • Strong CI/CD expertise with experience implementing automated testing and deployment pipelines

  • Experience scaling distributed compute architectures across CPUs and accelerators such as GPUs

  • Solid understanding of system performance optimization techniques

  • Experience implementing comprehensive observability solutions for complex systems

  • Knowledge of monitoring and logging tools (Prometheus, Grafana, ELK stack)

  • Strong proficiency in Python

  • Familiarity with ML frameworks such as PyTorch and ML platforms like Vertex AI

  • Excellent problem-solving skills and ability to work independently

  • Strong communication skills and ability to work effectively in cross-functional teams

The base pay range for this position varies by geographic location. More information about the pay range specific to the candidate's location and other factors will be shared during the recruitment process. Individual pay is determined by location of residence and multiple other factors, including job-related knowledge, skills, and experience.

San Francisco Bay Area:

156,400 - 265,700 USD Annual

All Other US Locations:

136,000 - 231,000 USD Annual

As part of the total compensation package, this role may be eligible for a bonus. For information on our benefits, click here.

Top Skills

Airflow
Docker
ELK Stack
GCP
Grafana
Kubeflow
Kubernetes
MLflow
Prometheus
Python
PyTorch
Terraform
Vertex AI
