Interface AI Logo

Interface AI

Lead DevOps / Platform Engineer

Posted Yesterday
Be an Early Applicant
In-Office
7 Locations
Senior level
In-Office
7 Locations
Senior level
Design and maintain AI platform infrastructure, focusing on reliability, observability, and developer experience, while managing complex workloads and automation.
The summary above was generated by AI

interface.ai is the industry's-leading specialized AI provider for banks and credit unions, serving over 100 financial institutions. The company's integrated AI platform offers a unified banking experience through voice, chat, and employee-assisting solutions, enhanced by cutting-edge proprietary Generative AI.

Our mission is clear: to transform the banking experience so every consumer enjoys hyper-personalized, secure, and seamless interactions, while improving operational efficiencies and driving revenue growth.

interface.ai offers pre-trained, domain-specific AI solutions that are easy to integrate, scale, and manage, both in-branch and online. Combining this with deep industry expertise, interface.ai is the AI solution for banks and credit unions that want to deliver exceptional experiences and stay at the forefront of AI innovation.

About the Role

We are looking for a Lead Platform Engineer to design, build, and evolve our core AI platform infrastructure. This role is at the intersection of software engineering, infrastructure automation, and platform reliability, enabling product and AI teams to ship faster with confidence.

You will design developer-facing platforms, define standards for reliability and observability, and help scale complex workloads like LLM orchestration, vector databases, and event-driven systems.

This is a hands-on role where you’ll shape the foundational components that power our multi-product ecosystem — Sphere (Voice AI), Orbit (Chat AI), and Nexus (Employee Copilot).

What You’ll Do
  • Platform Architecture: Design, implement, and maintain core platform services and internal APIs for scalable, multi-tenant workloads.
  • Developer Experience: Build internal developer platforms (IDP) that streamline CI/CD, environment provisioning, and observability across teams.
  • System Reliability: Architect for fault tolerance, auto-scaling, and zero-downtime deployments for distributed microservices and AI pipelines.
  • Infrastructure as Code: Own and extend Terraform/Crossplane configurations to standardize provisioning across environments.
  • Performance & Observability: Implement deep observability (OpenTelemetry, Prometheus, Grafana) for tracing, metrics, and proactive alerting.
  • Service Orchestration: Manage Kubernetes, Helm, and service mesh (Istio/Linkerd) to ensure secure and efficient service communication.
  • Platform APIs: Build and evolve backend services in Go/Node.js/Python for internal orchestration, configuration, and workload routing.
  • AI Platform Integration: Collaborate with AI teams to optimize LLM workflows, caching strategies, and retrieval pipelines for low-latency inference.
  • Automation: Write high-quality scripts/tools in Python/Go to automate operational tasks, resilience testing, and rollout management.
  • Cross-Functional Partnership: Work with Product, DevOps, and Security to ensure every platform capability meets performance, compliance, and reliability goals.
What You’ll Bring
  • 6–9 years of engineering experience, with at least 3+ years in platform, infrastructure, or DevOps-heavy roles.
  • Strong proficiency in at least two backend languages (Go, Node.js, or Python).
  • Hands-on experience with Kubernetes, Helm, Terraform, and declarative infrastructure management.
  • Deep understanding of distributed systems, container orchestration, and microservice communication.
  • Proficiency in AWS cloud architecture (EKS, S3, RDS, Lambda, IAM, VPC).
  • Proven experience with observability and tracing systems (OpenTelemetry, Prometheus, Grafana).
  • Experience with CI/CD pipeline design (Jenkins, GitHub Actions, ArgoCD, GitOps workflows).
  • Exposure to AI/ML or data-intensive systems, including model serving, vector databases, or RAG pipelines.
  • Knowledge of networking, service mesh, and security controls in production-grade environments.
  • Strong debugging and performance tuning skills; ability to reason about failure modes and resilience.
  • Excellent collaboration skills — able to partner with developers, product managers, and AI researchers effectively.
Why Join Us
  • Build core platform systems that power one of the fastest-growing AI companies in fintech.
  • Shape developer experience, infrastructure standards, and reliability practices for an AI-first ecosystem.
  • Collaborate with top-tier engineers, AI researchers, and architects on large-scale distributed systems.
  • Work in a high-trust, fast-growth environment where innovation meets real-world impact.

Compensation

  •  Compensation is expected to be between $170,000 - $200,000. Exact compensation may vary based on skills and location.

What We Offer

  • 💡 100% paid health, dental & vision care
  • 💰 401(k) match & financial wellness perks
  • 🌴 Discretionary PTO + paid parental leave
  • 🏡 Remote-first flexibility
  • 🧠 Mental health, wellness & family benefits
  • 🚀 A mission-driven team shaping the future of banking

At interface.ai, we are committed to providing an inclusive and welcoming environment for all employees and applicants. We celebrate diversity and believe it is critical to our success as a company. We do not  discriminate on the basis of race, color, religion, national origin, age, sex, gender identity, gender expression, sexual orientation, marital status, veteran status, disability status, or any other legally protected status. All employment decisions at Interface.ai are based on business needs, job requirements, and individual qualifications. We strive to create a culture that values and respects each person's unique perspective and contributions. We encourage all qualified individuals to apply for employment opportunities with Interface.ai and are committed to ensuring that our hiring process is inclusive and accessible.

Top Skills

Argocd
AWS
Github Actions
Go
Grafana
Helm
Jenkins
Kubernetes
Node.js
Opentelemetry
Prometheus
Python
Terraform

Similar Jobs

8 Days Ago
In-Office
Waterloo, ON, CAN
Senior level
Senior level
Consumer Web • Digital Media
As a Senior DevOps/Platform Engineer, you'll manage and enhance infrastructure, improve CI/CD pipelines, and develop decentralized systems for XR applications.
Top Skills: AnsibleAWSDockerGCPGoJenkinsKubernetesPythonRustTerraformUnityUnreal Engine
55 Minutes Ago
In-Office
Calgary, AB, CAN
Junior
Junior
Big Data • Information Technology • Software • Analytics • Energy
The Sales Development Representative will qualify new opportunities, communicate with industry professionals, and achieve performance targets while representing the company at events.
Top Skills: Linkedin Sales NavigatorMS OfficeSalesforceZoominfo
Expert/Leader
Automotive • Big Data • Information Technology • Robotics • Software • Transportation • Manufacturing
Lead a team developing embedded software for vehicle controls. Provide technical vision and manage stakeholder relationships while ensuring quality and integration across products.
Top Skills: AutovalC ProgrammingCanDspaceEthernetGitHilJIRALauterbachLinMatlabSilSimulink

What you need to know about the Montreal Tech Scene

With roots dating back to 1642, Montreal is often recognized for its French-inspired architecture and cobblestone streets lined with traditional shops and cafés. But what truly sets the city apart is how it blends its rich tradition with a modern edge, reflected in its evolving skyline and fast-growing tech industry. According to economic promotion agency Montréal International, the city ranks among the top in North America to invest in artificial intelligence, making it le spot idéal for job seekers who want the best of both worlds.

Key Facts About Montreal Tech

  • Number of Tech Workers: 255,000+ (2024, Tourisme Montréal)
  • Major Tech Employers: SAP, Google, Microsoft, Cisco
  • Key Industries: Artificial intelligence, machine learning, cybersecurity, cloud computing, web development
  • Funding Landscape: $1.47 billion in venture capital funding in 2024 (BetaKit)
  • Notable Investors: CIBC Innovation Banking, BDC Capital, Investissement Québec, Fonds de solidarité FTQ
  • Research Centers and Universities: McGill University, Université de Montréal, Concordia University, Mila Quebec, ÉTS Montréal

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account