Lutra Logo

Lutra

Staff Platform Engineer, Manufacturing AI

Reposted 21 Days Ago
Remote
Hiring Remotely in Canada
Mid level
Remote
Hiring Remotely in Canada
Mid level
The Staff Platform Engineer will design and implement scalable infrastructure for AI applications in manufacturing while ensuring reliability, operability, and cross-team collaboration.
The summary above was generated by AI
Your opportunity

Our client is a well-funded, seed-stage AI startup that builds agents for the factory floor. They develop and distribute a software-first agent layer that plugs into the cameras and machines factories already have. Their models run and act at the edge so agents can see, decide, and act in real time. Events and metrics flow into a dashboard that provides plant teams immediate visibility. They’re approaching a large (~$14B) and underserved market with a disruptive, asset-light alternative to hardware-heavy robotics and batch analytics and they’ve already found early traction with clients in the food & beverage, pharma/cosmetics, and materials processing verticals.

As a staff platform engineer, you’ll join an emergent platform team and help shape the form that it takes. You will become fluent in the hardware platform, networking topologies, and application stack and draw on your longitudinal perspective to build a platform practice and guide the leadership team’s decision making. You’ll be firmly in the critical path (to begin) as the primary on-call and equipped to turn incidents into monitoring signals, build playbooks from first principles, and shape a culture of streamlined root cause analysis.

You’ll be joining a flat, dynamic environment in the midst of its scale-up phase that’s led by an accomplished ex-Deepmind researcher with specialization in reinforcement learning, deep learning and robotics. The company closed a $13.9M CAD seed round in March of 2025 and are scaling R&D and delivery to meet accelerating demand, with headcount tracking to double by year-end.

Please note that this role may involve participation in an on-call rotation that includes evenings and weekends.

Thematic responsibilities

  • Infrastructure & application ownership: Design and implement scalable infrastructure architectures across on-premise (edge) and cloud environments; evolve core infrastructure platforms that support production and pre-production workflows
  • Pre-production environments & validation: Build and maintain sandbox, staging, and shadow-run environments that mirror production behavior; own how systems are provisioned, isolated, tested, and validated before rollout
  • Replay-based testing & safe version rollouts: Design infrastructure to support A/B playback testing of models and software versions, offline and replay-based workload testing, and shadow-mode execution prior to version switching
  • Reliability engineering, fault isolation & performance determinism: Define infrastructure standards that ensure reliable, isolated systems with predictable performance under real-world workloads
  • Operability & cross-team collaboration: Partner with DevOps to ensure infrastructure designs are deployable, observable, and operable; collaborate with Edge and AI teams to enable safe experimentation

Tech stack

  • Operating system: Linux
  • Backend: Python (Flask, FastAPI), TypeScript/Node.js
  • Orchestration & compute: Kubernetes, on-prem bare metal, VMs
  • Containers: Docker
  • Monitoring, observability & logging: Prometheus, Grafana, ELK
  • Cloud providers: AWS, Azure, GCP
  • Databases & storage: SQL, InfluxDB, MongoDB
  • Messaging & IoT: MQTT, HTTP/REST, RabbitMQ, Apache Kafka
  • Edge platforms: NVIDIA Jetson, Raspberry Pi (ARM)
  • GPU/acceleration: CUDA, TensorRT, ONNX, OpenVINO
  • ML/DL frameworks: PyTorch, TensorFlow, Keras, scikit-learn
  • Scientific computing: NumPy, Pandas
  • Computer vision: OpenCV
  • Cameras & vision I/O: GenICam, GigE Vision, USB3 Vision
  • Industrial automation: PLC integration; protocols: Ethernet/IP, Modbus, Profinet, OPC UA

Your know-how

  • You have significant experience supporting the design and implementation of scaled production environments in hybrid (edge-cloud) or on-prem environments
  • You have strong Linux systems knowledge and experience building and operating underlying compute platforms
  • You have significant experience with infrastructure orchestration platforms (Kubernetes/K8s preferred) and/or virtualization platforms
  • You are experienced with monitoring, observability and alerting stacks and best practices 
  • You have high comfort with, and understanding of, distributed systems and failure modes
  • You have enough software engineering skills to be dangerous, and specific command of Python for infrastructure automation and validation tooling
  • You have experience collaborating effectively within and across cross-functional delivery teams
  • You are a contagiously curious person with entrenched learning habits

It’s a bonus if

  • You have experience designing and operating scaled production environments for manufacturing, robotics, IoT and/or industrial automation applications
  • You have deep expertise in computer vision, robotics, or manufacturing automation
  • You have experience supporting GPU-based or real-time workloads
  • You are predisposed to mentorship and crafting a culture of continuous improvement
  • You have experience scaling an AI and/or B2B SaaS venture

Interested in learning more?

Please apply using the following form or send your resume or LinkedIn profile URL to [email protected] with “Staff Platform Engineer, Manufacturing AI” as the subject line. One of our talent partners will be in contact shortly.

Compensation
The base pay range for this role is CA$200,000 – CA$250,000 per year.

Similar Jobs

9 Hours Ago
Remote or Hybrid
East York, ON, CAN
Junior
Junior
Big Data • Food • Hardware • Machine Learning • Retail • Automation • Manufacturing
Lead design, deployment, and sustainment of IL6S/TPM systems to eliminate losses and improve equipment reliability. Train and coach teams, run Kaizen and DMAIC events, track KPIs (OEE, MTBF/MTTR), implement SOPs and visual management, perform loss analysis, and support preventive/predictive maintenance to drive productivity and safety targets.
Top Skills: 5WhysAutonomous MaintenanceDmaicE2E Data Collection SystemsGeIshikawaKaizenLean Six SigmaMakigamiMtbbMtbfMttrOeeParetoPdcaPredictive MaintenanceRoot Cause Analysis (Rca)SmedStandard WorkTpmValue Stream Mapping (Vsm)Visual ManagementWpi Tool
9 Hours Ago
Remote or Hybrid
CA
Senior level
Senior level
eCommerce • Fintech • Hardware • Payments • Software • Financial Services
Outbound-focused senior account executive responsible for sourcing and closing new restaurant merchant logos. Duties include prospecting, discovery, demos, consultative selling of Square ecosystem, field relationship building, partnering with BD/Product/Marketing, managing the sales cycle and onboarding, and meeting monthly sales KPIs using Salesforce.
Top Skills: SalesforceSquare
14 Hours Ago
Remote or Hybrid
Senior level
Senior level
Artificial Intelligence • Cloud • HR Tech • Information Technology • Productivity • Software • Automation
Manage and grow ServiceNow partner relationships across Canada: build partner practices, set targets, drive governance, enablement, reporting, business reviews, remediation plans, and achieve joint revenue goals while coaching partners and collaborating with global teams.
Top Skills: AIServicenow

What you need to know about the Montreal Tech Scene

With roots dating back to 1642, Montreal is often recognized for its French-inspired architecture and cobblestone streets lined with traditional shops and cafés. But what truly sets the city apart is how it blends its rich tradition with a modern edge, reflected in its evolving skyline and fast-growing tech industry. According to economic promotion agency Montréal International, the city ranks among the top in North America to invest in artificial intelligence, making it le spot idéal for job seekers who want the best of both worlds.

Key Facts About Montreal Tech

  • Number of Tech Workers: 255,000+ (2024, Tourisme Montréal)
  • Major Tech Employers: SAP, Google, Microsoft, Cisco
  • Key Industries: Artificial intelligence, machine learning, cybersecurity, cloud computing, web development
  • Funding Landscape: $1.47 billion in venture capital funding in 2024 (BetaKit)
  • Notable Investors: CIBC Innovation Banking, BDC Capital, Investissement Québec, Fonds de solidarité FTQ
  • Research Centers and Universities: McGill University, Université de Montréal, Concordia University, Mila Quebec, ÉTS Montréal

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account