Delos Data Logo

Delos Data

System Software Engineer - AI

Reposted 13 Days Ago
Be an Early Applicant
Hybrid
Montréal, QC, CAN
Mid level
Hybrid
Montréal, QC, CAN
Mid level
The System Software Engineer will design and implement communication and execution layers for AI models across GPUs, addressing performance issues in distributed training and inference tasks.
The summary above was generated by AI

System Software Engineer - AI

About us:

We are a stealth-mode startup building foundational technology to address performance, scalability, and resiliency challenges in large-scale AI data center clusters. We are backed by top-tier VC firms and notable angel investors.

The company is led by experienced builders and operators who have founded companies, taken them to scale, and exited successfully. We work with a strong sense of unity and shared responsibility, and we expect trust, integrity, and respect in how we collaborate and make decisions. We hold ourselves accountable to one another and to the quality of the work we deliver.

Headquartered in Silicon Valley, we operate across a mix of remote and on-site locations in the U.S. and Canada. We aim to create an environment where people are treated fairly, supported in their growth, and are empowered to do meaningful work alongside others who take the craft seriously.

We are looking for:

We are looking for a talented System Software Engineer to help us redefine the infrastructure layer of AI. In this role, you will bridge the gap between high-level AI frameworks and low-level system software. You will be responsible for designing and implementing the communication and execution primitives that allow large-scale AI models to run efficiently across thousands of GPUs. We are looking for a "builder" who thrives in the early stages of a product’s lifecycle and is passionate about solving the "hard" systems problems of the generative AI era.

Key Responsibilities:

  • Collaborate across the stack to influence the design of our foundational technology, ensuring it meets the needs of next-generation AI models.

  • Identify and resolve performance bottlenecks in distributed training and inference workloads through deep-dive analysis of the software-hardware interface.

  • Conduct rigorous performance benchmarking and characterization on multi-node clusters.

Required Skills and Qualifications:

  • Strong proficiency in C++ and Python, with a deep understanding of systems programming fundamentals (memory management, concurrency, OS internals).

  • Proficient in a Linux development environment.

Desired Skills:

  • Experience with GPU programming (CUDA) and performance optimization for parallel architectures.

  • Familiarity with distributed AI frameworks (PyTorch, JAX, or DeepSpeed) and/or inference engines (vLLM, SGLang, Dynamo/TRT-LLM).

  • Hands-on experience with large-scale cluster orchestration and telemetry tools.

Education:

  • Bachelor's or Master's degree in Computer Engineering, Computer Science, or a related field.

Compensation:

Target base salary for this role is $120,000 - $180,000 CAD per year + meaningful equity + benefits + 401k. Our salary ranges are determined by role, level, experience, and location.

We are an equal opportunity employer. We value a range of perspectives and experiences and make employment decisions based on merit and business needs. We do not discriminate on the basis of legally protected characteristics.

Agency Note:

We do not accept resumes from agencies or search firms. Please do not forward candidate profiles through our careers page, email, LinkedIn messages, or directly to company employees. Any resumes submitted will be deemed the property of the company, and no fees will be paid in the event the candidate is hired.

#LI-EW1

Similar Jobs

An Hour Ago
Remote or Hybrid
Montréal, QC, CAN
Senior level
Senior level
AdTech • Cloud • Digital Media • Information Technology • News + Entertainment • App development
Design and implement a scalable UGC framework in Unreal Engine: data models, runtime systems, scripting model, APIs, sandboxing, performance budgets, and AI-enabled content tooling. Partner across gameplay, online, tools, and AI/ML teams, drive prototypes and documentation, and mentor engineers to ensure a cohesive, extensible platform for creators.
Top Skills: Agent-Based SystemsAsset StreamingBlueprintsC++Ci/CdEntity Component System (Ecs)LlmsLuaMultithreadingPythonRestRpcSerializationUnreal EngineVerseWorld Partitioning
2 Hours Ago
In-Office or Remote
CA
Senior level
Senior level
Blockchain • eCommerce • Fintech • Payments • Software • Financial Services • Cryptocurrency
Design end-to-end product experiences that drive seller onboarding, activation, adoption, and retention across mobile and web. Partner with PMs, engineers, and data scientists to produce high-craft interaction designs, systems-level flows, and measurable outcomes while mentoring designers and advocating native mobile patterns.
Top Skills: AndroidiOS
3 Hours Ago
Hybrid
Montréal, QC, CAN
Junior
Junior
Gaming • Information Technology • Mobile • Software • Esports
Manage the localisation of game titles into multiple languages, coordinating with teams globally to ensure quality and timely delivery.
Top Skills: Bug Tracking SoftwareCat ToolsGoogle SuiteJIRAMS OfficeSlackXloc

What you need to know about the Montreal Tech Scene

With roots dating back to 1642, Montreal is often recognized for its French-inspired architecture and cobblestone streets lined with traditional shops and cafés. But what truly sets the city apart is how it blends its rich tradition with a modern edge, reflected in its evolving skyline and fast-growing tech industry. According to economic promotion agency Montréal International, the city ranks among the top in North America to invest in artificial intelligence, making it le spot idéal for job seekers who want the best of both worlds.

Key Facts About Montreal Tech

  • Number of Tech Workers: 255,000+ (2024, Tourisme Montréal)
  • Major Tech Employers: SAP, Google, Microsoft, Cisco
  • Key Industries: Artificial intelligence, machine learning, cybersecurity, cloud computing, web development
  • Funding Landscape: $1.47 billion in venture capital funding in 2024 (BetaKit)
  • Notable Investors: CIBC Innovation Banking, BDC Capital, Investissement Québec, Fonds de solidarité FTQ
  • Research Centers and Universities: McGill University, Université de Montréal, Concordia University, Mila Quebec, ÉTS Montréal

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account