Andromeda (andromeda.ai)

Software Engineer - AI Infrastructure

Reposted 5 Days Ago
In-Office or Remote
Hiring Remotely in Canada
Senior level
As a Software Engineer in AI Infrastructure, you will design and develop core platform components, build APIs and services, enhance performance, and automate tooling while collaborating across teams and improving system reliability.
The summary above was generated by AI
Location: North America Remote / San Francisco · Full-Time

About Andromeda

Andromeda Cluster, founded by Nat Friedman and Daniel Gross, is on a mission to democratize access to cutting-edge AI infrastructure previously reserved for hyperscalers. What began with a single managed cluster has quickly evolved into a global platform, connecting leading AI labs, data centers, and cloud providers.

Our orchestration layer seamlessly routes training and inference jobs across the world, unlocking flexibility and efficiency in one of the fastest-growing sectors on earth. Our long-term vision is to establish a global marketplace for AI compute—powering AGI with the same fluidity as world financial markets.

We are scaling rapidly and seeking exceptional talent in AI infrastructure, research, and engineering.

The Role

As an Infrastructure Product Engineer, you will play a pivotal role in building the backbone of Andromeda’s platform. You'll transform complex, real-world infrastructure challenges into scalable product capabilities that benefit our customers.

Positioned at the intersection of infrastructure and product engineering, this role is deeply technical and systems-oriented, yet laser-focused on building solutions with broad leverage.

What You'll Do
  • Design and develop core platform components, including infrastructure orchestration, provisioning, and lifecycle management solutions.

  • Build robust APIs, services, and control planes that abstract over diverse infrastructure types (VMs, Kubernetes, bare metal, schedulers).

  • Translate customer usage patterns into product requirements, delivering impactful features and improvements.

  • Create automation and internal tooling to eliminate manual or ad-hoc operational work.

  • Enhance reliability, performance, and observability at the platform level, emphasizing durable improvements over quick fixes.

  • Collaborate with peer teams to define clear ownership boundaries between platform capabilities and customer-specific solutions.

  • Write clean, maintainable, and well-documented code with a focus on long-term sustainability.

  • Participate in technical design discussions and contribute to the architectural evolution of our platform.

What We're Looking For
  • 5+ years of experience in Infrastructure, Platform, or Backend Engineering roles.

  • Strong systems fundamentals: deep understanding of Linux, networking, storage, and distributed systems.

  • Proven expertise with Kubernetes, VMs, or bare-metal environments.

  • Advanced software engineering skills; capable of building production-grade APIs and services (Python, Go, or similar).

  • Extensive experience with infrastructure as code and automation tools (Terraform, Ansible, Helm, etc.).

  • Demonstrated ability to navigate ambiguity and distill complex problems into clear, maintainable abstractions.

  • Product-focused mindset: you care about interfaces, defaults, reliability, and sustainable operations.

  • Excellent written and verbal communication skills; effective collaborator across engineering and product functions.

Nice to Have

  • Hands-on experience with GPU or AI infrastructure.

  • Experience with control-plane or orchestration systems.

  • Background spanning both infrastructure and application/backend engineering.

  • Experience architecting multi-tenant systems.

  • Strong skills in technical writing and design documentation.

  • Early-stage startup experience.

Why You’ll Love It Here

This is a true builder’s opportunity: you’ll have ownership and autonomy to shape our systems, engage directly with customers and providers, and lay the foundations for scalable, reliable AI infrastructure. Join us at Andromeda and help power the future of AI.
