Featherless AI Logo

Featherless AI

Senior Software Engineer - API Gateway

Reposted 16 Days Ago
Be an Early Applicant
Remote
Hiring Remotely in Canada
Senior level
Remote
Hiring Remotely in Canada
Senior level
Develop and enhance the API gateway for an AI inference platform, focusing on feature implementation, bug fixes, infrastructure management, and reliability improvements.
The summary above was generated by AI

About the Role

Featherless.ai is building the world’s most reliable and comprehensive open-model inference platform — the infrastructure powering the next generation of AI creators, startups, and enterprises. Our serverless approach to inference unlocks the best GPU utilization in AI infrastructure.

We’re hiring Senior Software Engineers to support and evolve the API gateway to our inference cloud, which is responsible for

  • authentication and inference to all models

  • subscription management and subscription entitlement (e.g. context-length, concurrency limits)

  • and providing the necessary API surface for applications and builders

API Gateway is constantly evolving in response to the unending stream of new models, modalities, clients and inference load.

What you'll do

The API gateway is managed by the Platform Team, who aim to make Featherless the best place to find and use models. As a member of the platform team, you will

  • undertake feature development and bug fixes to keep up with clients, resolve user issues, and onboard new models

  • improve the reliability of the existing API (increasing instrumentation and monitoring, right-sizing infrastructure)

  • respond to availability incidents

  • triage and resolve issues of inference quality and reliability

  • manage the infrastructure on which our gateway runs

What you'll bring

  • first-hand experience of the user’s we’re building for (familiarity with popular open LLMs, common clients, and experience building with LLM)

  • experience with the technologies and paradigms of the web (REST, websockets, DNS, networking, opentelemetry)

  • experience with significant components of our stack (k8s, node, mikro-orm, fastify, redis, mongodb, python, elastic cloud, cloudflare, sentry, otel)

  • ability to debug complex issues across a wide stack and build instrumentation as necessary

  • desire to work collaboratively as part of a skilled team

  • Alignment with team and company values, including

    • bias to action

    • responsiveness to users (bug-fixes over features)

    • instinct to iterate

    • subscribing to that done means proven by usage data

Other

This team operates on Eastern Time. We are remote, but with a preference to hire in Toronto, Canada.

Similar Jobs

36 Minutes Ago
Easy Apply
Remote or Hybrid
Canada
Easy Apply
Senior level
Senior level
eCommerce • Healthtech • Kids + Family • Retail • Social Media
As Senior Technical Product Manager, lead and execute technology solutions to enhance merchandising operations, driving efficiency and scalability at Babylist.
Top Skills: AIAPIsSystems Integration
14 Hours Ago
In-Office or Remote
Canada
Senior level
Senior level
Artificial Intelligence • Productivity • Software • Automation
The Automation Strategist will guide customers in automating processes, help identify use cases, and promote AI-enabled transformation, focusing on value delivery and relationship building.
Top Skills: AIAutomation
14 Hours Ago
In-Office or Remote
Junior
Junior
Artificial Intelligence • Big Data • Healthtech • Information Technology • Machine Learning • Software • Analytics
The Technical Engineer is responsible for remote support and maintenance of PACS and Cloud-based products, ensuring customer satisfaction through timely issue resolution and communication. Duties include diagnosing technical problems, conducting inspections, generating reports, and performing installations and upgrades.
Top Skills: Cloud StorageDicomDlt TapeGitlabGoogle Transfer AppliancesJIRALinuxMs SqlPacsSan StorageSql PlusUnixVMware

What you need to know about the Montreal Tech Scene

With roots dating back to 1642, Montreal is often recognized for its French-inspired architecture and cobblestone streets lined with traditional shops and cafés. But what truly sets the city apart is how it blends its rich tradition with a modern edge, reflected in its evolving skyline and fast-growing tech industry. According to economic promotion agency Montréal International, the city ranks among the top in North America to invest in artificial intelligence, making it le spot idéal for job seekers who want the best of both worlds.

Key Facts About Montreal Tech

  • Number of Tech Workers: 255,000+ (2024, Tourisme Montréal)
  • Major Tech Employers: SAP, Google, Microsoft, Cisco
  • Key Industries: Artificial intelligence, machine learning, cybersecurity, cloud computing, web development
  • Funding Landscape: $1.47 billion in venture capital funding in 2024 (BetaKit)
  • Notable Investors: CIBC Innovation Banking, BDC Capital, Investissement Québec, Fonds de solidarité FTQ
  • Research Centers and Universities: McGill University, Université de Montréal, Concordia University, Mila Quebec, ÉTS Montréal

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account