Procore Technologies Logo

Procore Technologies

Staff ML Data Engineer (Datagrid)

Posted Yesterday
In-Office or Remote
Hiring Remotely in CA
Senior level
In-Office or Remote
Hiring Remotely in CA
Senior level
The Staff ML Data Engineer will design and build scalable data systems for ML research, ensuring robust data pipelines and leading data engineering efforts.
The summary above was generated by AI

We’re looking for a Staff ML Data Engineer to join Procore’s AI & Frontier Models organization. In this role, you’ll be responsible for designing and building the data systems that power frontier‑scale machine learning research and applied AI products, with a particular focus on spatial intelligence and multimodal data. The primary goal of this role is to ensure that researchers and engineers can reliably discover, curate, transform, and operate on large‑scale datasets that move from experimentation to production.

As a Staff ML Data Engineer, you’ll work closely with ML researchers, applied ML engineers, and system architects to turn ambiguous research needs into scalable, production‑ready data pipelines. You’ll remain deeply hands‑on while providing technical leadership in data architecture, quality, and operational excellence. This is an opportunity to shape how Procore builds, evaluates, and deploys frontier models by ensuring the underlying data systems are robust, observable, and designed for iteration.

This role reports reports into the Manager, Software Engineering, and is based in our San Francisco office, supporting Procore's Datagrid AI Division. Given the collaborative and fast moving nature of this work, we are seeking candidates who are available to work onsite in a hybrid model at a minimum of 3 days per week. This is an immediate opening!

What you’ll do
  • Act as the technical lead for data engineering efforts supporting frontier model research and applied ML systems.

  • Design, build, and maintain scalable batch and streaming pipelines for multimodal data (e.g., documents, images, spatial metadata).

  • Partner closely with researchers and architects to translate experimental workflows into reliable, repeatable data systems.

  • Lead the development of dataset curation, versioning, and lineage workflows that support rapid experimentation and reproducibility.

  • Establish and uphold standards for data quality, validation, observability, and cost efficiency across AI data pipelines.

  • Contribute to data architecture decisions spanning research environments and production systems.

  • Identify gaps or inefficiencies in existing data workflows and run proofs‑of‑concept to evaluate improvements.

  • Mentor other engineers through code reviews, design discussions, and hands‑on collaboration.

What we’re looking for
  • Bachelor’s or Master’s degree in Computer Science, Engineering, or a related field, or equivalent practical experience.

  • 8+ years of experience designing and operating complex data systems in production or research‑adjacent environments.

  • Strong proficiency in SQL and Python; experience with data‑intensive or distributed systems.

  • Proven experience building scalable data pipelines that support machine learning training, evaluation, or inference workflows.

  • Solid understanding of data modeling, dataset lifecycle management, and data quality best practices.

  • Comfort operating in highly ambiguous problem spaces and collaborating closely with researchers and architects.

  • Demonstrated ability to lead through direct technical contribution, mentorship, and setting engineering standards.

  • Strong communication skills, with the ability to explain technical tradeoffs to both research and engineering audiences.

Nice to have experience with technologies such as:

  • ML & Research Data: Large‑scale dataset curation, annotation workflows, experiment tracking, reproducibility tooling

  • Data Platforms: Databricks, Spark, lakehouse architectures, cloud data warehouses

  • Streaming & Pipelines: Kafka, Pub/Sub, event‑driven data architectures

  • Orchestration & Observability: Airflow, Dagster, data quality and lineage tools

  • Cloud & Infrastructure: AWS or GCP, containerized data workloads, CI/CD, infrastructure‑as‑code

  • Performance & Cost: Optimizing data pipelines for GPU‑backed training and large‑scale inference workloads

Additional Information

Base Pay Range:

227,332.00 - 312,581.50 USD Annual

This role may also be eligible for Equity Compensation and/or Bonus Incentive Compensation. Procore is committed to offering competitive, fair, and commensurate compensation. Actual compensation will be based on a candidate’s job-related skills, experience, education or training, and location.

For Los Angeles County (unincorporated) Candidates:

Procore will consider for employment all qualified applicants, including those with arrest or conviction records, in accordance with the requirements of applicable federal, state, and local laws, including the City of Los Angeles’ Fair Chance Initiative for Hiring Ordinance, the Los Angeles County Fair Chance Ordinance for Employers, and the California Fair Chance Act.

A criminal history may have a direct, adverse, and negative relationship on the following job duties, potentially resulting in the withdrawal of the conditional offer of employment: 1. appropriately managing, accessing, and handling confidential information including proprietary and trade secret information, as well as accessing Procore's information technology systems and platforms; 2. interacting with and occasionally having unsupervised contact with internal/external customers, stakeholders, and/or colleagues; and 3. exercising sound judgment.

Similar Jobs

2 Hours Ago
Easy Apply
Remote or Hybrid
Canada
Easy Apply
Senior level
Senior level
eCommerce • Healthtech • Kids + Family • Retail • Social Media
As a Senior Technical Recruiter, you will manage full-cycle recruiting for complex technical roles, emphasizing Data Science, Machine Learning, and core Engineering positions, while fostering diverse talent pipelines and enhancing candidate experiences.
Top Skills: AIGreenhouseLinkedin Recruiter
10 Hours Ago
Remote or Hybrid
CA
Senior level
Senior level
Blockchain • Fintech • Mobile • Payments • Software • Financial Services
Looking for a Software Engineer to build financial products and tooling for Cash App's Lending team, ensuring quality and compliance while collaborating across domains.
Top Skills: AuroraAWSBuildkiteDatadogDynamoDBGradleGrpcGuiceHibernateHTTPJavaJettyJooqJSONJunitKafkaKotlinMySQLOkhttpProtocol BuffersRedisVitess
10 Hours Ago
Remote or Hybrid
Senior level
Senior level
Cloud • Fintech • Information Technology • Machine Learning • Software
The Lead Engineer will solve engineering problems, modernise systems, enhance the developer experience, and mentor other engineers while building high-quality software.
Top Skills: .NetAWSC#KubernetesReact

What you need to know about the Montreal Tech Scene

With roots dating back to 1642, Montreal is often recognized for its French-inspired architecture and cobblestone streets lined with traditional shops and cafés. But what truly sets the city apart is how it blends its rich tradition with a modern edge, reflected in its evolving skyline and fast-growing tech industry. According to economic promotion agency Montréal International, the city ranks among the top in North America to invest in artificial intelligence, making it le spot idéal for job seekers who want the best of both worlds.

Key Facts About Montreal Tech

  • Number of Tech Workers: 255,000+ (2024, Tourisme Montréal)
  • Major Tech Employers: SAP, Google, Microsoft, Cisco
  • Key Industries: Artificial intelligence, machine learning, cybersecurity, cloud computing, web development
  • Funding Landscape: $1.47 billion in venture capital funding in 2024 (BetaKit)
  • Notable Investors: CIBC Innovation Banking, BDC Capital, Investissement Québec, Fonds de solidarité FTQ
  • Research Centers and Universities: McGill University, Université de Montréal, Concordia University, Mila Quebec, ÉTS Montréal

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account