Arena (arena.ai) Logo

Arena (arena.ai)

Data Engineer

Reposted 10 Days Ago
Remote or Hybrid
Hiring Remotely in CA
Mid level
Remote or Hybrid
Hiring Remotely in CA
Mid level
The Analytics Engineer will design data models, build pipelines, and ensure data quality to support AI evaluation and insights.
The summary above was generated by AI
About Arena Intelligence

Arena is the platform for evaluating how AI models perform in the real world. Founded by researchers from UC Berkeley's SkyLab, we're on a mission to measure and advance the frontier of AI for real-world use, and to build the foundation for everyone to understand, shape, and benefit from it.


Tens of millions of people use Arena each month to evaluate how frontier systems handle the work they actually do. The preferences they share power the most transparent, rigorous, and human-centered evaluations in AI. Leading AI labs, enterprises, and independent researchers rely on our work and open datasets to understand how models behave in real workflows: agentic coding, creative generation, professional productivity, and beyond. We go beyond leaderboards and decompose what human experience reveals about AI, so models advance toward the work people actually do.


We're a team of researchers, academics, builders, and creatives from UC Berkeley, Google, Stanford, and DeepMind. We seek truth, move fast, and value craftsmanship, curiosity, and impact over hierarchy. We're building a company where thoughtful, curious people from all backgrounds can do their best work together, in an office culture that radiates excellence, energy, and focus.

About the Role

Arena Intelligence is seeking an experienced Data Engineer to own the data foundations that power real-world AI evaluation. In this role, you will design and build the analytics-layer data models, pipelines, and metrics that turn raw user activity and votes into trusted insights for the public, AI labs, and enterprise customers.

This role sits at the intersection of data engineering, analytics, and product. You’ll work closely with researchers, product managers, and engineers to define schemas, standardize metrics, and ensure that our evaluation data is accurate, interpretable, and scalable. Your work will directly shape how AI performance is measured, understood, and acted upon across the industry.

This is an ideal role for someone who enjoys building clean, well-modeled data systems, cares deeply about data quality and correctness, and wants to see their work influence both product decisions and external customers.

You’ll
  • Own the design and implementation of analytics-ready data models, schemas, and tables in our data warehouse

  • Build and maintain reliable data pipelines (batch and incremental) that transform raw event and vote data into standardized, trusted datasets

  • Define and standardize core metrics used across product, research, and customer-facing evaluations

  • Partner with product managers and researchers to translate evaluation questions into robust data models

  • Develop and maintain dashboards, reports, and data artifacts used by internal teams and external partners

  • Ensure data quality through testing, validation, monitoring, and documentation

  • Orchestrate and schedule data workflows using Airflow or equivalent tools

  • Optimize queries and pipelines to support large-scale analytical workloads

  • Contribute to improving data discoverability, lineage, and documentation across the warehouse

You’ll have
  • 3+ years of experience in analytics engineering, data engineering, or a closely related role

  • Strong proficiency in SQL, with experience designing analytics-friendly schemas and transformations

  • Hands-on experience working with a modern data warehouse (e.g., Databricks, Snowflake, BigQuery)

  • Experience building and orchestrating data pipelines using Airflow or similar workflow orchestration tools

  • Proficiency in Python for data transformation, validation, and pipeline development

  • A strong understanding of data modeling best practices (e.g., dimensional modeling, metrics layers)

  • Experience operating and debugging production data pipelines with a focus on correctness and reliability

Nice to have's

  • Experience with Spark or other distributed data processing frameworks

  • Familiarity with Delta Lake or similar table formats

  • Experience supporting experimentation, evaluation, or metrics-heavy products

  • Exposure to machine learning systems or ML-adjacent analytics

  • Experience improving data discovery, lineage, or documentation at scale

What we offer
  • We offer competitive compensation and equity aligned to the markets where our team members are based. The base salary range will depend on the candidate’s permanent work location.

  • Comprehensive health and wellness benefits, including medical, dental, vision, and additional support programs.

  • The opportunity to work on cutting-edge AI with a small, mission-driven team

  • A culture that values transparency, trust, and community impact

Come help build the space where anyone can explore and help shape the future of AI.

Arena Intelligence provides equal employment opportunities (EEO) to all employees and applicants for employment without regard to race, color, religion, sex, national origin, age, disability, genetics, sexual orientation, gender identity, or gender expression. We are committed to a diverse and inclusive workforce and welcome people from all backgrounds, experiences, perspectives, and abilities.

Similar Jobs

10 Days Ago
Remote or Hybrid
6 Locations
Expert/Leader
Expert/Leader
Cloud • Computer Vision • Information Technology • Sales • Security • Cybersecurity
As a Principal Data Engineer, you will design and implement LLM, AI-powered security data platforms, mentor engineers, and drive the adoption of data solutions across teams.
Top Skills: AirflowAWSBigQueryDaskDockerFlinkGCPKafkaKubeflowKubernetesLangchainLlamaindexMlflowMlops ToolsOciPulsarPythonSagemakerSnowflakeSparkVertex Ai
3 Days Ago
In-Office or Remote
CA
Expert/Leader
Expert/Leader
Blockchain • eCommerce • Fintech • Payments • Software • Financial Services • Cryptocurrency
Design, build, and operate production ML systems that generate trusted signals for ranking, retrieval, recommendations, propensity/churn/LTV, and next-best-action decisioning. Define signal/data contracts, own feature and candidate generation through serving, experimentation, monitoring, and feedback loops, and evaluate long-term business impact, trust, fairness, and compliance. Partner across product, data, modeling, risk, and compliance and apply AI/agents to accelerate engineering and operations.
Top Skills: Agent-Assisted Operations ToolingBatch PipelinesCloud InfrastructureCoding AgentsData WarehousesEmbeddingsEvaluation HarnessesEvent StreamsExperimentation SystemsFeature StoresJavaKotlinKubernetesLakehousesLightgbmModel-Serving InfrastructureObservability ToolingPythonPyTorchRanking/Retrieval SystemsRecommendation FrameworksSemantic SearchSQLTensorFlowWorkflow OrchestrationXgboost
3 Days Ago
Remote or Hybrid
CA
Expert/Leader
Expert/Leader
Blockchain • Fintech • Mobile • Payments • Software • Financial Services
Design, build, and operate production ML signal systems—ranking, retrieval, recommendations, propensity, and next-best-action—covering feature/candidate generation, serving, experimentation, monitoring, and feedback. Define signal contracts (freshness, provenance, confidence), evaluate long-term impact (trust, fairness, compliance), and partner across product, data, and risk teams to deliver reusable customer-intelligence capabilities.
Top Skills: Batch PipelinesCloud InfrastructureCoding AgentsData WarehousesEmbeddingsEvent StreamsExperimentation SystemsFeature StoresJavaKotlinKubernetesLakehousesLightgbmModel-Serving InfrastructureObservability ToolingPythonPyTorchRanking/Retrieval SystemsRecommendation FrameworksSemantic SearchSQLTensorFlowWorkflow OrchestrationXgboost

What you need to know about the Montreal Tech Scene

With roots dating back to 1642, Montreal is often recognized for its French-inspired architecture and cobblestone streets lined with traditional shops and cafés. But what truly sets the city apart is how it blends its rich tradition with a modern edge, reflected in its evolving skyline and fast-growing tech industry. According to economic promotion agency Montréal International, the city ranks among the top in North America to invest in artificial intelligence, making it le spot idéal for job seekers who want the best of both worlds.

Key Facts About Montreal Tech

  • Number of Tech Workers: 255,000+ (2024, Tourisme Montréal)
  • Major Tech Employers: SAP, Google, Microsoft, Cisco
  • Key Industries: Artificial intelligence, machine learning, cybersecurity, cloud computing, web development
  • Funding Landscape: $1.47 billion in venture capital funding in 2024 (BetaKit)
  • Notable Investors: CIBC Innovation Banking, BDC Capital, Investissement Québec, Fonds de solidarité FTQ
  • Research Centers and Universities: McGill University, Université de Montréal, Concordia University, Mila Quebec, ÉTS Montréal

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account