Photon Logo

Photon

Architect - Data Engineering - Mississauga - Canada

Posted 2 Days Ago
Be an Early Applicant
In-Office or Remote
Hiring Remotely in Canada
Expert/Leader
In-Office or Remote
Hiring Remotely in Canada
Expert/Leader
Design and architect large-scale distributed big data solutions and ETL pipelines using Java, Scala, and Spark. Build and optimize Spark applications on Cloudera CDH (HDFS, Hive, Impala, HBase, Kafka). Ensure data integrity, troubleshoot performance issues, and implement version control and CI/CD for Spark workloads while collaborating with cross-functional teams.
The summary above was generated by AI

Key Responsibilities: 

  • Architect and design large-scale, distributed big data solutions using Java and big data technologies to handle high-volume data processing and analytics. 
  • Optimize and tune Spark applications for better performance on large-scale data sets. 
  • Work with the Cloudera Hadoop ecosystem (e.g., HDFS, Hive, Impala, HBase, Kafka) to build data pipelines and storage solutions. 
  • Collaborate with data scientists, business analysts, and other developers to understand data requirements and deliver solutions. 
  • Design and implement high-performance data processing and analytics solutions. 
  • Ensure data integrity, accuracy, and security across all processing tasks. 
  • Troubleshoot and resolve performance issues in Spark, Cloudera, and related technologies. 
  • Implement version control and CI/CD pipelines for Spark applications. 

Required Skills & Experience: 

  • Minimum 15+ years of experience in application development. 
  • Strong hands on experience in Apache Spark, Scala, and Spark SQL for distributed data processing. 
  • Hands-on experience with Cloudera Hadoop (CDH) components such as HDFS, Hive, Impala, HBase, Kafka, and Sqoop. 
  • Familiarity with other Big Data technologies, including Apache Kafka, Flume, Oozie, and Nifi. 
  • Experience building and optimizing ETL pipelines using Spark and working with structured and unstructured data. 
  • Experience with SQL and NoSQL databases such as HBase, Hive, and PostgreSQL. 
  • Knowledge of data warehousing concepts, dimensional modeling, and data lakes. 
  • Ability to troubleshoot and optimize Spark and Cloudera platform performance. 
  • Familiarity with version control tools like Git and CI/CD tools (e.g., Jenkins, GitLab). 

Similar Jobs

2 Hours Ago
Remote or Hybrid
Senior level
Senior level
Artificial Intelligence • Professional Services • Business Intelligence • Consulting • Cybersecurity • Generative AI
Lead and manage internal investigations, develop and implement compliance policies, advise on regulatory requirements, analyze operational risks, communicate findings to stakeholders, coach and lead teams, and support compliance program implementation and training to strengthen internal controls and ethical standards.
2 Hours Ago
Remote or Hybrid
Mid level
Mid level
Artificial Intelligence • Professional Services • Business Intelligence • Consulting • Cybersecurity • Generative AI
The role involves developing, testing, and validating Generative AI agents and maintaining automated testing standards. Responsibilities include mentoring junior associates, analyzing complex issues, and applying governance controls in AI-driven solutions.
Top Skills: AIAutomated TestingCi/CdData EngineeringLlmsMlPower Automate
2 Hours Ago
Remote or Hybrid
Senior level
Senior level
Artificial Intelligence • Professional Services • Business Intelligence • Consulting • Cybersecurity • Generative AI
As an AI Engineering Manager at PwC, you will lead the design and operation of AI-powered platforms, mentor engineers, and ensure project delivery excellence while focusing on security and scalability.
Top Skills: AIAzureAzure Bot Framework SdkAzure Cognitive ServicesCloud EngineeringConversational AiData VisualizationDevOpsMachine Learning

What you need to know about the Montreal Tech Scene

With roots dating back to 1642, Montreal is often recognized for its French-inspired architecture and cobblestone streets lined with traditional shops and cafés. But what truly sets the city apart is how it blends its rich tradition with a modern edge, reflected in its evolving skyline and fast-growing tech industry. According to economic promotion agency Montréal International, the city ranks among the top in North America to invest in artificial intelligence, making it le spot idéal for job seekers who want the best of both worlds.

Key Facts About Montreal Tech

  • Number of Tech Workers: 255,000+ (2024, Tourisme Montréal)
  • Major Tech Employers: SAP, Google, Microsoft, Cisco
  • Key Industries: Artificial intelligence, machine learning, cybersecurity, cloud computing, web development
  • Funding Landscape: $1.47 billion in venture capital funding in 2024 (BetaKit)
  • Notable Investors: CIBC Innovation Banking, BDC Capital, Investissement Québec, Fonds de solidarité FTQ
  • Research Centers and Universities: McGill University, Université de Montréal, Concordia University, Mila Quebec, ÉTS Montréal

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account