RaceTrac Company Overview
Job Description:
The Lead Data Developer will play a pivotal role in designing, developing, and optimizing data pipelines and architectures within our Azure-based data ecosystem. This role will lead the development of scalable, high-performance solutions leveraging Azure Data Services and Databricks to enable robust analytics and business intelligence. The ideal candidate will bring deep technical expertise, a strategic mindset, and strong leadership to mentor junior team members and drive data engineering best practices.
Responsibilities:
Lead the end-to-end development of data pipelines and ETL/ELT workflows using Azure Data Factory, Databricks (PySpark/Scala), and SQL.
Architect and maintain efficient, reusable, and reliable data systems within the Azure ecosystem (e.g., Azure Synapse, Azure Data Lake, Azure SQL DB).
Design data models and data warehousing solutions that support analytics and reporting across the business.
Collaborate with data scientists, analysts, and business stakeholders to understand data needs and deliver high-quality, trusted data products.
Optimize performance of data processes, including monitoring and troubleshooting jobs, managing resource consumption, and tuning Spark clusters.
Enforce governance, security, and compliance standards, including data quality, lineage, and cataloging using tools like Azure Purview.
Provide technical leadership and mentorship to junior developers, including code reviews and guidance on architectural decisions.
Support CI/CD automation, version control, and DevOps practices in the data engineering workflow.
Stay current with evolving data technologies and recommend improvements to existing infrastructure and architecture.
Qualifications:
Bachelor's degree in Computer Science, Engineering, Information Systems, or related field preferred.
7+ years of experience in data engineering or software development roles required.
3+ years of hands-on experience with Azure Data Services (e.g., Azure Data Factory, Azure Data Lake, Synapse, Azure SQL) required.
Strong proficiency in Databricks, particularly with PySpark and/or Scala required.
Deep understanding of distributed data processing and optimization in cloud environments required.
Advanced SQL skills and familiarity with structured/unstructured data formats (Parquet, Avro, JSON) required.
Experience with CI/CD pipelines, Git, and Infrastructure as Code (e.g., Terraform, ARM templates) preferred.
All qualified applicants will receive consideration for employment with RaceTrac without regard to their race, national origin, religion, age, color, sex, sexual orientation, gender identity, disability, or protected veteran status, or any other characteristic protected by local, state, or federal laws, rules, or regulations.