Bangalore, KA, IN
Azure Data Engineer
Company Profile:

Founded in 1976, CGI is among the world's largest independent IT and business consulting services firms. With 94,000 consultants and professionals globally, CGI delivers an end-to-end portfolio of capabilities, from strategic IT and business consulting to systems integration, managed IT and business process services, and intellectual property solutions. CGI works with clients through a local relationship model complemented by a global delivery network that helps clients digitally transform their organizations and accelerate results. CGI's reported revenue for Fiscal 2024 was CA$14.68 billion, and CGI shares are listed on the TSX (GIB.A) and the NYSE (GIB). Learn more at cgi.com.

Experience: 7 to 12 years
Category: Software Development/Engineering
Designation: Senior Software Engineer/Lead Analyst
Main location: India, Karnataka, Bangalore
Position ID: J0525-1998
Employment Type: Full Time

Position Description

The Azure Data Engineer is responsible for designing, implementing, and managing data pipelines and architectures on Microsoft Azure. The role involves working with Azure Databricks, SQL databases, and tools like Azure Data Factory (ADF) to transform, process, and integrate data for analytics and reporting. They collaborate with data scientists, business stakeholders, and other engineering teams to ensure reliable, scalable, and efficient data solutions.

Your future duties and responsibilities:

• Data Pipeline Development: Design, build, and maintain scalable ETL (Extract, Transform, Load) pipelines using Azure Data Factory (ADF). Develop data transformation workflows using Databricks and PySpark to process large datasets.
• Data Integration: Implement data integration between multiple sources such as on-premises databases, cloud-based storage (Azure Blob Storage), and third-party APIs. Ensure smooth data flow across the Azure ecosystem.
• Data Modelling & Storage: Design and optimize SQL databases and Data Lake architectures to store and retrieve large datasets efficiently. Work with business analysts to translate data requirements into optimized storage and retrieval solutions.
• Data Analysis & Reporting: Create and optimize queries using SQL to support data analysis and reporting. Collaborate with data scientists to create datasets for machine learning models.
• Automation & Optimization: Write and maintain Python/PySpark scripts to automate data processing and cleaning tasks. Optimize and tune performance for data pipelines and databases to handle large-scale data efficiently.
• Collaboration: Work closely with data scientists, business analysts, and BI teams to ensure alignment on data needs and requirements. Communicate technical issues and data solutions effectively with both technical and non-technical stakeholders.
• Monitoring & Troubleshooting: Monitor data pipelines and ensure data quality, consistency, and security. Troubleshoot and resolve any issues or failures in data processing pipelines.

• Azure Data Platform: Proficiency in Azure Data Factory (ADF) for building ETL pipelines. Experience with Azure Databricks for data processing and analytics.
• SQL Expertise: Strong knowledge of SQL for querying and managing relational databases (e.g., Azure SQL Database, SQL Data Warehouse). Ability to optimize SQL queries and database performance.
• Programming Languages: Python and PySpark for writing data processing scripts and workflows. Understanding of object-oriented programming and coding best practices.
• Big Data Processing: Experience with PySpark and Databricks for large-scale data transformations and analysis.
• Data Integration: Knowledge of integrating data from diverse sources, such as APIs, databases, and cloud storage.
• Data Lakes & Storage: Familiarity with Azure Data Lake Storage and Blob Storage for storing raw and processed data.
• Data Governance & Security: Understanding of data security, governance, and best practices in cloud environments.
• Problem Solving & Debugging: Strong analytical and problem-solving skills for troubleshooting data pipeline and infrastructure issues.

Required qualifications to be successful in this role:

Technologies required:
Azure Data Factory (ADF), Azure Databricks, Azure Data Lake Storage, Azure Blob Storage
Azure SQL Database, Azure Synapse Analytics (optional), Azure Monitor & Log Analytics
Programming Languages: Python, SQL, PySpark.

Skills: Azure Data Factory, Python, SQL

What you can expect from us:

Together, as owners, let’s turn meaningful insights into action.

Life at CGI is rooted in ownership, teamwork, respect and belonging. Here, you’ll reach your full potential because…

You are invited to be an owner from day 1 as we work together to bring our Dream to life. That’s why we call ourselves CGI Partners rather than employees. We benefit from our collective success and actively shape our company’s strategy and direction.

Your work creates value. You’ll develop innovative solutions and build relationships with teammates and clients while accessing global capabilities to scale your ideas, embrace new opportunities, and benefit from expansive industry and technology expertise.

You’ll shape your career by joining a company built to grow and last. You’ll be supported by leaders who care about your health and well-being and provide you with opportunities to deepen your skills and broaden your horizons.

Come join our team—one of the largest IT and business consulting services firms in the world.
