Hyderabad, India
1 day ago
Principal Consultant_SQL-Based Data Quality and Cleansing

Ready to build the future with AI?
At Genpact, we don’t just keep up with technology—we set the pace. AI and digital innovation are redefining industries, and we’re leading the charge. Genpact’s AI Gigafactory, our industry-first accelerator, is an example of how we’re scaling advanced technology solutions to help global enterprises work smarter, grow faster, and transform at scale. From large-scale models to agentic AI, our breakthrough solutions tackle companies’ most complex challenges.
If you thrive in a fast-moving, innovation-driven environment, love building and deploying cutting-edge AI solutions, and want to push the boundaries of what’s possible, this is your moment.
Genpact (NYSE: G) is an advanced technology services and solutions company that delivers lasting value for leading enterprises globally. Through our deep business knowledge, operational excellence, and cutting-edge solutions – we help companies across industries get ahead and stay ahead. Powered by curiosity, courage, and innovation, our teams implement data, technology, and AI to create tomorrow, today. Get to know us at genpact.com and on LinkedIn, X, YouTube, and Facebook.
Inviting applications for the role of Principal Consultant_SQL-Based Data Quality and Cleansing
Responsibilities:
• Leverage strong SQL expertise to build data validation, transformation, and cleansing logic.
• Design and implement automated data quality checks and cleansing rules using SQL and PySpark in Databricks.
• Write optimized queries to detect anomalies, duplicates, nulls, and outliers in large datasets.
Data Profiling and Analysis:
• Perform data profiling to assess quality, completeness, accuracy, and consistency.
• Generate profiling reports to identify root causes of data quality issues.
• Collaborate with data owners to define thresholds and cleansing criteria.
Databricks Development and Automation:
• Develop notebooks and workflows in Databricks for data ingestion, standardization, and cleansing pipelines.
• Use Delta Lake for managing data versioning and change data capture (CDC) effectively.
• Build reusable SQL and Spark-based components to automate quality checks across domains.
Data Quality Monitoring and Governance:
• Define and track Data Quality KPIs using dashboards (Power BI/Tableau or Databricks-native tools).
• Maintain data quality scorecards for ongoing monitoring.
• Ensure governance policies and data standards are embedded in pipelines.
Collaboration and Stakeholder Engagement:
• Work closely with Data Engineers, Business Analysts, and Domain SMEs to gather rules for quality checks and cleansing logic.
• Participate in Agile ceremonies and data quality improvement initiatives.
• Provide clear documentation of implemented data rules and technical flows.
Issue Resolution and RCA:
• Investigate and resolve data quality issues with root cause analysis.
• Implement preventive auto-cleansing logic to address repetitive issues.
• Maintain logs and alerts for failed checks or threshold breaches.
Continuous Improvement:
• Recommend enhancements to existing pipelines for better efficiency and maintainability.
• Stay up to date with new Databricks features and data engineering best practices.
• Promote automation-first and self-healing data processes where applicable.
Preferred Skills and Qualifications:
• Hands-on SQL development experience, preferably on large-scale data platforms.
• Strong experience in Databricks, PySpark, and Delta Lake.
• Proven knowledge of data quality frameworks, data cleansing techniques, and data governance.
• Familiarity with ETL pipelines and data pipeline orchestration tools (e.g., ADF, Airflow, or Databricks Workflows).
• Good understanding of data modeling and data warehousing concepts.
• Excellent analytical and problem-solving skills with strong attention to detail.

Minimum Qualifications

• Graduation: B.Tech/B.E, MBA/MCA

Preferred Qualifications

• The candidate must be a self-starter, capable of multitasking and efficiently managing their time in a multifaceted environment with demanding deadlines while requiring minimal supervision.
• Additionally, the candidate must possess excellent writing, speaking, analytical, project management, organizational, teamwork, and customer service skills that will assist them in identifying solutions to sophisticated security problems.
• Ability to deliver high-quality, reliable software by collaborating with the team. Outstanding analytical skills and the ability to apply expertise to drive sophisticated, technical, and highly commercial solutions. Good verbal and written communication skills.
Why join Genpact?
• Lead AI-first transformation – Build and scale AI solutions that redefine industries
• Make an impact – Drive change for global enterprises and solve business challenges that matter
• Accelerate your career – Gain hands-on experience, world-class training, mentorship, and AI certifications to advance your skills
• Grow with the best – Learn from top engineers, data scientists, and AI experts in a dynamic, fast-moving workplace
• Committed to ethical AI – Work in an environment where governance, transparency, and security are at the core of everything we build
• Thrive in a values-driven culture – Our courage, curiosity, and incisiveness - built on a foundation of integrity and inclusion - allow your ideas to fuel progress
Come join the 140,000 coders, tech shapers, and growth makers at Genpact and take your career in the only direction that matters: Up.
Let’s build tomorrow together.
Genpact is an Equal Opportunity Employer and considers applicants for all positions without regard to race, color, religion or belief, sex, age, national origin, citizenship status, marital status, military/veteran status, genetic information, sexual orientation, gender identity, physical or mental disability or any other characteristic protected by applicable laws. Genpact is committed to creating a dynamic work environment that values respect and integrity, customer focus, and innovation.
Furthermore, please do note that Genpact does not charge fees to process job applications and applicants are not required to pay to participate in our hiring process in any other way. Examples of such scams include purchasing a 'starter kit,' paying to apply, or purchasing equipment or training.
