Job Title: AI Engineer - Document Intelligence with Big Data Expertise
Position Type: Full-time
Location: Gurgaon (Relocation will be required based upon project to Kanpur/ BLR/ Jaipur across India.
Job Summary:
As an AI Engineer with a specialization in Document Intelligence and Big Data, you will be at the forefront of designing and implementing AI solutions that leverage large-scale data processing technologies. Your role will involve integrating document and image analysis with big data ecosystems to derive insights from vast amounts of structured and unstructured data.
Key Responsibilities:
- Develop and optimize AI models for document classification, text extraction, and image processing, ensuring they are capable of handling big data volumes.
- Implement NLP and computer vision algorithms to analyze and interpret complex documents and images within large datasets.
- Utilize big data technologies such as Hadoop, Spark, and Hive to store, process, and analyze document-related data efficiently.
- Work with OCR and image processing tools to convert documents into analyzable data, enhancing accuracy with big data processing techniques.
- Establish and maintain a robust MLOps framework to manage the lifecycle of AI models within a big data environment.
- Collaborate with data engineering teams to design and implement scalable data pipelines for real-time and batch processing of documents.
- Monitor and optimize the performance of AI systems, ensuring they meet the scalability and reliability requirements of big data applications.
- Apply best practices in data governance, ensuring compliance with data privacy and security standards when dealing with sensitive information.
- Engage with stakeholders to gather requirements and deliver AI solutions that provide actionable insights from document data at scale.
- Document and share knowledge on big data technologies and AI model development within the organization.
Qualifications:
- Bachelor’s or master’s degree in computer science, Artificial Intelligence, Machine Learning.
- Proven experience with AI, machine learning, image processing, NLP, and big data technologies.
- Proficiency in programming languages such as Python, Java, Scala, or similar.
- Strong understanding of big data ecosystems, including HDFS, Hive, and Spark.
- Experience with machine learning frameworks (e.g., TensorFlow, PyTorch) and computer vision libraries (e.g., OpenCV).
- Knowledge of OCR technologies and their application in big data contexts.
- Familiarity with MLOps and AI Ops practices, particularly in big data environments.
- Excellent analytical and problem-solving skills, with a track record of working on complex data-driven projects.
Preferred Skills:
- Experience with cloud-based big data services
- Proficiency with data warehousing solutions and SQL-based query engines (e.g., Hive, Presto).
- Knowledge of containerization (Docker), orchestration (Kubernetes), and microservices architecture.
- Experience with CI/CD tools (e.g., Jenkins, GitLab CI) and infrastructure as code (e.g., Terraform, Ansible).
- Understanding of data privacy regulations and their implications for big data processing.
Work Experience: Minimum 2yrs in AI/ML