Job Title:
AI- LLMOps EngineerJob Description
We're Concentrix. The intelligent transformation partner. Solution-focused. Tech-powered. Intelligence-fueled.The global technology and services leader that powers the world’s best brands, today and into the future. We’re solution-focused, tech-powered, intelligence-fueled. With unique data and insights, deep industry expertise, and advanced technology solutions, we’re the intelligent transformation partner that powers a world that works, helping companies become refreshingly simple to work, interact, and transact with. We shape new game-changing careers in over 70 countries, attracting the best talent.
The Concentrix Catalyst team is the driving force behind Concentrix’s transformation, data, and technology services. We integrate world-class digital engineering, creativity, and a deep understanding of human behavior to find and unlock value through tech-powered and intelligence-fueled experiences. We combine human-centered design, powerful data, and strong tech to accelerate transformation at scale. You will be surrounded by the best in the world providing market leading technology and insights to modernize and simplify the customer experience. Within our professional services team, you will deliver strategic consulting, design, advisory services, market research, and contact center analytics that deliver insights to improve outcomes and value for our clients. Hence achieving our vision.
Our game-changers around the world have devoted their careers to ensuring every relationship is exceptional. And we’re proud to be recognized with awards such as \"World's Best Workplaces,\" “Best Companies for Career Growth,” and “Best Company Culture,” year after year.
Join us and be part of this journey towards greater opportunities and brighter futures.
Position Overview
We are seeking a skilled LLMOps Engineer with expertise in operationalizing Generative AI solutions to join our AI Engineering Center of Excellence. This role will focus on establishing robust infrastructure, deployment pipelines, and monitoring systems to ensure the reliable, secure, and scalable delivery of LLM-based applications in production environments. The LLMOps Engineer will work closely with AI Tech Leads and Senior Engineers to bridge the gap between development and production deployment of GenAI solutions.
Primary Responsibilities
Design and implement infrastructure and deployment pipelines for large language model (LLM) applications in production environmentsEstablish monitoring, observability, and logging systems for GenAI applications to ensure performance, reliability, and data qualityDevelop automated testing frameworks specific to LLM applications, including evaluation of model outputs and prompt effectivenessImplement version control systems for models, prompts, and configurations to ensure reproducibility and traceabilityCreate and maintain CI/CD pipelines for seamless deployment of GenAI solutionsOptimize infrastructure and implementations for cost efficiency, considering compute resources and API usageImplement security controls and compliance measures specific to GenAI applicationsCollaborate with development teams to establish best practices for transitioning GenAI solutions from prototype to productionAutomate feedback loops for continuous improvement of deployed modelsDocument operational procedures, architecture decisions, and maintenance protocolsRequired Qualifications
5+ years of experience in DevOps, platform engineering, or related roles with at least 2+ years focused on ML/AI systemsHands-on experience with cloud infrastructure and services for AI workloads (AWS, Azure, GCP)Strong programming skills in languages commonly used for infrastructure and automation (Bash, YAML)Experience with containerization and orchestration technologies (Docker, Kubernetes) for AI workloadsKnowledge of LLM deployment patterns and associated infrastructure requirementsFamiliarity with monitoring tools and techniques for AI systems (e.g., model performance, drift detection, cost tracking)Understanding of CI/CD principles and experience implementing automated pipelinesExperience with infrastructure-as-code tools (Terraform, CloudFormation, etc.)Basic understanding of LLM architectures and their operational requirementsBachelor's degree in Computer Science, Engineering, or related technical fielddPreferred Skills
Experience deploying and managing production LLM applications at scaleKnowledge of vector database operations and optimization for RAG implementationsFamiliarity with API gateway management and rate limiting strategiesExperience with distributed tracing and debugging complex AI systemsUnderstanding of data privacy, security, and compliance considerations for GenAI applicationsKnowledge of cost optimization techniques for LLM inference and embedding generationExperience with feature flagging and A/B testing frameworks for AI applicationsFamiliarity with LLM evaluation metrics and automated testing approachesExperience with GPU resource management and optimizationSuccess Factors
Strong technical curiosity and willingness to explore new GenAI capabilitiesBalance between operational excellence and enabling rapid innovationStrong problem-solving skills for troubleshooting complex production issuesEffective communication across technical and non-technical stakeholdersProactive approach to identifying and mitigating operational risksAbility to translate business requirements into operational specificationsCommitment to continuous improvement of operational processesAdaptability to rapidly evolving GenAI technologies and deployment patternsLocation:
IND Work-at-HomeLanguage Requirements:
Time Type:
Full timeIf you are a California resident, by submitting your information, you acknowledge that you have read and have access to the Job Applicant Privacy Notice for California Residents