Blue Ash, OH, 45242, USA
13 days ago
ML Ops Engineer
Job Description

We are looking for a highly skilled ML Ops Engineer to join our team. The ML Ops Engineer will be responsible for developing, deploying, and maintaining machine learning models and infrastructure. This role requires collaboration with various teams, including data science, engineering, and operations, to support and enhance our machine learning capabilities.

Abilities/Skill and Other Requirements

Exceptional technical skills are assumed.

Key Responsibilities

Develop, deploy, and maintain machine learning models, ensuring their reliability, performance, and scalability.
Develop, deploy, and maintain machine learning tools.
Automate ML workflows.
Monitor model performance and troubleshoot issues to ensure high availability and performance.
Collaborate with data science, engineering, and operations teams to support and enhance the machine learning infrastructure.
Implement and maintain security best practices for ML systems.
Develop and maintain documentation for ML workflows, procedures, and processes.
Manage infrastructure risk, develop mitigation plans, and escalate decisions and unresolved issues daily.
Work with peers to develop and drive goals and to define technical specifications and detailed implementation plans for ML projects.
Effectively apply skills to impact ML infrastructure decisions.
Focus on the benefits to be realized and the outcomes to be achieved.

We are a company committed to creating inclusive environments where people can bring their full, authentic selves to work every day. We are an equal opportunity employer that believes everyone matters. Qualified candidates will receive consideration for employment opportunities without regard to race, religion, sex, age, marital status, national origin, sexual orientation, citizenship status, disability, or any other status or characteristic protected by applicable laws, regulations, and ordinances.
If you need assistance and/or a reasonable accommodation due to a disability during the application or recruiting process, please send a request to Human Resources Request Form (https://airtable.com/app21VjYyxLDIX0ez/shrOg4IQS1J6dRiMo). The EEOC "Know Your Rights" Poster is available here (https://www.eeoc.gov/sites/default/files/2023-06/22-088_EEOC_KnowYourRights6.12ScreenRdr.pdf). To learn more about how we collect, keep, and process your private information, please review Insight Global's Workforce Privacy Policy: https://insightglobal.com/workforce-privacy-policy/

Skills and Requirements

Experience with a few of the following ML Ops/AI tools and frameworks: Jupyter, NVIDIA NGC, MLflow, and Run:ai.
Experience with a few of the following: Docker, Slurm, Python, Conda.
Experience with cloud platforms such as Azure or Google Cloud.
Kubernetes.
PyTorch/torchrun/TorchX.

We are a company committed to creating diverse and inclusive environments where people can bring their full, authentic selves to work every day. We are an equal employment opportunity/affirmative action employer that believes everyone matters. Qualified candidates will receive consideration for employment without regard to race, color, ethnicity, religion, sex (including pregnancy), sexual orientation, gender identity and expression, marital status, national origin, ancestry, genetic factors, age, disability, protected veteran status, military or uniformed service member status, or any other status or characteristic protected by applicable laws, regulations, and ordinances. If you need assistance and/or a reasonable accommodation due to a disability during the application or the recruiting process, please send a request to HR@insightglobal.com.