Multiple Locations, MEX
1 day ago
Senior Applied Scientist
Outlook is one of the most widely used communication and productivity tools in the world, and Copilot is transforming how millions of users interact with it through the power of AI. As we continue evolving Outlook into a truly intelligent, context-aware assistant, we’re seeking a **Senior Applied Scientist** to join a team of exceptional experts at the forefront of this transformation.

In this role, you will play a critical part in advancing Outlook’s Copilot initiatives in areas such as Large Language Models (LLMs), Prompt Engineering, Fine-tuning, Evaluation, Relevance, and Responsible AI (RAI). This multifaceted position involves developing end-to-end infrastructure and measurement frameworks, fostering cross-functional collaboration, and leveraging data science and AI expertise to guide strategic decisions. The successful candidate will work closely with multiple organizations and stakeholders to drive the evaluation and optimization of our LLM systems and related components.

Microsoft’s mission is to empower every person and every organization on the planet to achieve more. As employees we come together with a growth mindset, innovate to empower others, and collaborate to realize our shared goals. Each day we build on our values of respect, integrity, and accountability to create a culture of inclusion where everyone can thrive at work and beyond.

**Responsibilities**

**Strategic Technical Leadership**
Develop and execute a comprehensive strategy for LLM evaluation, covering quality, cost, model performance, utility (user experience and prompt effectiveness), and Responsible AI considerations. Ensure alignment with company-wide initiatives and leverage emerging research. Drive cost-effective solutions such as Small Language Models (SLMs) or fine-tuned models.

**Data Science Expertise**
Apply strong data science skills to design experiments, analyze data, define OKRs, and establish measurement frameworks. Derive actionable insights to improve LLM systems and influence product direction and user experience based on evaluation outcomes.

**Model and Prompt Evaluation**
Lead efforts to assess and enhance the performance and effectiveness of language models and prompts. Drive iterative improvements, including the creation of synthetic and curated datasets.

**Program Management**
Oversee large-scale, cross-functional evaluation programs, ensuring alignment with organizational goals and timelines. Develop and maintain robust measurement frameworks to track and report LLM performance and user impact. Partner with engineering teams to build automated evaluation pipelines integrated into product workflows.

**User Experience Enhancement**
Collaborate with UX teams to evaluate and optimize user interactions with AI systems, improving satisfaction and usability.

**Responsible AI (RAI)**
Implement Responsible AI and DSB principles to ensure ethical, unbiased practices in model development and deployment.

**Research Contributions**
Form partnerships and lead deep research initiatives in LLM evaluation and user experience optimization. Contribute to the scientific community and strengthen the team’s understanding of user mental models and alignment with LLM-powered experiences.

**Cross-Functional Collaboration**
Work closely with engineering, research, and product teams to seamlessly integrate evaluation processes into the development lifecycle.

**Qualifications**

**Required Qualifications:**

+ Bachelor's Degree in Statistics, Econometrics, Computer Science, Electrical or Computer Engineering, or related field AND 4+ years of related experience (e.g., statistics, predictive analytics, research)
+ OR Master's Degree in Statistics, Econometrics, Computer Science, Electrical or Computer Engineering, or related field AND 3+ years of related experience (e.g., statistics, predictive analytics, research)
+ OR Doctorate in Statistics, Econometrics, Computer Science, Electrical or Computer Engineering, or related field AND 1+ year(s) of related experience (e.g., statistics, predictive analytics, research)
+ OR equivalent experience.
+ Extensive experience in data science, machine learning, experimentation, and AI, with a strong track record of delivering impactful results.
+ Expertise in LLM fine-tuning, reinforcement learning, evaluation techniques, RAG implementation, agentic workflows, and industry best practices.
+ Proven expertise in program management and leading cross-functional teams.
+ Proficiency in Python development.
+ Strong understanding of Responsible AI principles.

**Preferred Qualifications:**

+ Ability to work in a fast-paced and dynamic environment.
+ Excellent analytical and problem-solving skills.
+ Exceptional communication and presentation abilities.

**Other Requirements:**

The ability to meet Microsoft, customer, and/or government security screening requirements is required for this role. These requirements include but are not limited to the following specialized security screenings:

+ **Microsoft Cloud Background Check:** This position will be required to pass the Microsoft Cloud background check upon hire/transfer and every two years thereafter.

Microsoft is an equal opportunity employer. Consistent with applicable law, all qualified applicants will receive consideration for employment without regard to age, ancestry, citizenship, color, family or medical care leave, gender identity or expression, genetic information, immigration status, marital status, medical condition, national origin, physical or mental disability, political affiliation, protected veteran or military status, race, ethnicity, religion, sex (including pregnancy), sexual orientation, or any other characteristic protected by applicable local laws, regulations and ordinances. If you need assistance and/or a reasonable accommodation due to a disability during the application process, read more about requesting accommodations (https://careers.microsoft.com/v2/global/en/accessibility.html).