Bangalore, Undisclosed, India
23 hours ago
AI Model Architect

Meet the Team


We are an innovation team on a mission to transform how enterprises harness AI. Operating with the agility of a startup and the focus of an incubator, we’re building a tight-knit group of AI and infrastructure experts driven by bold ideas and a shared goal: to rethink systems from the ground up and deliver breakthrough solutions that redefine what's possible — faster, leaner, and smarter.

We thrive in a fast-paced, experimentation-rich environment where new technologies aren’t just welcome — they’re expected. Here, you'll work side-by-side with seasoned engineers, architects, and thinkers to craft the kind of iconic products that can reshape industries and unlock entirely new models of operation for the enterprise.

If you're energized by the challenge of solving hard problems, love working at the edge of what's possible, and want to help shape the future of AI infrastructure — we'd love to meet you.

IMPACT 

Cisco is seeking a forward-thinking AI Model Architect to drive the development of AI model and dataset design supporting the next-generation AI infrastructure platform. This is a key role at the intersection of infrastructure and AI systems, where you'll drive the architecture and execution of AI and Generative AI to develop novel solutions that improve monitoring, deployment, and management of AI applications running at scale. Your work will directly impact how enterprises deploy, scale, and optimize AI workloads.

As an Architect, you will be responsible for both mentoring a high-caliber team as well as hands-on design to deliver robust AI models and workflows that learn from, recommend, and optimize the uptime, quality, and performance of customer infrastructure. You’ll also guide strategic direction on resource utilization in generative AI systems, working cross-functionally to align product direction with infrastructure capabilities.

 Key Responsibilities: 

Architect and design datasets from infrastructure and operational telemetry. Architect, select, and fine-tune/train AI and Generative AI model that can detect patterns on time series datasets.Fine-tune and train AI and Generative AI models that can interreact with tools and take actions.Understanding k8s and other infrastructure components and their usages.Build Model Context Protocol (MCP) tools to support Agentic WorkflowsDemonstrate a deep understanding of AI frameworks that support Nvidia and AMD GPUs.Plan and coordinate software engineering work, map tasks to releases, conduct code reviews, and resolve technical challenges to unblock releases.Generate architecture specifications and build proof-of-concept (POC) solutions for clarity when needed. Collaborate with product management to understand customer requirements and build architecturally sound solutions. Work closely with engineers on implementation and track progress to ensure alignment with architectural requirements.

Minimum Qualifications:

Demonstrable experience in following AI and Generative AI model training and fine-tuning. KV Cache management and context length impact in LLM inferencing. Python, PyTorch, TensorRT and other AI frameworks CUDA, Nsight and other nvidia tools. vLLM, LLM-D and other runtime for LLMs. Comprehensive understanding of software release processes Proficiency in using agents building pipelines.  Bachelor’s degree or equivalent with 10+ years of engineering experience.

 Preferred Qualifications: 

Demonstrable technical leadership through publish papers in industry conferences and publications, and issued patentsProven leadership experience in architecture and design of Retrieval Augmented Generation workflows.Demonstrable experience collecting and using system metrics in AI training/fine-tuning and inferenceMaster’s degree or equivalent.

#WeAreCisco 

#WeAreCisco where every individual brings their unique skills and perspectives together to pursue our purpose of powering an inclusive future for all.

Our passion is connection—we celebrate our employees’ diverse set of backgrounds and focus on unlocking potential. Cisconians often experience one company, many careers where learning and development are encouraged and supported at every stage. Our technology, tools, and culture pioneered hybrid work trends, allowing all to not only give their best, but be their best.

We understand our outstanding opportunity to bring communities together and at the heart of that is our people. One-third of Cisconians collaborate in our 30 employee resource organizations, called Inclusive Communities, to connect, foster belonging, learn to be informed allies, and make a difference. Dedicated paid time off to volunteer—80 hours each year—allows us to give back to causes we are passionate about, and nearly 86% do! 

Our purpose, driven by our people, is what makes us the worldwide leader in technology that powers the internet. Helping our customers reimagine their applications, secure their enterprise, transform their infrastructure, and meet their sustainability goals is what we do best. We ensure that every step we take is a step towards a more inclusive future for all. Take your next step and be you, with us!


Por favor confirme su dirección de correo electrónico: Send Email