Are you passionate about driving innovation in deep learning and eager to work on cutting-edge AI technology? Join NVIDIA's TensorRT team as a Software Engineer, and be at the forefront of technology, contributing to high-performance AI inference solutions for specialized platforms and applications. Your fresh perspective and technical skills will help shape the performance and functionality of our products, ensuring NVIDIA remains synonymous with innovation. If you're ready to tackle challenging projects, push the boundaries of AI performance, and make a significant impact in a company that values creativity, excellence, and teamwork, we want to hear from you!
What you'll be doing:
Contribute to the design and development of high-performance deep learning inference software using modern C++
Collaborate with teams across the hardware and software stack to understand and leverage new technologies to improve TensorRT's functionality and performance
Participate in the development of robust, high-quality C++ code in alignment with Modern C++ standards
Support systematic reasoning about test plans from unit to integration level
Assist in documenting the properties of functions, classes, and systems to improve robustness
Contribute to performance optimization and benchmarking efforts
Help develop new features and capabilities for TensorRT to serve specialized customer needs
What we need to see:
Masters, or PhD in relevant fields (Computer Engineering, Computer Science, Electrical Engineering, AI) or equivalent experience
Strong foundational C++ skills, including familiarity with C++11 and C++14 or newer standards
Familiarity with the C++ Standard Template Library (STL)
Familiarity with modern deep learning models and inference frameworks
Interest in performance optimization and systems programming
Demonstrated ability to take initiative and see projects through to completion
Excellent interpersonal skills and a collaborative, pragmatic approach to solving problems
Ways to stand out from the crowd:
Experience with Python and/or CUDA through coursework, internships, or personal projects
Exposure to systems programming, embedded systems, and/or compiler concepts
Experience in software performance analysis, profiling, or optimization techniques
Knowledge of C++17 or later standards
Understanding of computer architecture, memory management, or parallel computing concepts
NVIDIA is widely considered to be one of the technology world's most desirable employers. We have some of the most forward-thinking and hardworking people in the world working for us. If you're creative, autonomous, and love a challenge, come join our team and help us build the future of high-performance AI inference technology!
Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 120,000 USD - 189,750 USD.You will also be eligible for equity and benefits.
Applications for this job will be accepted at least until August 30, 2025.NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.