CUDA Math Libraries Intern
2026
Posted on 11/3/2025

NVIDIA
No salary listed
Aachen, Germany + 1 more
More locations: Munich, Germany
Hybrid
null
NVIDIA is looking for software engineering interns for our teams which develop accelerated mathematical and data processing libraries like cuSparse, cuBLAS, cuSOLVER, cuFFT, nvImgCodec, nvComp, nvmath-python or DALI, that are a key part of high-performance computing and deep learning software stacks. The main purpose of these libraries is to provide the fastest primitives for linear algebra (dense and sparse), quantum computing and computer vision. Does the idea of being at the heart of these projects and applying your knowledge to make an impact around world sound exciting? If yes, then come and join our team!
In this role you will be part of our team responsible for the development of libraries that provide groundbreaking functionality and performance. The internship may include extending the capabilities of existing as well as building new libraries that will be used in various AI and HPC applications. It will involve working with senior software engineers who will provide mentorship and guidance. The project will include implementing new algorithms, defining APIs, analyzing performance, finding appropriate solutions for difficult numerical corner cases, and other general software engineering work.
What you’ll be doing:
Collaborate with team members to understand software use cases and requirements
Analyze the performance of GPU or CPU implementations and find opportunities for improvements
Prototype and develop algorithms for single node and multi GPU clusters
What we need to see:
Studying towards a MS or PhD degree in Computational Science, Computer Science, Applied Mathematics, Engineering, or a related field.
Programming skills (C/C++, Python)
Parallel or GPU programming experience (AVX, NEON, OpenMP, MPI, SHMEM, CUDA or OpenCL)
Ways to stand out from the crowd:
Exposure to floating-point arithmetic and numerical error analysis.
Knowledge of GPU/CPU and network hardware architecture.
Understanding of composability and fusions, compilers, and implementation of programming languages
Experience implementing sparse or dense linear algebra algorithms.
Experience with domain-specific language design and compiler optimizations, in particular sparse compilers (MLIR or TACO)
NVIDIA is widely considered to be one of the technology world's most desirable employers. We have some of the most forward-thinking and hardworking people in the world working for us. If you're creative, autonomous, and love a challenge, we want to hear from you!
#LI-Hybrid
Internships by Season
Summer InternshipsFall InternshipsWinter & Spring InternshipsCo-op InternshipsLatest InternshipsInternship Search Guides
How to Find an InternshipInternship SalariesInternship DeadlinesMock Interview Prep