Software Engineer Intern
Applied Machine Learning, ML System
Posted on 9/11/2025

ByteDance
No salary listed
San Jose, CA, USA
In Person
AML-MLsys combines system engineering and the art of machine learning to develop and maintain massively distributed ML training and Inference system/services around the world, providing high-performance, highly reliable, scalable systems for LLM/AIGC/AGI.
In our team, you'll have the opportunity to build the large-scale heterogeneous system integrating with GPU/NPU/RDMA/Storage and keep it running stable and reliable, enrich your expertise in coding, performance analysis and distributed system, and be involved in the decision-making process. You'll also be part of a global team with members from the United States, China and Singapore working collaboratively towards unified project direction.
We are looking for talented individuals to join us for an internship in 2026. Internships at ByteDance aim to offer students industry exposure and hands-on experience. Watch your ambitions become reality as your inspiration brings infinite opportunities at ByteDance.
Internships at ByteDance aim to provide students with hands-on experience in developing fundamental skills and exploring potential career paths. A vibrant blend of social events and enriching development workshops will be available for you to explore. Here, you will utilize your knowledge in real-world scenarios while laying a strong foundation for personal and professional growth. It runs for 12 weeks.
Candidates can apply to a maximum of two positions and will be considered for jobs in the order you apply. The application limit is applicable to ByteDance and its affiliates' jobs globally. Applications will be reviewed on a rolling basis. We encourage you to apply as early as possible. Please state your availability clearly in your resume (Start date, End date).
Summer Start Dates:
- May 11th, 2026
- May 18th, 2026
- May 26th, 2026
- June 8th, 2026
- June 22nd, 2026
Candidates who pass resume screening will be invited to participate in ByteDance's technical online assessment.
Responsibilities:
1. Participating in online architecture design and optimization centered around LLM inference tasks, achieving high concurrency and throughput in large-scale online systems.
2. Participating in the establishment of a comprehensive system covering stability, disaster recovery, R&D efficiency, and cost, enhancing overall system stability.
3. Participating in the design and implementation of end-to-end online pipeline systems with multiple models, plugins, and storage-computation components, enabling agile, flexible, and observable continuous delivery.
4. Collaborating closely with the MLE for optimization of algorithms and systems.
5. Being proactive, optimistic, highly responsible, and demonstrating meticulous work ethic, as well as possessing strong team communication and collaboration skills.
Minimum Qualifications:
1. Currently pursuing an Undergraduate/Master in Computer Science or a related technical discipline.
2. Excellent coding skills, strong understanding of data structures, and fundamental knowledge of algorithms. Proficiency in programming languages such as C/C++, Java, Go, Python, etc.
3. Rich experience in online architecture, with the ability to troubleshoot independently.
4. Strong sense of responsibility, good learning ability, communication skills, and self-motivation.
5. Must be able to commit to a 12-week full-time work period during Summer or Fall 2026.
Preferred Qualifications:
1. Understanding of GPU hardware architecture, familiarity with GPU software stack (CUDA, cuDNN), and experience in GPU performance analysis.
2. Knowledge of LLM models, experience in accelerating LLM model optimization is preferred

Internship Search Guides
How to Find an InternshipInternship SalariesInternship DeadlinesMock Interview Prep