Research Scientist Intern

Vision-Language and Embodied AI

Posted on 11/6/2025

Meta

Compensation Overview

$7,650 - $12,134/mo

Redmond, WA, USA

In Person

California residents/workers may have additional location information via the provided link.

Reality Labs Research is seeking a Research Scientist Intern to help develop the next generation of assistance systems that guide users in contextual and adaptive environments. We welcome candidates with expertise in embodied AI, reinforcement learning, planning, multimodal learning, vision-language models, LLM interpretability, world model learning, and pose estimation (including hand and object pose). Our internships are twelve (12) to twenty-four (24) weeks long, with various start dates throughout the year.
Research Scientist Intern, Vision-Language and Embodied AI (PhD) Responsibilities
  • Plan and execute cutting-edge research on embodied AI algorithms, assistance policies, vision-language models, and world model learning for complex, real-world interaction tasks.
  • Develop, implement, and evaluate methods for improving the performance and interpretability of VLMs and related AI/ML models.
  • Leverage state-of-the-art simulators, RL/DRL, neuro-symbolic, AI planning, robotics, stochastic programming, and multimodal learning methods.
  • Write modular, reusable research code and utilize Meta’s large infrastructure to scale experimentation.
  • Collaborate cross-functionally with researchers and engineers to prototype and test models at scale.
  • Deliver clear, compelling, and creative solutions to challenging problems.
  • Work should result in publishable research in top-tier journals or conferences (e.g., NeurIPS, ICLR, CVPR, ECCV, ICML, ICCV, AAAI, IJCAI, ICRA, IEEE T-PAMI, IJCV, IEEE RA-L, etc.).
Minimum Qualifications
  • Currently has, or is in the process of obtaining, a PhD in Machine Learning, Artificial Intelligence, Computer Vision, Robotics, Speech Processing, Applied Statistics, Computational Neuroscience, Algorithms, Computational Mathematics, or a related field
  • Proven research skills: problem definition, solution exploration, analysis, and presentation of results
  • 2+ years of experience in Python and machine learning libraries (NumPy, scikit-learn, SciPy, pandas, Matplotlib, TensorFlow, PyTorch)
  • Understanding of at least one of the following: embodied AI, reinforcement learning, planning, transfer/few-shot/zero-shot/continual/online learning, self-supervised learning, multi/cross-modal learning, vision-language models, LLM interpretability, world model learning, hand pose estimation, or object pose estimation
  • Must obtain work authorization in the country of employment at the time of hire, and maintain ongoing work authorization during employment
Preferred Qualifications
  • Proven track record of significant results: grants, fellowships, patents, and first-authored publications at leading workshops or conferences (e.g., NeurIPS, ICLR, CVPR, ECCV, ICML, ICCV, AAAI, IJCAI, ICRA, IEEE T-PAMI, IJCV, IEEE RA-L, etc.)
  • Experience with VLM/LLM training/fine-tuning and solving traditional CV problems (e.g., hand/body pose estimation, object pose estimation, image classification/segmentation, image/video understanding, 3D scene reconstruction)
  • Experience working and communicating cross-functionally in a team environment
  • Intent to return to the degree program after the completion of the internship/co-op
  • Availability for an internship of at least 16 consecutive weeks
For those who live in or expect to work from California if hired for this position, please click here for additional information.
About Meta
Meta builds technologies that help people connect, find communities, and grow businesses. When Facebook launched in 2004, it changed the way people connect. Apps like Messenger, Instagram and WhatsApp further empowered billions around the world. Now, Meta is moving beyond 2D screens toward immersive experiences like augmented and virtual reality to help build the next evolution in social technology. People who choose to build their careers by building with us at Meta help shape a future that will take us beyond what digital connection makes possible today—beyond the constraints of screens, the limits of distance, and even the rules of physics.

$7,650/month to $12,134/month + benefits

Individual compensation is determined by skills, qualifications, experience, and location. Compensation details listed in this posting reflect the base hourly rate, monthly rate, or annual salary only, and do not include bonus, equity or sales incentives, if applicable. In addition to base compensation, Meta offers benefits. Learn more about benefits at Meta.


Equal Employment Opportunity
Meta is proud to be an Equal Employment Opportunity employer. We do not discriminate based upon race, religion, color, national origin, sex (including pregnancy, childbirth, reproductive health decisions, or related medical conditions), sexual orientation, gender identity, gender expression, age, status as a protected veteran, status as an individual with a disability, genetic information, political views or activity, or other applicable legally protected characteristics. You may view our Equal Employment Opportunity notice here.
Meta is committed to providing reasonable accommodations for qualified individuals with disabilities and disabled veterans in our job application procedures. If you need assistance or an accommodation due to a disability, fill out the Accommodations request form.