INTERNSHIP DETAILS

Research Intern - Post-Training

Company: Microsoft
Location: Redmond
Work Mode: On Site
Posted: December 16, 2025
Internship Information
Core Responsibilities
The intern will design and evaluate datasets, contribute to model training, and develop data infrastructure. They will also assess data quality and collaborate with researchers and engineers.
Internship Type
Full-time
Salary Range
$5,610 - $11,010 per month
Company Size
226,149
Visa Sponsorship
No
Language
English
Working Hours
40 hours per week

About The Company
Every company has a mission. What's ours? To empower every person and every organization to achieve more. We believe technology can and should be a force for good and that meaningful innovation contributes to a brighter world in the future and today. Our culture doesn’t just encourage curiosity; it embraces it. Each day we make progress together by showing up as our authentic selves. We show up with a learn-it-all mentality. We show up cheering on others, knowing their success doesn't diminish our own. We show up every day open to learning our own biases, changing our behavior, and inviting in differences. Because impact matters. Microsoft operates in 190 countries and is made up of approximately 228,000 passionate employees worldwide.
About the Role
Overview

Come build community, explore your passions and do your best work at Microsoft with thousands of university interns from every corner of the world. This opportunity will allow you to bring your aspirations, talent, potential – and excitement for the journey ahead.  

The Microsoft Human Superintelligence Post-Training team advances post-training methods for both OpenAI and open-source models. We work on continual pre-training, large-scale deep RL on large GPU fleets, data curation/synthesis at scale, and practical fine-tuning for research and product. We also build language + multimodal technologies used across Microsoft, with a special focus on code-centric models for GitHub Copilot and Visual Studio Code (completion and SWE agent models). Our work connects to efforts such as LoRA, DeBERTa, Oscar, Rho-1, Florence, and the open-source Phi family. 


We prize research innovation and bold experimentation—aiming for breakthroughs that materially advance the state of the art and ship into products. 


As a Research Intern at Microsoft, you’re stepping into a world of real impact from day one. You’ll collaborate with global teams on meaningful projects, explore cutting-edge technologies such as AI, and kick-start your career while doing it. With a strong focus on learning and development, this is your opportunity to grow your skills, build community, and shape your future, all while being supported every step of the way.

 

The Microsoft AI Human Superintelligence team’s mission is to empower every person and every organization on the planet to achieve more. As employees we come together with a growth mindset, innovate to empower others, and collaborate to realize our shared goals. Each day we build on our values of respect, integrity, and accountability to create a culture of inclusion where everyone can thrive at work and beyond.

This role is part of Microsoft AI's Superintelligence Team, a startup-like team inside Microsoft AI created to push the boundaries of AI toward Humanist Superintelligence: ultra-capable systems that remain controllable, safety-aligned, and anchored to human values. Our mission is to create AI that amplifies human potential while ensuring humanity remains firmly in control. We aim to deliver breakthroughs that benefit society by advancing science, education, and global well-being. We’re also fortunate to partner with incredible product teams, giving our models the chance to reach billions of users and create immense positive impact. If you’re a brilliant, highly ambitious, low-ego individual, you’ll fit right in. Come and join us as we work on our next generation of models!

 

Microsoft’s mission is to empower every person and every organization on the planet to achieve more. As employees we come together with a growth mindset, innovate and empower others, and collaborate to realize our shared goals. Each day we build on our values of respect, integrity, and accountability to create a culture of inclusion where everyone can thrive at work and beyond.  



Responsibilities
  • Design & evaluate datasets: build high-quality datasets/benchmarks; run ablations to measure impact and improve data effectiveness 
  • Advance model training: contribute to pre-training, post-training, and RL for language and multimodal models 
  • Develop data infrastructure: extend pipelines to ingest, preprocess, filter, and annotate large, heterogeneous data
  • Data quality & analysis: assess text, image, video, audio, and code data for quality, diversity, and relevance; propose improvements 
  • Tooling & workflows: create lightweight tools for dataset auditing, visualization, and versioning to speed iteration 
  • Research & collaboration: work with researchers/engineers to push research and product boundaries with measurable impact 


Qualifications

Required Qualifications:

  • Currently enrolled in a BS/MS/PhD program in computer science, AI/ML, data science, electrical engineering, or a related field 
  • Must have at least one additional quarter/semester of school remaining following the completion of the internship.
  • Candidate must be enrolled in a full-time bachelor's, master's, MBA, or PhD program in an area relevant to the role during the academic term immediately before their internship.
  • Effective coding skills in Python and modern data/ML libraries (NumPy, Pandas, PyTorch/JAX/TF)
  • Familiarity with training/evaluating ML models and with basic data-pipeline concepts

Preferred Qualifications:

  • First-author publication(s) at top-tier AI venues (e.g., NeurIPS, ICML, ICLR, CVPR) or equivalent journals; or demonstrably comparable research impact (e.g., widely used open-source, SOTA results, benchmark wins) 
  • Experience with distributed data or training frameworks (Spark, Ray, Beam; PyTorch DDP/FSDP) and cloud ecosystems (Azure; data lakes) 
  • Exposure to large-scale, un/semi-structured datasets (images, video, audio, code) 
  • Prior work on LLMs, RL/RLHF, post-training, or multimodal models 
  • Contributions to open-source tooling or reproducible research 
  • Clear communication, self-motivation, curiosity, and a bias for hands-on experimentation

The base pay range for this internship is USD $5,610 - $11,010 per month. A different range applies to specific work locations within the San Francisco Bay Area and the New York City metropolitan area; the base pay range for this role in those locations is USD $7,270 - $12,030 per month.

Certain roles may be eligible for benefits and other compensation. Find additional benefits and pay information here: https://careers.microsoft.com/us/en/us-intern-pay 


This position will be open for a minimum of 5 days, with applications accepted on an ongoing basis until the position is filled.




Microsoft is an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to age, ancestry, citizenship, color, family or medical care leave, gender identity or expression, genetic information, immigration status, marital status, medical condition, national origin, physical or mental disability, political affiliation, protected veteran or military status, race, ethnicity, religion, sex (including pregnancy), sexual orientation, or any other characteristic protected by applicable local laws, regulations and ordinances. If you need assistance with religious accommodations and/or a reasonable accommodation due to a disability during the application process, read more about requesting accommodations.

Key Skills
Python, Data Science, Machine Learning, Deep Learning, Data Analysis, Data Curation, Data Infrastructure, Model Training, Research, Collaboration, Dataset Design, Tooling, Visualization, Communication, Self-Motivation, Curiosity
Categories
Technology, Science & Research, Data & Analytics, Software, Engineering