INTERNSHIP DETAILS
2026 Summer Intern - Large Language Models (Prescient Design / AI for Drug Discovery)
Company
Genentech
Location
New York
Work Mode
On Site
Posted
December 27, 2025

Internship Information
Core Responsibilities
The intern will design and implement data transformation and modeling pipelines for large language models in drug discovery. They will collaborate with researchers and engineers to evaluate and analyze scientific datasets.
Internship Type
Full-time
Company Size
18,119
Visa Sponsorship
No
Language
English
Working Hours
40 hours per week
About The Company
About Genentech
We're passionate about finding solutions for people facing the world's most difficult-to-treat conditions. That is why we use cutting-edge science to create and deliver innovative medicines around the globe. To us, science is personal.
Making a difference in the lives of millions starts when you make a change in yours. If you’d like to join our team, view our openings at gene.com/careers.
Our patient resource center is dedicated to getting patients and caregivers to the right resources. You can reach them at 1 (877) GENENTECH (436-3683), Monday-Friday, 6am-5pm PST, or at patientinfo@gene.com.
Community Guidelines:
1. We want to foster positive conversation around the issues we are passionate about. To that end, we remove profanity, content that contains threatening language, content that is aimed at private individuals, personal information, and repeated unwanted messages.
2. Don’t mention any medicines by name — ours or anyone else’s.
Because of the fair balance rules governing our industry, we cannot post any comments that reference any pharmaceutical brand, product, or service. Please do not mention any specific medicines by name or include any links to third-party sites in your comments.
3. This isn’t the place to report or discuss side effects.
This site is not intended as a forum for reporting side effects experienced while taking a Genentech product. Instead, you should report any side effects to Genentech Drug Safety at 1-888-835-2555. You can also report side effects of any prescription product directly to the FDA at 1-800-FDA-1088 or by visiting www.FDA.gov/medwatch.
4. Don’t pitch your product or service.
Please don't use our page as a place to promote your product or pitch your services. Please also avoid posting links to external sites. We reserve the right to remove any posts that are deemed promotional.
About the Role
<h3>The Position</h3>
<p><b>2026 Summer Intern - Large Language Models (Prescient Design / AI for Drug Discovery)</b></p>
<p><b>Department Summary</b></p>
<p>At Roche's AI for Drug Discovery (AIDD) group (formerly Prescient Design), we are revolutionizing drug discovery with cutting-edge machine learning techniques. We are seeking talented researchers and engineers with a passion for building machine learning systems that transform how scientific data is represented, modeled, and evaluated.</p>
<p>AIDD’s Foundation Model team is seeking a Machine Learning Research Intern to work on data interfaces between structured biochemical measurements and large language models, supporting next-generation foundation models for drug discovery as part of our broader Lab-in-the-Loop approach.</p>
<p>The intern will collaborate closely with researchers and engineers to design, implement, and evaluate data transformation and modeling pipelines, gaining hands-on experience with real-world scientific datasets and foundation-model workflows. This role is well suited for candidates who enjoy careful technical reasoning, experimentation, and building reusable components that sit at the intersection of machine learning and scientific data.</p>
<p>The group provides a dynamic and challenging environment for multidisciplinary research, including access to heterogeneous data sources, close links to top academic institutions around the world, and collaborations with internal Genentech and Roche teams.</p>
<p><b>This internship position is on-site in New York City, NY.</b></p>
<p><b>The Opportunity</b></p>
<ul>
<li><p>Work on data and evaluation components that support large language models for scientific discovery and drug development.</p></li>
<li><p>Help define and implement interfaces between structured scientific data and natural-language model inputs and outputs.</p></li>
<li><p>Participate in the evaluation of LLM behavior, including robustness, calibration, and consistency across tasks and datasets.</p></li>
<li><p>Design and run experiments to study how data representation and preprocessing choices influence model performance.</p></li>
<li><p>Contribute production-quality code, documentation, and tests to shared internal libraries.</p></li>
</ul>
<p><b>Program Highlights</b></p>
<ul>
<li><p><b>Intensive 12-week, full-time (40 hours per week) paid internship.</b></p></li>
<li><p><b>Program start dates are in May/June.</b></p></li>
<li><p><b>A stipend, based on location, will be provided to help alleviate costs associated with the internship.</b></p></li>
<li><p>Ownership of challenging and impactful business-critical projects.</p></li>
<li><p>Work with some of the most talented people in the biotechnology industry.</p></li>
</ul>
<h3>Who You Are</h3>
<p><b>Required Education</b></p>
<ul>
<li><p>Must be pursuing a Master's Degree (enrolled student).</p></li>
<li><p>Must be pursuing a PhD (enrolled student).</p></li>
</ul>
<p><b>Required Majors</b><br />Computer Science, Machine Learning, Data Science, Bioinformatics or Computational Biology, Statistics, Applied Mathematics, Physics, or a related quantitative field</p>
<p><b>Required Skills</b></p>
<ul>
<li><p>Strong programming skills, particularly in Python, with experience writing clean and maintainable code.</p></li>
<li><p>Solid understanding of machine learning or NLP fundamentals, including model training and evaluation concepts.</p></li>
<li><p>Experience working with structured scientific or technical data (e.g., tables, fields, or schemas) in the context of data analysis or modeling.</p></li>
<li><p>Ability to reason carefully about experimental results and communicate technical ideas clearly.</p></li>
</ul>
<p><b>Preferred Knowledge, Skills, and Qualifications</b></p>
<ul>
<li><p>Excellent communication, collaboration, and interpersonal skills.</p></li>
<li><p>Complements our culture and the standards that guide our daily behavior &amp; decisions: Integrity, Courage, and Passion.</p></li>
<li><p>Familiarity with biological or biochemical data (e.g., proteins, antibodies, or assays).</p></li>
</ul>
<p><b>Relocation benefits are not available for this job posting.</b></p>
<p>The expected pay rate for this position, based on the primary location of New York City, is $50.00 per hour. Actual pay will be determined based on experience, qualifications, geographic location, and other job-related factors permitted by law. This position also qualifies for paid holiday time off benefits.</p>
<p style="text-align:left"><span>Genentech is an equal opportunity employer. It is our policy and practice to employ, promote, and otherwise treat any and all employees and applicants on the basis of merit, qualifications, and competence. The company's policy prohibits unlawful discrimination, including but not limited to, discrimination on the basis of Protected Veteran status, individuals with disabilities status, and consistent with all federal, state, or local laws.</span></p>
<p style="text-align:left"><span>If you have a disability and need an accommodation in relation to the online application process, please contact us by completing this form <a target="_blank" href="https://docs.google.com/forms/d/e/1FAIpQLSdZWlsbfQOvFVIQgHE_iDzWUTlhZvj6FytIzjS7xq6IGh1H5g/viewform">Accommodations for Applicants</a>.</span></p>
Key Skills
Machine Learning, NLP, Data Analysis, Programming, Python, Biochemical Data, Technical Communication, Experimental Design
Categories
Science & Research, Technology, Healthcare, Data & Analytics, Engineering
Benefits
Paid Holiday Time Off