INTERNSHIP DETAILS
2026 Summer Intern - Regev Lab - Bayesian Optimization with LLMs
CompanyGenentech
LocationDaly City
Work ModeOn Site
PostedFebruary 4, 2026

Internship Information
Core Responsibilities
The internship involves establishing formal performance guarantees and convergence properties for a new methodology integrating probabilistic search with LLM suggestions. The methodology will be validated on high throughput functional genomics data to develop new methods for automated discovery.
Internship Type
full time
Company Size
18078
Visa Sponsorship
No
Language
English
Working Hours
40 hours
Apply Now →
You'll be redirected to
the company's application page
About The Company
About Genentech
We're passionate about finding solutions for people facing the world's most difficult-to-treat conditions. That is why we use cutting-edge science to create and deliver innovative medicines around the globe. To us, science is personal.
Making a difference in the lives of millions starts when you make a change in yours. If you’d like to join our team, view our openings at gene.com/careers.
Our patient resource center is dedicated to getting patients and caregivers to the right resources. You can reach them at 1 (877) GENENTECH (436-3683)
Monday-Friday, 6am-5pm PST or patientinfo@gene.com.
Community Guidelines:
1. We want to foster positive conversation around the issues we are passionate about. To that end, we remove profanity, content that contains threatening language, content that is aimed at private individuals, personal information, and repeated unwanted messages.
2. Don’t mention any medicines by name — ours or anyone else’s.
Because of the fair balance rules governing our industry, we cannot post any comments that reference any pharmaceutical brand, product, or service. Please do not mention any specific medicines by name, or include any links to third party sites in your comments.
3. This isn’t the place to report or discuss side effects.
This site is not intended as a forum for reporting side effects experienced while taking a Genentech product. Instead, you should report any side effects to Genentech Drug Safety at 1-888-835-2555. You can also report side effects of any prescription product directly to the FDA at 1-800-FDA-1088 or by visiting www.FDA.gov/medwatch.
4. Don’t pitch your product or service.
Please don't use our page as a place to promote your product or pitch your services. Please also avoid posting links to external sites. We reserve the right to remove any posts that are deemed promotional.
About the Role
<h3>The Position</h3><p><u><b>2026 Summer Intern - Regev Lab - Bayesian Optimization with LLMs</b></u></p><p></p><p><b><b>Department Summary</b></b></p><p></p><p><span>Many real world optimization problems, such as the design of experiments in biological or chemical domains, the tuning of hyperparameters in machine learning systems, and the allocation of resources under uncertainty, are both expensive and high dimensional. Traditional algorithms for such black box or bandit optimization rely primarily on carefully chosen surrogate models, including Gaussian Processes, random forests, or Bayesian neural networks, to guide the search. While these methods provide a foundation for uncertainty quantification, they often struggle to incorporate the vast qualitative insights or latent domain knowledge available through modern generative models like LLMs. </span></p><p></p><p><span>The project aims to develop a unified framework that integrates probabilistic search with LLM suggestions, maintaining control over the optimization landscape while leveraging external information cues. Research will address the fundamental challenge of balancing data driven discovery with potentially noisy or heuristic insights through a principled synthesis of robust surrogates and agentic reasoning.</span></p><p></p><p><span>This internship position is located in</span><b><b> </b></b><b>South San Francisco, on-site. </b></p><p></p><p><b><b>The Opportunity</b></b></p><p></p><p><span>A central component of the internship involves establishing formal performance guarantees and convergence properties for the proposed methodology. By demonstrating that the framework maintains reliable behavior even when incorporating non-traditional suggestions, the project ensures the method scales effectively as experimental data accumulates. The methodology will be validated on high throughput functional genomics data where efficient search is critical due to the scale and cost of physical experiments. By establishing a robust loop that integrates generative insights with experimental results, the research aims to develop new methods for automated discovery suitable for submission to machine learning or computational biology venues.</span></p><p></p><p><b><b>Program Highlights</b></b></p><ul><li><p><b><b>Intensive </b><b>12-weeks,</b><b> full-time (40 hours per week) paid internship.</b></b></p></li><li><p><b><b>Program start dates are in</b><b> May/June 2026. </b></b></p></li><li><p><b><b>A stipend, based on location, will be provided to help alleviate costs associated with the internship. </b></b></p></li><li><p><span>Ownership of challenging and impactful business-critical projects.</span></p></li><li><p><span>Work with some of the most talented people in the biotechnology industry.</span></p></li></ul><p></p><p><b><b>Who You Are </b></b></p><p><b><b>Required Education:</b></b></p><ul><li><p>Must be pursuing a Master's Degree (enrolled student).</p></li><li><p>Must be pursuing a PhD (enrolled student).</p></li></ul><p></p><p><b><b>Required Majors: </b></b>Computer Science, Electrical Engineering, Machine Learning, Artificial Intelligence, Computational Biology, or a closely related field.</p><p></p><p><b><b>Required Skills: </b></b></p><ul><li><p><span>Advanced Python, ML Frameworks, and Experimental Management: Proficiency in Python and experience managing high-dimensional datasets and computationally intensive optimization loops using cluster or HPC job schedulers (e.g., SLURM) and multi-GPU environments; experience with optimization libraries is preferred (e.g., PyTorch, GPyTorch, BoTorch).</span></p></li><li><p><span>Bayesian Optimization and Probabilistic Modeling: A strong background in Gaussian Processes, acquisition functions, and the theoretical underpinnings of Bayesian optimization, including familiarity with regret-based analysis and convergence proofs.</span></p></li><li><p><span>LLM Integration and Agentic Workflows: Experience implementing and evaluating LLM-based agents, specifically for structured knowledge retrieval or hypothesis generation in scientific domains.</span></p></li><li><p><span>Computational Biology and Genomic Data (Optional): Data handling skills relevant to high-throughput screens, including the analysis of single-cell RNA-seq (Perturb-seq) or related transcriptomic data to evaluate model performance.</span></p></li><li><p><span>Research Rigor and Documentation: Proven ability to read and implement complex mathematical and biological research papers, design reproducible experimental pipelines, and document theoretical derivations clearly.</span></p></li><li><p><span>Scientific Communication: Strong skills in synthesizing complex results at the intersection of machine learning and biology for presentation to cross-functional research teams.</span></p></li></ul><p></p><p><b><b>Preferred Knowledge, Skills, and Qualifications</b></b></p><ul><li><p><span>Excellent communication, collaboration, and interpersonal skills.</span></p></li><li><p><span>Complements our culture and the standards that guide our daily behavior & decisions: Integrity, Courage, and Passion.</span></p></li></ul><p></p><p><b><b>Relocation benefits are not available for this job posting. </b></b></p><p></p><p><span>The expected salary range for this position based on the primary location of California<span> </span>is </span><span>$50.00 hourly. Actual pay will be determined based on experience, qualifications, geographic location, and other job-related factors permitted by law. This position also qualifies for paid holiday time off benefits.</span></p><p style="text-align:inherit"></p><p style="text-align:left"><span>Genentech is an equal opportunity employer. It is our policy and practice to employ, promote, and otherwise treat any and all employees and applicants on the basis of merit, qualifications, and competence. The company's policy prohibits unlawful discrimination, including but not limited to, discrimination on the basis of Protected Veteran status, individuals with disabilities status, and consistent with all federal, state, or local laws.</span></p><p style="text-align:inherit"></p><p style="text-align:left"><span>If you have a disability and need an accommodation in relation to the online application process, please contact us by completing this form <a target="_blank" href="https://docs.google.com/forms/d/e/1FAIpQLSdZWlsbfQOvFVIQgHE_iDzWUTlhZvj6FytIzjS7xq6IGh1H5g/viewform">Accommodations for Applicants</a>.</span></p><p style="text-align:inherit"></p><p style="text-align:inherit"></p>
Key Skills
Advanced PythonML FrameworksExperimental ManagementBayesian OptimizationProbabilistic ModelingLLM IntegrationAgentic WorkflowsComputational BiologyGenomic DataResearch RigorDocumentationScientific Communication
Categories
Science & ResearchTechnologyEngineeringData & AnalyticsHealthcare
Benefits
Paid Holiday Time Off
Prep Tools
FREE
YOUR RESUME KNOWS THE QUESTIONS
AI Question Predictor
Based on 2026 Summer Intern - Regev Lab - Bayesian Optimization with LLMs role
Tell me about your experience with Advanced Python
Why do you want to work at Genentech?
Describe a challenging project you've led
FREEYour ScoreTop Applicants
BOOST YOUR INTERVIEW CHANCES
?
»
8.5
Must-Have Skills for This Role
Advanced PythonML FrameworksExperimental ManagementBayesian OptimizationProbabilistic Modeling
FREE
STUCK ON A QUESTION? PRACTICE IT
Practice Any Question
Get instant AI feedback
"How would you design a scalable system for Genentech's use case?"
Record your answer & get scored