How often are new jobs added?

New internships are added continuously throughout the day. We monitor company career pages and hiring tools in real time, so most roles appear on InternshipsHQ shortly after they’re posted.

Why can I only view a limited number per day on the free plan?

The free plan is designed to let you explore the platform and see how fresh the listings are. Limits help us prioritize serious applicants and keep the signal high. Pro removes these limits and gives you full access.

What sources do you pull internships from?

We source internships directly from company career pages, startup hiring platforms, and applicant tracking systems like Greenhouse, Lever, Ashby, Workable, and others. This helps us surface roles earlier and avoid repost-heavy job boards.

Are these internships legitimate?

Yes. We actively filter out expired, duplicate, misleading, and low-quality listings. Each role is checked for legitimacy and posting freshness so you’re not wasting time on dead or fake opportunities.

Can I filter by location, experience level, remote, or salary?

Yes. You can filter internships by role, location, experience level, remote or hybrid status, and pay when available. Pro users get access to more advanced and saved filters.

Why are early alerts so important?

For internships, timing matters more than volume. Most callbacks happen when roles are still new and applicant pools are small. Early alerts help you apply before listings get flooded.

Do you show salary or pay information?

When companies include pay information, we display it clearly. Not all internships list salary upfront, but we prioritize transparency whenever the data is available.

Is InternshipsHQ suitable for senior or experienced roles?

InternshipsHQ is primarily built for internships, entry-level, and early-career roles. If you’re looking for senior or leadership positions, you may find limited results here.

INTERNSHIP DETAILS

Research Intern, Agent RL Training

CompanyNewsBreak

LocationMountain View

Work ModeOn Site

PostedMay 27, 2026

Internship Information

Core Responsibilities

The intern will collaborate with a mentor to apply LLMs to core business functions like content understanding and autonomous task completion. Responsibilities include running end-to-end SFT experiments, designing rewards for RL, and curating high-quality training datasets.

Internship Type

full time

Salary Range

$35 - $50

Company Size

333

Visa Sponsorship

Language

English

Working Hours

40 hours

Apply Now →

You'll be redirected to
the company's application page

About The Company

NewsBreak is the leading platform for local news and information, with more than 40 million users across America. By using new technology, NewsBreak provides community-focused news and information from over 10,000 sources in a timely and accessible way. NewsBreak is bridging the gap between new technology and traditional local media, offering an innovative digital solution that allows users to get the information they need to live safer, more vibrant, and connected lives. Based in Mountain View, California, NewsBreak connects users with local information, national publishers, and targeted advertising from local businesses, with increased traffic and revenue that helps strengthen local communities. We are always looking for great talents. Contact us via careers@newsbreak.com Our website: https://www.newsbreak.com/about Creator Program: https://www.newsbreak.com/creators Publishing Platform: https://mp.newsbreakapp.com Android App Download: https://play.google.com/store/apps/details?id=com.particlenews.newsbreak&hl=en iOS App Download: https://itunes.apple.com/us/app/news-break-personal-local/id1132762804?mt=8

About the Role

About NewsBreak

Founded in 2015, NewsBreak is the Content Intelligence platform shaping the future content economy. With over 40 million monthly active users, our flagship platform delivers highly personalized local news and information powered by advanced AI, recommendation systems, and adtech.

Recognized by Fast Company as #32 on the Top Workplaces for Innovators, we're proud to be Great Place to Work® certified and home to a dynamic team of technologists, product innovators, and business leaders who are passionate about solving meaningful challenges at scale.

Together, we reached unicorn status in 2021, and we remain committed to continuing this high-growth trajectory with the right team to fulfill our mission: building the infrastructure layer for content intelligence.

If you’re inspired to dream big, innovate fast, and make a difference, we’d love to hear from you! For more information, visit www.newsbreak.com/about

About the Role

We are looking for a Research Intern to join our Agent RL Training team. You will be paired with a full-time employee as your mentor, working together to explore, from zero to one, how to apply large language models to NewsBreak’s core business, including content understanding, recommendation, agentic web browsing, and autonomous multi-step task completion.

This is a hands-on research role. You are expected to independently drive experiments, propose novel ideas, and iterate quickly. We value self-starters with deep intellectual curiosity and the drive to push boundaries in LLM post-training and agent capabilities.

Location: Onsite in Mountain View, CA office

What You’ll Work On

Collaborate with your full-time mentor to identify high-impact research directions for applying LLMs to NewsBreak’s products
Independently run end-to-end SFT experiments on LLM-based agents, and assist with RL-related exploration such as reward design and training iteration
Curate and build high-quality training datasets: instruction-following, preference pairs, agent trajectories, and synthetic data
Contribute to public publications; we encourage and support top-venue submissions during your internship

What We’re Looking For

Requirements

Highly motivated and committed: willing to put in extra hours when needed to push projects across the finish line
Genuine passion for research: you read papers for fun, tinker with models on weekends, and care deeply about advancing the field
Independently capable of end-to-end model SFT: with basic understanding of RL-based post-training methods (RLHF, DPO, PPO, GRPO, etc.)
Excellent taste in model behavior: able to reason about what “good” looks like across user-facing domains and articulate why
Strong Python and PyTorch skills

Preferred Qualifications

Publication at a top-tier venue (NeurIPS, ICML, ICLR, ACL, EMNLP, or equivalent)
Experience with multi-node distributed training (FSDP, DeepSpeed, Megatron-LM)
Proficiency in writing custom GPU kernels with Triton or CUDA
Experience building synthetic data pipelines for agent training
Familiarity with open-source RL frameworks: TRL, OpenRLHF, veRL/vLLM

Hourly Pay: $35- $50

The US base salary range for this full-time position is listed below. Pay may vary based on a number of factors including job-related skills, level, experience, geographic location and relevant education or training. At NewsBreak, we design our overall rewards package to attract top talents. Depending on the position, the role may also be eligible for discretionary bonus and options. Your recruiter can share more details during the hiring process.

Annual Base Pay Range

$35—$50 USD

CPRA Privacy Notice for California Candidates

Key Skills

PythonPyTorchSFTRLHFDPOPPOGRPODistributed TrainingFSDPDeepSpeedMegatron-LMTritonCUDASynthetic Data PipelinesTRLOpenRLHF