INTERNSHIP DETAILS

Data Engineering Intern

CompanyRefinedScience
LocationRemote
Work ModeRemote
PostedJanuary 26, 2026
Internship Information
Core Responsibilities
Assist in building and maintaining data pipelines for various types of data. Collaborate with teams to support analytics and machine learning workflows.
Internship Type
full time
Company Size
32
Visa Sponsorship
No
Language
English
Working Hours
40 hours
Apply Now →

You'll be redirected to
the company's application page

About The Company
At RefinedScience, we seamlessly integrate top-tier clinical and biological data with expert knowledge to provide unparalleled insights. We maximize patient impact with these unique insights by optimizing clinical trial probability of success and time to actionable results. We work across biopharma and we are a trusted partner in achieving better results, faster – working together to unlock strategic advantage.
About the Role
<p><strong>Data Engineering Intern</strong></p> <p>At RefinedScience, our mission is to advance care by bringing together the best science, data and minds – disease by disease, patient by patient, cell by cell to discover pathways to life beyond disease.&nbsp; &nbsp;</p> <p>WHAT WE ARE LOOKING FOR</p> <p>We are seeking a motivated Data Engineering Intern to join our team. This internship is open to undergraduate and graduate students who are interested in building data infrastructure that supports advanced analytics, data science, and AI-driven insights in healthcare and life sciences.</p> <p>You will work closely with data scientists, bioinformaticians, and engineers to help design, build, and improve data pipelines and platforms that power RefinedScience’s research and analytics initiatives.</p> <p>KEY ACTIVITIES</p> <ul> <li>Assist in building and maintaining data pipelines for ingesting, transforming, and validating clinical, biological, and real-world data</li> <li>Support integration of data from multiple sources (e.g., clinical data, analytics outputs, external datasets)</li> <li>Help develop and optimize ETL/ELT workflows to ensure data quality and reliability</li> <li>Collaborate with data science and bioinformatics teams to support analytics and machine learning workflows</li> <li>Contribute to data modeling, documentation, and best practices for data infrastructure</li> <li>Participate in code reviews, testing, and performance improvements</li> <li>Participate in Quality Reviews and Troubleshooting</li> <li>Communicate progress and findings to cross-functional teams</li> </ul> <p>MUST HAVES</p> <ul> <li>Currently enrolled in a Bachelor’s, Master’s, or Ph.D.<strong> </strong>program in Data Engineering, Computer Science, Data Science, Software Engineering, or a related field</li> <li>Experience with Python and/or SQL through coursework, projects, or internships</li> <li>Basic understanding of data pipelines, databases, and data transformation concepts</li> <li>Familiarity with version control (e.g., Git)</li> <li>Strong analytical thinking and problem-solving skills</li> <li>Ability to learn quickly and work collaboratively in a team environment</li> </ul> <p>NICE TO HAVE</p> <ul> <li>Exposure to cloud platforms (AWS, GCP, or Azure)</li> <li>Familiarity with data tools such as Airflow, dbt, Spark, or similar frameworks</li> <li>Experience working with large or complex datasets</li> <li>Interest in healthcare, life sciences, or applied AI</li> </ul> <p><strong>Duration:</strong>&nbsp; 8 – 10 Weeks</p> <p>WHY YOU’LL LOVE REFINED SCIENCE&nbsp;</p> <p><strong>Team + Values</strong></p> <p>At RefinedScience, we seamlessly integrate top-tier clinical and biological data with expert knowledge to provide unparalleled insights.&nbsp; We maximize patient impact with these unique insights by optimizing clinical trial probability of success and time to actionable results. We work across biopharma and we are a trusted partner in achieving better results, faster – working together to unlock strategic advantage.</p> <p><strong>Our Values</strong></p> <ul> <li>Act with Purpose – We believe in rigor through deliberate and thoughtful actions</li> <li>Be Curious – Curiosity is the spark that ignites innovation and growth</li> <li>Take Ownership – True ownership leads to pride and commitment in the work we do</li> <li>Invest in Relationships – Building strong connections is the foundation for effective collaboration and trust for long term success</li> <li>Embrace Agility – We celebrate agile thinking, resilience, and adaptability</li> </ul> <p>&nbsp;</p> <p>&nbsp;</p>
Key Skills
PythonSQLData PipelinesDatabasesData TransformationVersion ControlAnalytical ThinkingProblem-SolvingCloud PlatformsData ToolsETLELTMachine LearningData ModelingDocumentationCollaboration
Categories
TechnologyHealthcareData & AnalyticsScience & ResearchEngineering