INTERNSHIP DETAILS

ZI-230 / Master Thesis Vision Language Action Models with Memory

CompanyBMW Group
LocationMunich
Work ModeOn Site
PostedMay 1, 2026
Internship Information
Core Responsibilities
The role involves exploring how memory in Vision Language Action Models (VLAs) can be utilized for reasoning and task execution in automated driving, based on related work in LLM/VLM/VLA memory usage. Responsibilities include exploring different memory approaches, evaluating learning and non-learning based methods, developing novel techniques, and deploying them in prototypes for real-world testing.
Internship Type
full time
Company Size
65671
Visa Sponsorship
No
Language
English
Working Hours
40 hours
Apply Now →

You'll be redirected to
the company's application page

About The Company
With its four brands, BMW, MINI, Rolls-Royce and BMW Motorrad, the BMW Group is the world’s leading premium manufacturer of automobiles and motorcycles and also provides premium financial services. The BMW Group production network comprises over 30 production sites worldwide; the company has a global sales network in more than 140 countries. In 2025, the BMW Group sold 2.46 million passenger vehicles and more than 202,500 motorcycles worldwide. The profit before tax in the financial year 2025 was € 10.2 billion on revenues amounting to € 133,5 billion. As of 31 December 2025, the BMW Group had a workforce of 154,540 employees. The economic success of the BMW Group has always been based on long-term thinking and respon-sible action. Sustainability is a key element of the BMW Group’s corporate strategy and covers all products – from the supply chain through production to the end of their useful life.
About the Role

THEORETISCH DIE BESTE ENTSCHEIDUNG. PRAKTISCH AUCH.

TEILE DEINE LEIDENSCHAFT.

Nur hochprofessionelle Abläufe in dynamischen Teams produzieren innovative Spitzentechnologie. Aber Fahrfreude wird bei uns von der Entwicklung bis zur Fertigung vor allem auch mit Spaß an der Arbeit und Begeisterung für das gemeinsame Projekt realisiert. Deshalb geben wir Studierenden bei uns nicht nur die Gelegenheit zum Zuhören, sondern auch zum Mitreden und Weiterdenken.

We, the BMW Group, offer you an interesting and varied master's thesis within the area of memory for VLAs in automated driving. Based on related work on memory usage in LLM/VLMs and VLAs, you will explore how these models can be utilized for reasoning and task execution.

 

What awaits you?

  • Close collaboration in a team within the BMW research department.     
  • Explore different memory in VLM and VLA approaches for automated driving applications.
  • Evaluate several learning and non-learning based memory approaches in automated driving tasks. 
  • Work on state-of-the-art techniques and develop novel approaches.
  • Deploy them in our prototypes for real world testing.  

 

Please note that your thesis must be supervised by a university on your part.  

 

What should you bring along? 

  • Knowledge in machine learning and computer vision.   
  • Knowledge in Vision Language Models(VLM) and/or Vision Language Action Models(VLA).   
  • Experience in memory for language models preferred.
  • Proficiency with Python and deep learning frameworks (PyTorch or TensorFlow).   

 

What do we offer?

  • Comprehensive mentoring & onboarding.
  • Personal & professional development.
  • Flexible working hours.
  • Digital offers & mobile working.
  • Attractive remuneration.
  • Apartment offers for students (subject to availability & only Munich).
  • And many other benefits - see bmw.jobs/benefits

 

You are enthused by new technologies and an innovative environment? Apply now!

 

At the BMW Group, we see diversity and inclusion in all its dimensions as a strength for our teams. Equal opportunities are a particular concern for us, and the equal treatment of applicants and employees is a fundamental principle of our corporate policy. That is why our recruiting decisions are also based on personality, experience and skills.

Find out more about diversity at the BMW Group at bmwgroup.jobs/diversity

Key Skills
Machine LearningComputer VisionVision Language ModelsVision Language Action ModelsMemory for Language ModelsPythonPyTorchTensorFlowReasoningTask Execution
Categories
Science & ResearchEngineeringSoftwareData & AnalyticsTechnology
Benefits
Comprehensive mentoringOnboardingPersonal developmentProfessional developmentFlexible working hoursDigital offersMobile workingAttractive remunerationApartment offers for students