INTERNSHIP DETAILS

Software Engineering Intern, Kernel Optimization (Serbia)

CompanyTenstorrent University Jobs
LocationBelgrade
Work ModeOn Site
PostedApril 1, 2026
Internship Information
Core Responsibilities
Contribute to the design, development, and maintenance of software components connected to Tenstorrent hardware, and implement and optimize kernels for convolution-based operations. Collaborate with machine learning engineers to integrate optimized kernels into machine learning frameworks.
Internship Type
full time
Company Size
1123
Visa Sponsorship
No
Language
English
Working Hours
40 hours
Apply Now →

You'll be redirected to
the company's application page

About The Company
Tenstorrent is a next-generation computing company that builds computers for AI. Headquartered in the U.S. with offices in Austin, Texas, and Silicon Valley, and global offices in Toronto, Belgrade, Seoul, Tokyo, and Bangalore, Tenstorrent brings together experts in the field of computer architecture, ASIC design, RISC-V technology, advanced systems, and neural network compilers. Tenstorrent is backed by Eclipse Ventures and Real Ventures, Archerman Capital, Samsung Catalyst Fund, and Hyundai Motor Group among others. Join us: www.tenstorrent.com/careers.
About the Role

Tenstorrent is leading the industry on cutting-edge AI technology, revolutionizing performance expectations, ease of use, and cost efficiency. With AI redefining the computing paradigm, solutions must evolve to unify innovations in software models, compilers, platforms, networking, and semiconductors. Our diverse team of technologists have developed a high performance RISC-V CPU from scratch, and share a passion for AI and a deep desire to build the best AI platform possible. We value collaboration, curiosity, and a commitment to solving hard problems. We are growing our team and looking for contributors of all seniorities.

At Tenstorrent, we believe the future of computing must be open, which is why our interns don’t just watch from the sidelines - they help build the core of it. We provide a "code-to-career" pipeline where students collaborate with industry experts to solve high-stakes problems in RISC-V and AI hardware-software co-design. By joining us, you are taking an internship to democratize high-performance computers that are accessible to everyone.

During your internship, you will partner with highly skilled engineers, experience many approaches to problem‑solving, and work on a carefully defined engineering project in relation to convolutions and their optimizations for Tenstorrent hardware, guided by a dedicated mentor. We are looking for a minimum of 3 months for this role with the potential for extension to 6 months.

This role is onsite, based in our Belgrade office.

 

Who You Are

  • Enrolled in the final year of BSc or MSc studies in Computer Science, Computer Engineering, Software Engineering, Electronics, Math, or a related field.
  • Comfortable with one or more programming languages, including C++ and Python.
  • You bring a solid foundation in algorithms and data structures and basic familiarity with machine learning frameworks and concepts.
  • You are curious about kernel development, low‑level and tensor optimizations, and performance profiling, and want to deepen your skills in these areas.

 

What We Need

  • Contribute to the design, development, and maintenance of Tenstorrent software components connected to our hardware platform, and implement and optimize kernels for convolution‑based operations.
  • Analyze and optimize low‑level code to improve performance and efficiency with a strong emphasis on tensor optimization, and run performance profiling to identify and address bottlenecks.
  • Collaborate with machine learning engineers and data scientists to integrate optimized kernels and low‑level routines into machine learning frameworks and pipelines.
  • Write unit tests, debug kernel‑level code for stability and reliability, and present your project approach, accomplishments, and findings to the team at the end of your internship.

 

What You Will Learn

  • How to take a well‑scoped engineering project related to convolutions and their optimizations on Tenstorrent hardware from idea to implementation with guidance from a dedicated mentor.
  • Hands‑on experience building, optimizing, and profiling kernels for convolution‑based operations and tuning low‑level code for performance.
  • Practical skills integrating optimized kernels and low‑level routines into machine learning frameworks and pipelines, and hardening them through testing and debugging.
  • How to communicate technical work clearly by sharing your observations, approach, and results with the broader team.

 

Hiring Timelines

This internship opportunity is available throughout our 3 terms with the following corresponding recruitment cycles:

  • Winter Term: Mar–May work term, Nov–Jan recruit.
  • Summer Term: Aug–Oct work term, Jan–May recruit.
  • Fall Term: Oct–Dec work term, Apr–May recruit.

Please note these timelines are for reference only. Actual timelines may vary.

 

Tenstorrent offers a highly competitive compensation package and benefits, and we are an equal opportunity employer.




This offer of employment is contingent upon the applicant being eligible to access U.S. export-controlled technology. Due to U.S. export laws, including those codified in the U.S. Export Administration Regulations (EAR), the Company is required to ensure compliance with these laws when transferring technology to nationals of certain countries (such as EAR Country Groups D:1, E1, and E2). These requirements apply to persons located in the U.S. and all countries outside the U.S. As the position offered will have direct and/or indirect access to information, systems, or technologies subject to these laws, the offer may be contingent upon your citizenship/permanent residency status or ability to obtain prior license approval from the U.S. Commerce Department or applicable federal agency. If employment is not possible due to U.S. export laws, any offer of employment will be rescinded.

Key Skills
C++PythonAlgorithmsData StructuresMachine LearningKernel DevelopmentTensor OptimizationPerformance Profiling
Categories
TechnologyEngineeringSoftware