Research Internship-Multimodal LLM - Speech/Music/Audio/Vision/Language 🎓

Description

Tencent AI Lab is dedicated to advancing cutting-edge AI technologies, with a particular focus on innovative breakthroughs in large foundation models. The lab's long-term ambition is to drive the development of Artificial General Intelligence (AGI), and ultimately, Artificial Superintelligence (ASI). We are seeking research interns who are interested in developing novel speech/music/audio/vision/language processing techniques and large multimodal models for our Seattle area office located at Bellevue WA for the year 2026.

Every research intern will work with researchers on a research project aimed at attacking one of the core problems by inventing cutting edge techniques. We encourage discussions and collaborations between researchers and interns. Interns are also encouraged to publish the results from the internship. Our projects span a wide range of areas, including developing more effective multimodal pretraining and post-training strategies for audio, speech, music, image, and video understanding and generation. We aim to enable fully duplex conversations, design more efficient large-model architectures, enhance multimodal memory and reasoning capabilities, and advance novel audio, speech, music, image, and video processing techniques—such as encoding, tokenization, and representation learning—with a focus on multimodal applications and end-to-end large models.

Details

Location
Bellevue, WA
Term
Summer 2026
Posted
1/22/2026

Other Internships at Tencent

See All →

Research Intern 🎓

Tencent

Bellevue, WASummer 2026
View internship details

Research Internship - Agent 🎓

Tencent

Bellevue, WASummer 2026
View internship details

Research Internship - Agent 🎓

Tencent

Bellevue, WASummer 2026
View internship details

Research Internship-Multimodal LLM - Speech/Music/Audio/Vision/Language 🎓

Tencent

Bellevue, WASummer 2026
View internship details