Xuchen Song currently serves as a Tech Lead and Staff Research Engineer at TikTok, where he spearheads the Multimodal Team in pioneering advancements in large-scale multimodal learning and innovative applications of Large Language Models (LLMs). His role is pivotal in shaping the future of content recommendation and music generation on the platform, leveraging his extensive expertise in machine learning, image processing, and speech processing. Under his leadership, the team is focused on developing cutting-edge algorithms that enhance user engagement through personalized content delivery, utilizing multimodal data to create a seamless and immersive experience.
One of Xuchen’s key projects involves the integration of multimodal recommendation systems that analyze user interactions across various content types, including video, audio, and text. This initiative not only improves the accuracy of content suggestions but also enriches the overall user experience by tailoring recommendations to individual preferences. Additionally, Xuchen is at the forefront of end-to-end music generation, employing advanced techniques in Natural Language Processing (NLP) and Natural Language Understanding (NLU) to create dynamic soundscapes that resonate with TikTok’s diverse user base.
With a strong foundation in programming languages such as C++ and frameworks such as Keras, PyTorch, and TensorFlow, Xuchen is adept at translating complex theoretical concepts into practical applications. His work is characterized by a commitment to innovation and excellence, and it contributes to the evolving landscape of social media technology. As TikTok continues to redefine digital interaction, Xuchen Song’s leadership and vision are instrumental in driving the platform’s success in multimodal content creation and recommendation.