
Hao-Wen (Herman) Dong
University of Michigan · Performing Arts Technology
Generative AI for Music: Challenges and Opportunities
Generative AI has been transforming the way we interact with technology and consume content. The recent successes of LLM-based chatbots, AI assistants, text-to-image, text-to-video, and text-to-music systems have showcased how AI can augment human creativity and boost human productivity. In the next decade, generative AI technology will also reshape how we create music in the music, film, TV, podcast, and gaming industries across the entertainment, commercial, and education sectors. In the first half of this talk, I will introduce some of our recent work on the various applications of generative AI in music creation, including multitrack music generation, automatic instrumentation, and violin performance synthesis. In the second half, I will discuss the unique challenges of applying, scaling, and deploying generative AI music models in practice. Finally, I will discuss research opportunities towards controllable and interactable generative AI systems for music.
Hao-Wen (Herman) Dong is an Assistant Professor in the Performing Arts Technology Department at the University of Michigan. Herman's research aims to empower music and audio creation with machine learning. His long-term goal is to lower the barrier of entry for music composition and democratize audio content creation. He is broadly interested in music generation, audio synthesis, multimodal machine learning, and music information retrieval. Herman received his PhD in Computer Science from UC San Diego, where he worked with Julian McAuley and Taylor Berg-Kirkpatrick. His research has been recognized by the UCSD CSE Doctoral Award for Excellence in Research, KAUST Rising Stars in AI, UChicago and UCSD Rising Stars in Data Science, ICASSP Rising Stars in Signal Processing, and UCSD GPSA Interdisciplinary Research Award.





