Dr. Dong Yu is an IEEE Fellow, an ISCA Fellow, and an ACM Distinguished Scientist. He currently serves as a distinguished scientist and vice general manager at Tencent AI Lab. Before joining Tencent America in 2017, he was a principal researcher in the speech and dialog research group at Microsoft Research (Redmond), which he joined in 1998. He holds a Ph.D. in computer science from the University of Idaho, an MS in computer science from Indiana University at Bloomington, an MS (with honors) in pattern recognition and intelligent control from the Chinese Academy of Sciences, and a BS (with honors) in electrical engineering from Zhejiang University.
I currently serve as a distinguished scientist and vice general manager at Tencent AI Lab. Before joining Tencent America in 2017, I was a principal researcher in the speech and dialog research group at Microsoft Research (Redmond), which I joined in 1998.
My research has focused on speech processing and recognition, multi-modal interactive systems, and, more recently, natural language processing. My work has been recognized with IEEE Signal Processing Society best paper awards in 2013, 2016, and 2020. I was honored to be elevated to IEEE Fellow and ISCA Fellow in recognition of my contributions to deep-learning-based speech recognition and processing.
I have served the IEEE community over the years as an associate editor, conference organizer, local section chair, and technical committee member. I am currently serving as the chair of the speech and language processing technical committee and as the technical program co-chair of ICASSP 2021.
There were many challenges along the way. The biggest was a long period during which the field of speech recognition advanced very slowly, despite the significant effort that many researchers and practitioners devoted to it. There were pessimistic forecasts, and there was the temptation to work on different problems where we might make a bigger impact or get a better financial return. Fortunately, challenges also come with opportunities, and we finally made a breakthrough.
There are several important factors. For example, I was lucky to collaborate with many talented researchers and interns during my career. The supportive research environments at Microsoft Research and Tencent AI Lab allowed us to attack hard problems and to pursue a bigger impact, even though this could mean slow or no progress for some period of time. My research methodology also contributed to my success. I have tried very hard to innovate through a better understanding of the problems and tools, and through a better balance between exploitation and exploration.
My work significantly improved the recognition accuracy of automatic speech recognition (ASR) systems and caused a paradigm shift in both academia and industry. Nowadays many users feel that the accuracy of ASR systems has surpassed the threshold for adoption. This improvement has significantly enhanced their experience with applications such as dictation and digital assistants.
Many innovations happen when you break constraints, which may be imposed by your way of thinking, the angle from which you look at things, the available tools and devices, or the existing architecture.
Failures are not only inevitable but also a normal part of research life. Over the years, we tried many innovative techniques, and the majority of them did not lead to the progress we had hoped for. However, our final success was built upon those failures, as we learn no less, if not more, from failures than from successes.
It is very common for researchers, scientists, or engineers to be skeptical of new ideas, especially if the idea is truly novel or goes against their existing beliefs and philosophy. This is very healthy for scientific advancement. Our work on deep-learning-based speech recognition went through a similar debate. We gradually convinced others by demonstrating the superiority of our new technique on challenging benchmarks, by helping others to reproduce our results, and by continuously improving the techniques. I think that as long as the results and evidence are true and firm, novel techniques or paradigms will sooner or later be accepted and further investigated by the community.