SPS SLTC/AASP TC Webinar: End-to-End Automatic Speech Recognition

Top Reasons to Join SPS Today!

1. IEEE Signal Processing Magazine
2. Signal Processing Digital Library*
3. Inside Signal Processing Newsletter
4. SPS Resource Center
5. Career advancement & recognition
6. Discounts on conferences and publications
7. Professional networking
8. Communities for students, young professionals, and women
9. Volunteer opportunities
10. Coming soon! PDH/CEU credits
Click here to learn more.

webinar_general_dsi.jpg

May

10

Webinars

SPS SLTC/AASP TC Webinar: End-to-End Automatic Speech Recognition

Date: 10 May 2024
Time: 1:00 PM ET (New York Time)
Presenter(s): Dr. Jinyu Li

Abstract

The field of automatic speech recognition (ASR) is now dominated by the end-to-end (E2E) models that directly map speech to text. In this talk, the presenter will give an overview of the E2E ASR models and introduce the recent progress from an industry perspective. To design an E2E model that has high accuracy and low latency, a masking strategy was applied to Transformer Transducer. He will discuss technologies that can use text-only data for general model training through pretraining and adaptation to a new domain through augmentation and factorization. Our presenter will also discuss how to build multilingual ASR models to serve all the users. Then, he will extend E2E modeling for streaming multi-speaker ASR and finally ending the talk with some new research opportunities he can explore.

Biography

Melissa Handa Jinyu Li (M’08, SM’21) received the B.E. and M.E. degrees in electrical engineering and information system from University of Science and Technology of China, Hefei, China, in 1997 and 2000, respectively. He received the Ph.D. degree in electrical and computer engineering from Georgia Institute of Technology, Atlanta, GA, USA in 2008.

He currently serves as a Partner Applied Science Manager for Microsoft, Redmond, WA, USA since 2008 and leads a dynamic team dedicated to designing and enhancing speech modeling algorithms and technologies. Their aim is to ensure that Microsoft products maintain cutting-edge quality within the industry. From 2000 to 2003, he was a Researcher in the Intel China Research Center and Research Manager in iFlytek, China. His diverse research areas include end-to-end modeling for speech recognition and speech translation, deep learning, acoustic modeling, and noise robustness.

Dr. Li has been a member of IEEE Speech and Language Processing Technical Committee since 2017. He also served as the associate editor of IEEE/ACM Transactions on Audio, Speech and Language Processing from 2015 to 2020. He was awarded as the Industrial Distinguished Leader at Asia-Pacific Signal and Information Processing Association (APSIPA) in 2021 and APSIPA Sadaoki Furui Prize Paper Award in 2023.

Website Link:

Register

Tags:

SPS SLTC/AASP Webinar

SPS on Twitter

DEADLINE EXTENDED: The 2023 IEEE International Workshop on Machine Learning for Signal Processing is now accepting… https://t.co/NLH2u19a3y
ONE MONTH OUT! We are celebrating the inaugural SPS Day on 2 June, honoring the date the Society was established in… https://t.co/V6Z3wKGK1O
The new SPS Scholarship Program welcomes applications from students interested in pursuing signal processing educat… https://t.co/0aYPMDSWDj
CALL FOR PAPERS: The IEEE Journal of Selected Topics in Signal Processing is now seeking submissions for a Special… https://t.co/NPCGrSjQbh
Test your knowledge of signal processing history with our April trivia! Our 75th anniversary celebration continues:… https://t.co/4xal7voFER

IEEE SPS Educational Resources

IEEE SPS Resource Center

IEEE SPS YouTube Channel

© Copyright 2024 IEEE – All rights reserved. Use of this website signifies your agreement to the IEEE Terms and Conditions.
A not-for-profit organization, IEEE is the world's largest technical professional organization dedicated to advancing technology for the benefit of humanity.

abstract_general_5.jpg

Learning From the Hidden Letters

congrats_celebrate_general.jpg

An Exciting Juncture for Signal Processing Research: On Building Bridges, Challenges, and Opportunities

newsletter_general.jpg

Statistical Principles of Time Reversal

What is Signal Processing?

Popular Pages

Today's:

All time:

Last viewed:

SPS SLTC/AASP TC Webinar: End-to-End Automatic Speech Recognition

Conferences & Events

Top Reasons to Join SPS Today!

webinar_general_dsi.jpg

May

10

SPS SLTC/AASP TC Webinar: End-to-End Automatic Speech Recognition

Abstract

Biography

Event Types

Events

ISBI_2025.jpg

Farhan_Baqai.jpg

Farhan_Baqai.jpg

world_general.jpg

icip_2024.jpg

ISBI_2024.jpg

SPS on Twitter

IEEE SPS Educational Resources

Learning From the Hidden Letters

An Exciting Juncture for Signal Processing Research: On Building Bridges, Challenges, and Opportunities

Statistical Principles of Time Reversal

What is Signal Processing?

Popular Pages

Today's:

All time:

Last viewed:

SPS SLTC/AASP TC Webinar: End-to-End Automatic Speech Recognition

Search form

You are here

Conferences & Events

Top Reasons to Join SPS Today!

May

10

Abstract

Biography

Event Types

Events

SPS on Twitter

IEEE SPS Educational Resources