39th IBIMA Computer Science Conference 30-31 May 2022

U.S.A. ISBN: 978-0-9998551-9-5
U.S.A. Library of Congress ISSN: 2767-9640

Overview of Deep Learning Voice Conversion Methods using Disentangling Speaker from Linguistic Content

Tomasz WALCZYNA and Zbigniew PIOTROWSKI

Abstract:

In voice conversion, speaker identity is the attribute of an utterance that is swapped for another person's identity while the linguistic content of the utterance is kept unchanged. Voice conversion algorithms combine several speech processing techniques, such as utterance analysis, speaker classifiers, and vocoders. This paper presents an overview of state-of-the-art deep learning voice conversion methods that disentangle speaker identity from linguistic content.