paper

All-for-One and One-For-All: Deep learning-based feature fusion for Synthetic Speech Detection

Daniele Mari, Davide Salvi, Paolo Bestagini, Simone Milani

All-for-One and One-For-All: Deep learning-based feature fusion for Synthetic Speech Detection

Name: All-for-One and One-For-All: Deep learning-based feature fusion for Synthetic Speech Detection
Author: Daniele Mari, Davide Salvi, Paolo Bestagini, Simone Milani

Daniele Mari, Davide Salvi, Paolo Bestagini, Simone Milani

paper2023-07-28English

Start Reading

deep learning portfolioarxiv

Description

Recent advances in deep learning and computer vision have made the synthesis and counterfeiting of multimedia content more accessible than ever, leading to possible threats and dangers from malicious users. In the audio field, we are witnessing the growth of speech deepfake generation techniques, which solicit the development of synthetic speech detection algorithms to counter possible mischievous uses such as frauds or identity thefts. In this paper, we consider three different feature sets pro...