Shimmer is a classical acoustic measure of the amplitude perturbation of a signal. This kind of variation in the human voice helps characterize properties not only of the voice itself but also of the speaker. In recent years, deep learning techniques have become the state of the art for voice recognition tasks. In this work, the relationship between shimmer and deep neural networks is analyzed.
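As an illustration (not taken from the paper), local shimmer is commonly computed as the mean absolute difference between consecutive cycle peak amplitudes, normalized by the mean peak amplitude. A minimal sketch, assuming one peak amplitude per glottal cycle is already available:

```python
import numpy as np

def local_shimmer(peak_amplitudes):
    """Local shimmer: mean absolute difference between consecutive
    cycle peak amplitudes, divided by the mean peak amplitude.

    `peak_amplitudes` is assumed to hold one positive amplitude
    value per glottal cycle (extraction of these peaks from the
    raw waveform is a separate step, not shown here).
    """
    a = np.asarray(peak_amplitudes, dtype=float)
    return np.abs(np.diff(a)).mean() / a.mean()
```

A perfectly steady amplitude yields a shimmer of 0, while cycle-to-cycle fluctuation raises it; the value is often reported as a percentage.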
A deep learning model is created that approximates the shimmer value of a simple synthesized audio signal (stationary and without formants), taking the spectrogram as its input feature. It is concluded, firstly, that for this kind of synthesized signal a neural network such as the one proposed can approximate shimmer, and secondly, that the convolutional layers can be designed to preserve shimmer information and transmit it to the following layers.
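The kind of stationary, formant-free signal described above can be sketched as follows. This is a hypothetical generator (the function name and its defaults are assumptions, not the paper's actual synthesis procedure): a pure sine whose per-cycle peak amplitude alternates around 1.0, so the local shimmer of the result is known by construction and can serve as the regression target for the network.

```python
import numpy as np

def synth_shimmered_sine(f0=100.0, sr=16000, n_cycles=50, shimmer_target=0.05):
    """Hypothetical sketch: a stationary sine (no formants) whose per-cycle
    amplitude alternates between 1 + shimmer_target/2 and 1 - shimmer_target/2,
    so consecutive peak differences are all shimmer_target and the mean
    amplitude is 1.0, giving a local shimmer of exactly shimmer_target.
    Returns the waveform and the per-cycle amplitudes.
    """
    samples_per_cycle = int(sr / f0)
    t = np.arange(samples_per_cycle) / sr
    cycle = np.sin(2 * np.pi * f0 * t)          # one period of the sine
    amps = 1.0 + shimmer_target * np.where(
        np.arange(n_cycles) % 2 == 0, 0.5, -0.5  # alternate +/- half the target
    )
    signal = np.concatenate([a * cycle for a in amps])
    return signal, amps
```

A spectrogram of such a signal (e.g. via a short-time Fourier transform) would then be the input feature fed to the convolutional layers.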