Saliency-based modeling of acoustic scenes using sparse non-negative matrix factorization - Université Pierre et Marie Curie
Conference paper, year: 2013

Saliency-based modeling of acoustic scenes using sparse non-negative matrix factorization

Abstract

Modelling auditory scenes is a challenging task in Computational Auditory Scene Analysis. This paper proposes a method based on sparse Non-negative Matrix Factorization that establishes the similarity between scenes without any prior knowledge of the audio content. The method is evaluated on a corpus of train-station soundscapes drawn from a perceptual study, and the results are compared with human perception. Because it can focus on salient events within the scene, the proposed method outperforms a state-of-the-art Bag-of-Frames approach, although it does not reach human performance.
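As a rough illustration of the sparse Non-negative Matrix Factorization mentioned in the abstract, the sketch below factorizes a non-negative matrix V (for example a magnitude spectrogram, features by frames) into a dictionary W and activations H, with an L1 penalty that encourages sparse activations. It is a generic textbook-style formulation with multiplicative updates and is not taken from the paper: the function name `sparse_nmf`, the parameters `rank`, `sparsity`, `n_iter`, the Euclidean cost, and the column rescaling heuristic are all assumptions made for illustration.

```python
import numpy as np

def sparse_nmf(V, rank, sparsity=0.1, n_iter=200, eps=1e-9, seed=0):
    """Approximate a non-negative matrix V (features x frames) as W @ H.

    Hypothetical helper: Euclidean cost plus an L1 penalty on H,
    minimized with multiplicative updates. Not the authors' implementation.
    """
    rng = np.random.default_rng(seed)
    n_features, n_frames = V.shape
    # Strictly positive random initialization keeps the updates well defined.
    W = rng.random((n_features, rank)) + eps
    H = rng.random((rank, n_frames)) + eps
    for _ in range(n_iter):
        # Activation update: the sparsity term in the denominator
        # shrinks activations toward zero.
        H *= (W.T @ V) / (W.T @ W @ H + sparsity + eps)
        # Dictionary update for the Euclidean reconstruction cost.
        W *= (V @ H.T) / (W @ (H @ H.T) + eps)
        # Heuristic rescaling: unit-norm dictionary columns, with H scaled
        # so that the product W @ H is unchanged.
        norms = np.linalg.norm(W, axis=0) + eps
        W /= norms
        H *= norms[:, None]
    return W, H

# Usage on synthetic data (assumed, not from the paper's corpus).
rng = np.random.default_rng(1)
V_demo = rng.random((20, 3)) @ rng.random((3, 50))
W_demo, H_demo = sparse_nmf(V_demo, rank=3)
relative_error = np.linalg.norm(V_demo - W_demo @ H_demo) / np.linalg.norm(V_demo)
```

In such a setup, a larger `sparsity` value trades reconstruction accuracy for fewer active components per frame, which is one plausible way to emphasize a small number of prominent events; how the paper derives scene similarity from the factorization is not stated in the abstract and is not sketched here.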

Domains

Sound [cs.SD]
Main file
Cauchi13-SparseSliencyNMG.pdf (396.88 KB)
Origin: Publisher files allowed on an open archive

Dates and versions

hal-00940075, version 1 (31-01-2014)

Identifiers

  • HAL Id: hal-00940075, version 1

Cite

Benjamin Cauchi, Mathieu Lagrange, Nicolas Misdariis, Arshia Cont. Saliency-based modeling of acoustic scenes using sparse non-negative matrix factorization. Workshop on Image and Audio Analysis for Multimedia Interactive, Jul 2013, Paris, France. ⟨hal-00940075⟩
226 views
269 downloads