Articolo in rivista, 2023, ENG, 10.1007/s10844-023-00792-2
Lo Scudo, Fabrizio; Ritacco, Ettore; Caroprese, Luciano; Manco, Giuseppe
Univ Calabria; Univ Udine; Univ G dAnnunzio; CNR
In real-world applications, audio surveillance is often performed by large models that can detect many types of anomalies. However, typical approaches are based on centralized solutions characterized by significant issues related to privacy and data transport costs. In addition, the large size of these models prevented a shift to contexts with limited resources, such as edge devices computing. In this work we propose conv-SPAD, a method for convolutional SPectral audio-based Anomaly Detection that takes advantage of common tools for spectral analysis and a simple autoencoder to learn the underlying condition of normality of real scenarios. Using audio data collected from real scenarios and artificially corrupted with anomalous sound events, we test the ability of the proposed model to learn normal conditions and detect anomalous events. It shows performances in line with larger models, often outperforming them. Moreover, the model's small size makes it usable in contexts with limited resources, such as edge devices hardware.
Journal of intelligent information systems
Self-supervised learning, Audio-based anomaly detection, Edge-device computing, Pattern recognition
Manco Giuseppe, Ritacco Ettore
ID: 486218
Year: 2023
Type: Articolo in rivista
Creation: 2023-09-11 16:31:11.000
Last update: 2023-09-11 16:44:21.000
CNR authors
CNR institutes
External IDs
CNR OAI-PMH: oai:it.cnr:prodotti:486218
DOI: 10.1007/s10844-023-00792-2
ISI Web of Science (WOS): 001018064900001
Scopus: 2-s2.0-85163655115