Python Librosa - Search News

edgar-montano/audio-analysis

A comprehensive Python tool for audio analysis and feature extraction, designed for ML/AI audio processing workflows. Built with librosa and supporting multiple output formats, this tool provides ...

IEEE

A Multimodal Deep Learning Framework for Depression Detection Using Vision Transformers and Large Language Models

Abstract: This study proposes a novel multimodal deep learning framework for depression detection, integrating visual, audio, and textual data. Using OpenFace and Librosa for feature extraction, the ...

IEEE

Securevision-Pro: A Real-Time AI Surveillance Suite for Violence, Hazard, and Safety Detection with Audio-Visual Intelligence

Abstract: SecureVision-Pro is a multimodal surveillance system that integrates visual intelligence using YOLOv11 then acoustic analytics, implemented using Librosa, to detect violence, fire, smoke, ...

GitHub

markjosims/librispeech-asr-errors

Word error rate (WER) is the standard metric of evaluation for Automatic Speech Recognition (ASR) models. WER can be understood as the ratio of the number of edits ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results