Sentimental analysis using audio and video has become an essential technique for understanding public opinion on online platforms. Unlike text-based methods, audio and video provide natural expressions such as tone, pitch, facial reactions, and behavioral cues, which help in identifying the true emotional state of a person. This work focuses on a dual-modality sentiment analysis system where audio is processed using speech-to-text conversion and linguistic feature extraction, while video frames are analyzed to detect facial expressions and behavioral cues. The results from both modalities are combined to produce a more accurate sentiment classification. This approach improves reliability, reduces noise-based errors, and provides a more realistic sentiment outcome for social media reviews and real-time interactions.
IRE Journals:
Pooja B R, Prajwal Gowda N, R M Guruprasad, Tabreek Malk, Abdul Rehaman "Sentimental Analysis using Text, Audio, Video Data" Iconic Research And Engineering Journals Volume 9 Issue 6 2025 Page 55-57 https://doi.org/10.64388/IREV9I6-1712512
IEEE:
Pooja B R, Prajwal Gowda N, R M Guruprasad, Tabreek Malk, Abdul Rehaman
"Sentimental Analysis using Text, Audio, Video Data" Iconic Research And Engineering Journals, 9(6) https://doi.org/10.64388/IREV9I6-1712512