cm0002@piefed.world to Linux@programming.devEnglish · 2 days agoFFmpeg 8.0 merges OpenAI "Whisper Filter" for automatic speech recognition, Vulkan AV1 encoding, & VP9 decodingwww.phoronix.comexternal-linkmessage-square14fedilinkarrow-up161arrow-down15file-textcross-posted to: linux@lemmy.ml
arrow-up156arrow-down1external-linkFFmpeg 8.0 merges OpenAI "Whisper Filter" for automatic speech recognition, Vulkan AV1 encoding, & VP9 decodingwww.phoronix.comcm0002@piefed.world to Linux@programming.devEnglish · 2 days agomessage-square14fedilinkfile-textcross-posted to: linux@lemmy.ml
https://www.phoronix.com/news/FFmpeg-Vulkan-AV1-Encoding https://www.phoronix.com/news/FFmpeg-Lands-Whisper
minus-squarechrisbtoo@lemmy.calinkfedilinkarrow-up18·2 days agoHopefully the speech recognition is better than whatever the fuck most online video platforms use for automatic subtitles at the moment.
minus-squarepirateKaiser@sh.itjust.workslinkfedilinkarrow-up6·2 days agoI’ve built an app with Whisper, the level of ‘hit or miss’ entirely depends on the size of the model and language. Even audio quality is a lesser factor in my experience. So, it depends…
minus-squaredata1701d (He/Him)@startrek.websitelinkfedilinkEnglisharrow-up1·1 day agoI was messing around with HomeAssistant the other day, which uses the same speech recognition engine, and I found it to be decent.
Hopefully the speech recognition is better than whatever the fuck most online video platforms use for automatic subtitles at the moment.
I’ve built an app with Whisper, the level of ‘hit or miss’ entirely depends on the size of the model and language. Even audio quality is a lesser factor in my experience. So, it depends…
I was messing around with HomeAssistant the other day, which uses the same speech recognition engine, and I found it to be decent.