WIP chore: audio recognition (opus) #79

marinakr · 2025-03-30T12:28:41Z

draft for an audio recognition (does not work yet, prototype)
TO DO: fix rooms.js, room.ex

mickel8 · 2025-03-30T12:34:29Z

Hi @marinakr, that would be super cool! Why closing?

marinakr · 2025-03-30T12:43:29Z

Hi @mickel8! I accidentally created the PR to main branch instead of my fork
It was not ready yet at, draft with few debug messages
The current issue that audio stream separated from video stream is not working (predictions is nonsense) and I am trying to get separate audio stream from room.js
Current implementation is not working, data sent from rooms.js (pushed to channel) produces {:error, :no_keyframe} on decode
It probably should be attached to RTCPeerConnection
But simple attach audiotrack does not send audio data where you expect it in

  {:ex_webrtc, _pc, {:rtp, track_id, nil, packet}},
        %{audio_track: %{id: track_id}} = state

mickel8 · 2025-03-31T08:06:45Z

I think you should focus on making sure that audio packets are actually received on the backend side. It looks like you don't add the audio track to the peer connection?

marinakr · 2025-03-31T09:07:31Z

I have managed to get the audio (actually I already did it in not committed, we could use audio from video, but results was not good so I decided to get audio separately)
In my branch I have the audio, the current issue is I have nonsense predictions (asked this question in elixir forum)
Everything seems to be working but text prediction have low quality

UPD:
fixed predictions, will do PR for audio ~next weekend

marinakr · 2025-04-04T08:11:34Z

Updated PR: #81

init: added audio decoder (doesn't work, missing key frame)

870ad40

marinakr force-pushed the audio_recognition_opus branch from 3253083 to 870ad40 Compare March 30, 2025 12:30

marinakr closed this Mar 30, 2025

marinakr deleted the audio_recognition_opus branch March 30, 2025 12:32

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

WIP chore: audio recognition (opus) #79

WIP chore: audio recognition (opus) #79

Uh oh!

marinakr commented Mar 30, 2025

Uh oh!

mickel8 commented Mar 30, 2025

Uh oh!

marinakr commented Mar 30, 2025

Uh oh!

mickel8 commented Mar 31, 2025

Uh oh!

marinakr commented Mar 31, 2025 •

edited

Loading

Uh oh!

marinakr commented Apr 4, 2025

Uh oh!

Uh oh!

WIP chore: audio recognition (opus) #79

WIP chore: audio recognition (opus) #79

Uh oh!

Conversation

marinakr commented Mar 30, 2025

Uh oh!

mickel8 commented Mar 30, 2025

Uh oh!

marinakr commented Mar 30, 2025

Uh oh!

mickel8 commented Mar 31, 2025

Uh oh!

marinakr commented Mar 31, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

marinakr commented Apr 4, 2025

Uh oh!

Uh oh!

marinakr commented Mar 31, 2025 •

edited

Loading