Skip to content

WIP chore: audio recognition (opus) #79

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Closed

Conversation

marinakr
Copy link

draft for an audio recognition (does not work yet, prototype)
TO DO: fix rooms.js, room.ex

@marinakr marinakr force-pushed the audio_recognition_opus branch from 3253083 to 870ad40 Compare March 30, 2025 12:30
@marinakr marinakr closed this Mar 30, 2025
@marinakr marinakr deleted the audio_recognition_opus branch March 30, 2025 12:32
@mickel8
Copy link
Member

mickel8 commented Mar 30, 2025

Hi @marinakr, that would be super cool! Why closing?

@marinakr
Copy link
Author

Hi @mickel8! I accidentally created the PR to main branch instead of my fork
It was not ready yet at, draft with few debug messages
The current issue that audio stream separated from video stream is not working (predictions is nonsense) and I am trying to get separate audio stream from room.js
Current implementation is not working, data sent from rooms.js (pushed to channel) produces {:error, :no_keyframe} on decode
It probably should be attached to RTCPeerConnection
But simple attach audiotrack does not send audio data where you expect it in

  {:ex_webrtc, _pc, {:rtp, track_id, nil, packet}},
        %{audio_track: %{id: track_id}} = state

@mickel8
Copy link
Member

mickel8 commented Mar 31, 2025

I think you should focus on making sure that audio packets are actually received on the backend side. It looks like you don't add the audio track to the peer connection?

@marinakr
Copy link
Author

marinakr commented Mar 31, 2025

I have managed to get the audio (actually I already did it in not committed, we could use audio from video, but results was not good so I decided to get audio separately)
In my branch I have the audio, the current issue is I have nonsense predictions (asked this question in elixir forum)
Everything seems to be working but text prediction have low quality

UPD:
fixed predictions, will do PR for audio ~next weekend

@marinakr
Copy link
Author

marinakr commented Apr 4, 2025

Updated PR: #81

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants