
Android real-time transcription #4

Open
salehsoleimani opened this issue Feb 22, 2025 · 10 comments

@salehsoleimani

Hey! Thanks for the great repo.
Is there any chance of running Whisper in real time on mobile devices? According to your docs it needs 1.5 GB, which is too much for mobile devices. Any chance of getting memory usage down to ~300 MB?

niedev commented Feb 22, 2025

@salehsoleimani This library is derived from RTranslator, an Android app, so 1.5 GB is not too much for mobile devices (although it is certainly quite heavy).

@salehsoleimani (Author)

Can you provide an Android example in this repo?

@salehsoleimani (Author)

Are you sure it works in real time on mobile devices? I tried out RTranslator and it didn't seem to be real-time! It takes at least 3-6 seconds to process each chunk.

eix128 (Owner) commented Feb 24, 2025

@salehsoleimani
This is a heavily modified version of the RTranslator library, as niedev said, but it's not really the same: it has been optimized further. I don't currently have much time to keep updating it.
Many libraries have appeared after Whisper. One of them, as I told niedev, is SenseVoice. You can check it out if you want a really small footprint. It's a newer library on the market, and some people call it a Whisper killer.

niedev commented Feb 27, 2025

> Are you sure it works in real time on mobile devices? I tried out RTranslator and it didn't seem to be real-time! It takes at least 3-6 seconds to process each chunk.

It depends a lot on the phone you use (mine takes 1.6-2 seconds per chunk), but yeah, the audio is always processed in chunks, no matter how small or fast the model is. For true real-time speech recognition with Whisper, the only option I know of is the stream version of whisper.cpp.
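
For reference, here is a minimal sketch of what chunk-based processing looks like with the whisper.cpp C API (this is not code from this repo or from RTranslator; the model path is just an example and `record_chunk_16khz_mono()` is a hypothetical stand-in for real audio capture via AudioRecord/Oboe):

```cpp
#include "whisper.h"

#include <cstdio>
#include <vector>

// Hypothetical capture helper: in a real app this would pull ~3-6 s of 16 kHz
// mono float PCM from the microphone; here it just returns 5 s of silence.
static std::vector<float> record_chunk_16khz_mono() {
    return std::vector<float>(5 * WHISPER_SAMPLE_RATE, 0.0f);
}

int main() {
    // Load a ggml Whisper model (example path).
    whisper_context *ctx = whisper_init_from_file_with_params(
            "ggml-base.bin", whisper_context_default_params());
    if (ctx == nullptr) return 1;

    whisper_full_params params = whisper_full_default_params(WHISPER_SAMPLING_GREEDY);
    params.n_threads      = 4;
    params.language       = "en";
    params.print_progress = false;

    // One chunk in, one transcription out: the per-chunk latency discussed in
    // this thread is the time this single whisper_full() call takes on-device.
    std::vector<float> pcm = record_chunk_16khz_mono();
    if (whisper_full(ctx, params, pcm.data(), (int) pcm.size()) == 0) {
        for (int i = 0; i < whisper_full_n_segments(ctx); ++i) {
            std::printf("%s\n", whisper_full_get_segment_text(ctx, i));
        }
    }

    whisper_free(ctx);
    return 0;
}
```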

@salehsoleimani (Author)

> > Are you sure it works in real time on mobile devices? I tried out RTranslator and it didn't seem to be real-time! It takes at least 3-6 seconds to process each chunk.
>
> It depends a lot on the phone you use (mine takes 1.6-2 seconds per chunk), but yeah, the audio is always processed in chunks, no matter how small or fast the model is. For true real-time speech recognition with Whisper, the only option I know of is the stream version of whisper.cpp.

Thanks for the reply. Where you mentioned 1.6 seconds per chunk, do you mean RTranslator or this repo? And do you have any examples for the code you mentioned in your reply?

niedev commented Feb 27, 2025

I mean RTranslator. And what do you mean by an example?

@salehsoleimani (Author)

> I mean RTranslator. And what do you mean by an example?

An example of an Android implementation.

niedev commented Feb 27, 2025

Oh ok, there is a Whisper.cpp example app for Android, but it doesn't implement stream inference for Whisper. You could implement it yourself by understanding how the stream version works and porting it to Android in C++ (the code is in the example I linked in the previous message, and the issue linked on that page explains how it works).
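
If it helps, here is a rough sketch of the idea behind the stream approach, written as an NDK-style C++ loop: every `step_ms` milliseconds it re-runs inference over the most recent `length_ms` of audio, so partial text appears while the user is still speaking. This is an approximation of the technique, not the actual whisper.cpp stream example code; `capture_audio()` is a hypothetical callback you would back with Oboe/AAudio or AudioRecord.

```cpp
#include "whisper.h"

#include <cstdio>
#include <vector>

// Hypothetical blocking capture: fills 'out' with n_samples of 16 kHz mono PCM
// from the microphone (e.g. via Oboe/AAudio); stubbed here with silence.
static void capture_audio(std::vector<float> &out, int n_samples) {
    out.assign(n_samples, 0.0f);
}

static void stream_transcribe(whisper_context *ctx) {
    const int step_ms   = 500;   // how often inference is re-run
    const int length_ms = 5000;  // audio window fed to the model each time
    const int n_step    = (WHISPER_SAMPLE_RATE * step_ms)   / 1000;
    const int n_length  = (WHISPER_SAMPLE_RATE * length_ms) / 1000;

    whisper_full_params params = whisper_full_default_params(WHISPER_SAMPLING_GREEDY);
    params.n_threads      = 4;
    params.no_context     = true;  // decode each window independently
    params.single_segment = true;  // one segment per window keeps latency low
    params.print_progress = false;

    std::vector<float> window;        // sliding window: last length_ms of audio
    std::vector<float> step(n_step);  // newly captured samples for this iteration

    while (true) {
        capture_audio(step, n_step);
        window.insert(window.end(), step.begin(), step.end());
        if ((int) window.size() > n_length) {
            // Drop the oldest samples so the window never exceeds length_ms.
            window.erase(window.begin(), window.end() - n_length);
        }

        if (whisper_full(ctx, params, window.data(), (int) window.size()) == 0) {
            for (int i = 0; i < whisper_full_n_segments(ctx); ++i) {
                std::printf("%s\n", whisper_full_get_segment_text(ctx, i));
            }
        }
    }
}

int main() {
    whisper_context *ctx = whisper_init_from_file_with_params(
            "ggml-base.bin", whisper_context_default_params());
    if (ctx == nullptr) return 1;
    stream_transcribe(ctx);  // runs until the process is killed
    whisper_free(ctx);
    return 0;
}
```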

@salehsoleimani (Author)

> Oh ok, there is a Whisper.cpp example app for Android, but it doesn't implement stream inference for Whisper. You could implement it yourself by understanding how the stream version works and porting it to Android in C++ (the code is in the example I linked in the previous message, and the issue linked on that page explains how it works).

Nice, thanks, I appreciate it.
