* Release memory when transcribing single files
Co-authored-by: keleftheriou <[email protected]>
* Add method to load from file into float array iteratively
- Reduces peak memory by converting to a float array chunk by chunk while loading, so each array copy is small
- Previously the entire buffer was copied at once, which spiked memory to 2x
* Fix leak
* Use VAD by default in examples
* Fix VAD thread issue
* Fix unused warning
* Revert change to early stop callback
* Fix warnings
- Optional cli commands are deprecated
- @_disfavoredOverload required @available to prevent infinite loop
* PR review - simplify early stop test logic
Co-authored-by: Andrey Leonov <[email protected]>
* Cleanup from review
---------
Co-authored-by: keleftheriou <[email protected]>
Co-authored-by: Andrey Leonov <[email protected]>
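
The chunked loading described in the commit message can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the code from this commit: `loadFloatSamples`, `readChunk`, and the 16-bit PCM input are hypothetical stand-ins for whatever file reader and sample format the library actually uses. The point it shows is that converting each chunk as it arrives keeps every temporary copy at most one chunk in size, instead of duplicating the whole buffer.

```swift
// Hypothetical helper (not the library's API): pulls samples from `readChunk`
// in pieces of at most `chunkSize` and converts each piece to Float right away.
func loadFloatSamples(chunkSize: Int, readChunk: (Int) -> [Int16]) -> [Float] {
    precondition(chunkSize > 0, "chunkSize must be positive")
    var output: [Float] = []
    while true {
        // Assumption: the reader returns an empty array once the source is exhausted.
        let chunk = readChunk(chunkSize)
        if chunk.isEmpty { break }
        // Convert lazily while appending, so no full-size intermediate array is built.
        output.append(contentsOf: chunk.lazy.map { Float($0) / 32768.0 })
    }
    return output
}

// Simulated source standing in for an audio file on disk.
let source: [Int16] = [0, 16384, -32768, 8192, -16384]
var cursor = 0
let samples = loadFloatSamples(chunkSize: 2) { count in
    let end = min(cursor + count, source.count)
    defer { cursor = end }
    return Array(source[cursor..<end])
}
```

With a real file reader, `chunkSize` would trade the number of read calls against the size of each temporary buffer.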
Logging.debug("Decoding loop without wait: \(result.timings.decodingLoop), with wait: \(resultWithWait.timings.decodingLoop)")

// Assert that the decoding predictions per token are not slower with the waiting
XCTAssertEqual(decodingTimePerTokenWithWait, decodingTimePerToken, accuracy: decodingTimePerToken, "Decoding predictions per token should not be significantly slower with waiting")
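
To make the tolerance in the assertion above concrete: `XCTAssertEqual` with an `accuracy:` argument passes when the absolute difference between the two values is no larger than the accuracy. Because the accuracy here is `decodingTimePerToken` itself, the with-wait time may be up to about twice the baseline before the test fails. The sketch below mirrors that comparison in plain Swift; `isWithinAccuracy` is a hypothetical helper and the timing values are illustrative, not measurements from this test.

```swift
// Hypothetical helper mirroring the tolerance check; not part of XCTest.
func isWithinAccuracy(_ actual: Double, _ expected: Double, accuracy: Double) -> Bool {
    return abs(actual - expected) <= accuracy
}

// Illustrative baseline, in seconds per token.
let decodingTimePerToken = 0.010

// 1.5x the baseline: the difference (0.005) is within the tolerance (0.010).
let slightlySlower = isWithinAccuracy(0.015, decodingTimePerToken, accuracy: decodingTimePerToken)

// 2.5x the baseline: the difference (0.015) exceeds the tolerance (0.010).
let muchSlower = isWithinAccuracy(0.025, decodingTimePerToken, accuracy: decodingTimePerToken)
```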