You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Mozilla Voice STT is an open source Speech-To-Text engine, using a model trained by machine learning techniques based on `Baidu's Deep Speech research paper <https://arxiv.org/abs/1412.5567>`_. Mozilla Voice STT uses Google's `TensorFlow <https://www.tensorflow.org/>`_ to make the implementation easier.
15
+
DeepSpeech is an open source Speech-To-Text engine, using a model trained by machine learning techniques based on `Baidu's Deep Speech research paper <https://arxiv.org/abs/1412.5567>`_. Project DeepSpeech uses Google's `TensorFlow <https://www.tensorflow.org/>`_ to make the implementation easier.
16
16
17
17
Documentation for installation, usage, and training models are available on `deepspeech.readthedocs.io <http://deepspeech.readthedocs.io/?badge=latest>`_.
@@ -26,14 +26,14 @@ If you'd like to build the language bindings or the decoder package, you'll also
26
26
Dependencies
27
27
------------
28
28
29
-
If you follow these instructions, you should compile your own binaries of Mozilla Voice STT (built on TensorFlow using Bazel).
29
+
If you follow these instructions, you should compile your own binaries of DeepSpeech (built on TensorFlow using Bazel).
30
30
31
31
For more information on configuring TensorFlow, read the docs up to the end of `"Configure the Build" <https://www.tensorflow.org/install/source#configure_the_build>`_.
32
32
33
33
Checkout source code
34
34
^^^^^^^^^^^^^^^^^^^^
35
35
36
-
Clone Mozilla Voice STT source code (TensorFlow will come as a submdule):
36
+
Clone DeepSpeech source code (TensorFlow will come as a submdule):
37
37
38
38
.. code-block::
39
39
@@ -56,24 +56,24 @@ After you have installed the correct version of Bazel, configure TensorFlow:
56
56
cd tensorflow
57
57
./configure
58
58
59
-
Compile Mozilla Voice STT
60
-
-------------------------
59
+
Compile DeepSpeech
60
+
------------------
61
61
62
-
Compile ``libmozilla_voice_stt.so``
62
+
Compile ``libdeepspeech.so``
63
63
^^^^^^^^^^^^^^^^^^^^^^^^^^^^
64
64
65
-
Within your TensorFlow directory, there should be a symbolic link to the Mozilla Voice STT ``native_client`` directory. If it is not present, create it with the follow command:
65
+
Within your TensorFlow directory, there should be a symbolic link to the DeepSpeech ``native_client`` directory. If it is not present, create it with the follow command:
66
66
67
67
.. code-block::
68
68
69
69
cd tensorflow
70
70
ln -s ../native_client
71
71
72
-
You can now use Bazel to build the main Mozilla Voice STT library, ``libmozilla_voice_stt.so``. Add ``--config=cuda`` if you want a CUDA build.
72
+
You can now use Bazel to build the main DeepSpeech library, ``libdeepspeech.so``. Add ``--config=cuda`` if you want a CUDA build.
The generated binaries will be saved to ``bazel-bin/native_client/``.
79
79
@@ -82,12 +82,12 @@ The generated binaries will be saved to ``bazel-bin/native_client/``.
82
82
Compile ``generate_scorer_package``
83
83
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
84
84
85
-
Following the same setup as for ``libmozilla_voice_stt.so`` above, you can rebuild the ``generate_scorer_package`` binary by adding its target to the command line: ``//native_client:generate_scorer_package``.
85
+
Following the same setup as for ``libdeepspeech.so`` above, you can rebuild the ``generate_scorer_package`` binary by adding its target to the command line: ``//native_client:generate_scorer_package``.
86
86
Using the example from above you can build the library and that binary at the same time:
While we test only on RPi3 Raspbian Buster and LePotato ARMBian Buster, anything compatible with ``armv7-a cortex-a53`` or ``armv8-a cortex-a53`` should be fine.
177
177
178
-
The ``mozilla_voice_stt`` binary can also be cross-built, with ``TARGET=rpi3`` or ``TARGET=rpi3-armv8``. This might require you to setup a system tree using the tool ``multistrap`` and the multitrap configuration files: ``native_client/multistrap_armbian64_buster.conf`` and ``native_client/multistrap_raspbian_buster.conf``.
178
+
The ``deepspeech`` binary can also be cross-built, with ``TARGET=rpi3`` or ``TARGET=rpi3-armv8``. This might require you to setup a system tree using the tool ``multistrap`` and the multitrap configuration files: ``native_client/multistrap_armbian64_buster.conf`` and ``native_client/multistrap_raspbian_buster.conf``.
179
179
The path of the system tree can be overridden from the default values defined in ``definitions.mk`` through the ``RASPBIAN`` ``make`` variable.
180
180
181
181
.. code-block::
182
182
183
183
cd ../DeepSpeech/native_client
184
-
make TARGET=<system> mozilla_voice_stt
184
+
make TARGET=<system> deepspeech
185
185
186
186
Android devices support
187
187
-----------------------
@@ -193,53 +193,53 @@ Please refer to TensorFlow documentation on how to setup the environment to buil
193
193
Using the library from Android project
194
194
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
195
195
196
-
We provide up-to-date and tested STT usable as an ``AAR`` package,
196
+
We provide uptodate and tested ``libdeepspeech`` usable as an ``AAR`` package,
197
197
for Android versions starting with 7.0 to 11.0. The package is published on
0 commit comments