Using a Pretrained Model - Youngmi-Park/automatic-speech-recognition GitHub Wiki
Pre-trained model๋ก ํ ์คํธํ๋ ์์
-
Setup python environment. Install virtualenv package.
-
Create a DeepSpeech virtual environment
$ virtualenv -p python3 $HOME/tmp/deepspeech-venv/
- Activating the environment
$ source $HOME/tmp/deepspeech-venv/bin/activate
- Get the git-lfs repo:
curl -s https://packagecloud.io/install/repositories/github/git-lfs/script.deb.sh | sudo bash
- Installing DeepSpeech Python bindings
$ pip3 install deepspeech
$ pip3 install --upgrade deepspeech
$ pip3 install deepspeech-gpu
$ pip3 install --upgrade deepspeech-gpu
- install git-lfs:
sudo apt-get install git-lfs
- Download the DeepSpeech github repository
$ git clone https://github.com/mozilla/DeepSpeech
- Getting the pre-trained model
wget https://github.com/mozilla/DeepSpeech/releases/download/v0.9.3/deepspeech-0.9.3-models.pbmm
wget https://github.com/mozilla/DeepSpeech/releases/download/v0.9.3/deepspeech-0.9.3-models.scorer
deepspeech --model deepspeech-0.9.3-models.pbmm --scorer deepspeech-0.9.3-models.scorer --audio my_audio_file.wav
deepspeech --model deepspeech-0.9.3-models.pbmm --scorer deepspeech-0.9.3-models.scorer --audio audio/2830-3980-0043.wav
deepspeech --model deepspeech-0.9.3-models.pbmm --scorer deepspeech-0.9.3-models.scorer --audio audio/4507-16021-0012.wav
deepspeech --model deepspeech-0.9.3-models.pbmm --scorer deepspeech-0.9.3-models.scorer --audio audio/8455-210777-0068.wav