Using a Pretrained Model - Youngmi-Park/automatic-speech-recognition GitHub Wiki

Pre-trained model๋กœ ํ…Œ์ŠคํŠธํ•˜๋Š” ์ˆœ์„œ

  1. Setup python environment. Install virtualenv package.

  2. Create a DeepSpeech virtual environment

$ virtualenv -p python3 $HOME/tmp/deepspeech-venv/
  1. Activating the environment
$ source $HOME/tmp/deepspeech-venv/bin/activate
  1. Get the git-lfs repo:
curl -s https://packagecloud.io/install/repositories/github/git-lfs/script.deb.sh | sudo bash
  1. Installing DeepSpeech Python bindings
$ pip3 install deepspeech
$ pip3 install --upgrade deepspeech
$ pip3 install deepspeech-gpu
$ pip3 install --upgrade deepspeech-gpu
  1. install git-lfs:
sudo apt-get install git-lfs
  1. Download the DeepSpeech github repository
$ git clone https://github.com/mozilla/DeepSpeech
  1. Getting the pre-trained model
wget https://github.com/mozilla/DeepSpeech/releases/download/v0.9.3/deepspeech-0.9.3-models.pbmm
wget https://github.com/mozilla/DeepSpeech/releases/download/v0.9.3/deepspeech-0.9.3-models.scorer
deepspeech --model deepspeech-0.9.3-models.pbmm --scorer deepspeech-0.9.3-models.scorer --audio my_audio_file.wav
deepspeech --model deepspeech-0.9.3-models.pbmm --scorer deepspeech-0.9.3-models.scorer --audio audio/2830-3980-0043.wav
deepspeech --model deepspeech-0.9.3-models.pbmm --scorer deepspeech-0.9.3-models.scorer --audio audio/4507-16021-0012.wav
deepspeech --model deepspeech-0.9.3-models.pbmm --scorer deepspeech-0.9.3-models.scorer --audio audio/8455-210777-0068.wav