To set up the environment, follow these steps:
- Create a virtual environment and install the required packages:

  ```
  python3.11 -m venv server
  source server/bin/activate
  pip install -r requirements.txt
  ```

  If you are using the Nemo-based ASR model, use the following requirements file instead:

  ```
  pip install -r requirements_wx_nemo.txt
  ```
- Add the following lines to `server/bin/activate` to ensure that the necessary libraries are accessible:

  ```
  export LD_LIBRARY_PATH=/path/to/environment/server/lib64/python3.11/site-packages/nvidia/cublas/lib:/path/to/environment/server/lib64/python3.11/site-packages/nvidia/cudnn/lib
  ```

  If you are using the Nemo version, also add the following:

  ```
  export CPATH=$HOME/python-dev/include:$CPATH
  ```
- If Java 11.0 is not installed, set up the Java environment variables by adding the following to `server/bin/activate`:

  ```
  export JAVA_HOME=/path/to/java/installation/jdk-11.0.16.1+1
  export PATH=$JAVA_HOME/bin:$PATH
  ```
- If ffmpeg 7.0.2 is not installed, you may need to download it from here and build it locally. Then set up the environment variables:

  ```
  export JAVA_HOME=/path/to/java/installation/jdk-11.0.16.1+1
  export FFMPEG=/path/to/ffmpeg/installation/ffmpeg-7.0.2/build
  export PATH=$JAVA_HOME/bin:$FFMPEG/bin:$PATH
  ```

  ffmpeg 7.0.2 is required if you want to make API calls to this server from the Safari browser.
Make sure to replace `/path/to/environment/`, `/path/to/java/installation/`, and `/path/to/ffmpeg/installation/` with the actual paths to your environment, Java, and ffmpeg installations.
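As an optional sanity check (not part of the repo), the short script below verifies that `ffmpeg` and `java` are on the `PATH` and that the directories exported in `LD_LIBRARY_PATH` exist. Run it from the activated environment.

```python
# check_env.py - optional sanity check (illustrative; not part of the repo).
import os
import shutil

# Confirm the external tools needed by the server are on PATH.
for tool in ("ffmpeg", "java"):
    location = shutil.which(tool)
    print(f"{tool}: {location or 'NOT FOUND on PATH'}")

# Confirm the CUDA library directories exported in server/bin/activate exist.
for lib_dir in os.environ.get("LD_LIBRARY_PATH", "").split(os.pathsep):
    if lib_dir:
        status = "ok" if os.path.isdir(lib_dir) else "missing"
        print(f"LD_LIBRARY_PATH entry {lib_dir}: {status}")
```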
- `whisperx_model.py`: Contains the model definition.
- `whisperx_handler.py`: Handles data input/output operations (a minimal sketch of such a handler is shown below).
- `archive.sh`: Used to create the `.mar` file for the model in the `model_store` folder.
- `config*.json`: Configuration for the corresponding models.
- `client_webpage.html`: An example client-side HTML page that sends audio to this torchserve server and displays the transcription.
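For reference, here is a minimal sketch of a TorchServe custom handler in the same spirit as `whisperx_handler.py`. The class name, the `load_my_asr_model` helper, and the `transcribe` call are illustrative placeholders, not the actual implementation.

```python
# Minimal TorchServe custom handler sketch (illustrative; not the repo's code).
import io

import torch
from ts.torch_handler.base_handler import BaseHandler


class ASRHandler(BaseHandler):
    def initialize(self, context):
        # Called once when the worker starts: pick a device and load the model.
        self.device = torch.device("cuda" if torch.cuda.is_available() else "cpu")
        self.model = load_my_asr_model(context)  # placeholder: build/restore your model
        self.model.to(self.device).eval()
        self.initialized = True

    def preprocess(self, data):
        # TorchServe passes a list of requests; the raw audio bytes arrive
        # under "data" or "body".
        audio_bytes = data[0].get("data") or data[0].get("body")
        return io.BytesIO(audio_bytes)

    def inference(self, audio):
        with torch.no_grad():
            return self.model.transcribe(audio)  # placeholder decoding call

    def postprocess(self, transcription):
        # Must return a list with one entry per request.
        return [{"transcription": transcription}]
```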
- The ASR models are currently hosted on the slatelab server (this needs to change at some point).
- Port forwarding is needed. First run:

  ```
  ssh -L 8080:localhost:8080 user@cse-d01187744s.coeit.osu.edu
  ```

  Then open `client_webpage.html`. You will see a page like the one below.
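If you prefer to test the endpoint without the webpage, the following sketch (not part of the repo) posts an audio file through the forwarded port. The endpoint name `my_asr` and the file name are placeholders that must match your deployment.

```python
# send_audio.py - minimal test client (illustrative; not part of the repo).
import urllib.request

# Read the audio file to send; replace with your own recording.
with open("sample.wav", "rb") as f:
    audio_bytes = f.read()

# POST the raw audio bytes to the predictions endpoint exposed through
# the SSH tunnel; adjust the model name to your endpoint.
req = urllib.request.Request(
    "http://localhost:8080/predictions/my_asr",
    data=audio_bytes,
    method="POST",
)
with urllib.request.urlopen(req) as resp:
    print(resp.read().decode("utf-8"))
```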
To update the model:
- Always archive the model first by running:

  ```
  ./archive.sh
  ```
- After archiving, start the server by running (see the snippet after this list for a quick way to confirm the model was registered):

  ```
  ./start_server.sh
  ```
- To stop the server, run:

  ```
  ./stop_server.sh
  ```

- You will need to define `my_model.py` and `my_handler.py` (an example is shown for a CTC model I trained).
- Include all extra files you import under `--extra-files`, as shown in `archive.sh`.
- Change line 90 in `client_webpage.html` to your defined endpoint, for example `http://localhost:8080/predictions/my_asr`.
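As an optional check (not part of the repo), and assuming TorchServe's default management port of 8081 has not been changed in the server configuration, the snippet below lists the registered models after `./start_server.sh`:

```python
# list_models.py - optional check (illustrative; assumes TorchServe's
# default management port 8081).
import json
import urllib.request

# Query the TorchServe management API for the currently registered models.
with urllib.request.urlopen("http://localhost:8081/models") as resp:
    print(json.dumps(json.load(resp), indent=2))
```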