This repo refers to triton-inference-server and Triton Inference Server 介紹與範例 for building a text classification training task with PyTorch, Transformers BERT, and Celery workers, and deploying it with Triton Inference Server.
Note that this repo doesn't contain client code. You can refer to the triton-inference-server client docs to set up your client machine, or build an inference client with a Celery cluster; in that case the client has to bind to the same message broker.
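As a rough illustration of the Celery route, here is a minimal sketch of a client that binds to the same Redis broker. The task name `predict` and the argument format are hypothetical and must match whatever tasks the worker in this repo actually registers.

```python
# Minimal sketch of a Celery-based inference client (assumptions: the broker URL
# matches REDIS in .env, and the worker exposes a task named "predict" -- the
# task name and payload shape here are hypothetical).
from celery import Celery

app = Celery(
    "inference_client",
    broker="redis://localhost:6379/0",   # same message broker as the worker
    backend="redis://localhost:6379/0",  # same result backend as the worker
)

# Send the task by name and block until the worker returns a result.
result = app.send_task("predict", args=[["some text to classify"]])
print(result.get(timeout=30))
```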
- Clone the repo

  ```
  $ git clone <this repo>
  $ cd <this repo>
  ```

- Create a `.env` file with the following content (a sketch of how these variables might be consumed is shown below)

  ```
  # set LEVEL to info if you don't want verbose logging
  # change the redis localhost to your ip
  LEVEL=debug
  REDIS=redis://localhost:6379/0
  ```
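The worker code is not shown in this section, but these variables are presumably read from the environment. A minimal, hypothetical sketch of how `REDIS` and `LEVEL` could be wired into a Celery app follows; the actual module, app, and task names in this repo may differ.

```python
# Hypothetical sketch of how the .env variables could be consumed by the worker;
# the real app/task names and logic in this repo may differ.
import logging
import os

from celery import Celery

broker_url = os.getenv("REDIS", "redis://localhost:6379/0")
log_level = os.getenv("LEVEL", "info").upper()  # "debug" -> verbose logging

logging.basicConfig(level=getattr(logging, log_level, logging.INFO))

app = Celery("celery_server", broker=broker_url, backend=broker_url)

@app.task(name="predict")
def predict(texts):
    # placeholder: the real task would run BERT training/inference logic
    return [len(t) for t in texts]
```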
- Run the services; this will set up
  - `celery_server`: for model training and validation
  - `redis`: message broker and backend for `celery_worker`
  - `triton server`: for inference usage and model deployment
    - to run the Triton server on its own:

      ```
      docker run --rm --name test_triton_server --shm-size=1g --ulimit memlock=-1 --ulimit stack=67108864 -p 8000:8000 -p 8001:8001 -p 8002:8002 -v "$(pwd)"/model/torch_script:/models nvcr.io/nvidia/tritonserver:22.06-py3 tritonserver --model-store=/models --model-control-mode=poll
      ```
  ```
  $ docker-compose -f docker-compose.yml --env-file .env up
  ```

- Test if the model is loaded
  - 8000: HTTP protocol
  - 8001: GRPC protocol
  ```
  $ curl -v localhost:8000/v2/health/ready
  $ curl localhost:8000/v2/models/audience_bert/versions/1/stats
  ```

  The stats endpoint follows the pattern `GET v2/models[/${MODEL_NAME}[/versions/${MODEL_VERSION}]]/stats`.
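The same checks can be done programmatically. Below is a minimal sketch using the `tritonclient` HTTP package (installable as `tritonclient[http]`), assuming the server started above is reachable on `localhost:8000` and the deployed model is named `audience_bert`.

```python
# Minimal sketch of checking server/model readiness over HTTP with tritonclient.
# Assumes the Triton server from this repo's setup is reachable on localhost:8000
# and that the deployed model is named "audience_bert".
import tritonclient.http as httpclient

client = httpclient.InferenceServerClient(url="localhost:8000")

print("server live: ", client.is_server_live())
print("server ready:", client.is_server_ready())
print("model ready: ", client.is_model_ready("audience_bert"))

# Model metadata shows the expected input/output tensor names and shapes.
print(client.get_model_metadata("audience_bert"))
```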