GitHub / InseeFrLab / auto-tuning-vllm
Auto-tuning for vllm. Getting the best performance out of your LLM deployment (vllm+guidellm+optuna)
JSON API: https://data.code.gouv.fr/api/v1/hosts/GitHub/repositories/InseeFrLab%2Fauto-tuning-vllm
Fork of openshift-psap/auto-tuning-vllm
Stars: 1
Forks: 0
Open issues: 4
License: apache-2.0
Language:
Size: 43.8 MB
Dependencies parsed at:
0
Created at: 1 day ago
Updated at: 1 day ago
Pushed at: 15 days ago
Last synced at: about 22 hours ago
No dependencies found