GitHub topics: llm-serving
France-Travail/happy_vllm
A production-ready REST API for vLLM
Language: Python - Size: 857 KB - Last synced at: about 17 hours ago - Pushed at: 8 months ago - Stars: 29 - Forks: 5
France-Travail/benchmark_llm_serving
A library for benchmarking LLMs through their exposed APIs
Language: Python - Size: 8.04 MB - Last synced at: about 17 hours ago - Pushed at: about 1 year ago - Stars: 8 - Forks: 1