GitHub / France-Travail / vllm-ft
A high-throughput and memory-efficient inference and serving engine for LLMs
JSON API: https://data.code.gouv.fr/api/v1/hosts/GitHub/repositories/France-Travail%2Fvllm-ft
Stars: 9
Forks: 2
Open issues: 1
License: apache-2.0
Language: Python
Size: 130 MB
Dependencies parsed at: Pending
Created at: about 1 year ago
Updated at: about 15 hours ago
Pushed at: about 16 hours ago
Last synced at: 18 minutes ago
Funding Links https://github.com/sponsors/vllm-project, https://opencollective.com/vllm
Readme
Loading...