llm-hosting
Python
★ 26
updated 1y ago
This repository deploys and manages server processes that serve embeddings with the Infinity Embedding model or Large Language Models through an OpenAI-compatible vLLM server.