Swap GPT for any LLM by changing a single line of code. Xinference lets you run open-source, speech, and multimodal models on cloud, on-prem, or your laptop — all through one unified, production-ready inference API.
WOKHEI This is an open source repository surfaced by WokHei. The description above is from the original GitHub repo. Click "View repository" to explore the code, issues, and contributors.
Discussion
Join the discussion
Sign in for a verified badge and your comments appear instantly. Or post anonymously — anonymous comments are held briefly for moderation.