Tuna — Deploy and Serve LLM Models on GPU Infrastructure


Tuna is a hybrid GPU inference orchestrator. It lets you deploy, serve, and manage LLMs (Llama, Qwen, Mistral, DeepSeek, Gemma, and any HuggingFace model) on serverless GPUs from **Modal, RunPod, Cerebrium, Google Cloud Run, Baseten, or Azure Container Apps**, with optional **spot-instance fallback on AWS** via SkyPilot. Every deployment gets an **OpenAI-compatible `/v1/chat/completions` endpoint**.
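Because each deployment exposes an OpenAI-compatible `/v1/chat/completions` endpoint, any OpenAI-style HTTP client can talk to it. A minimal sketch of constructing such a request, assuming a hypothetical base URL and model ID (both placeholders, not values Tuna itself defines):

```python
import json

# Hypothetical deployment URL; Tuna assigns the real one per backend.
BASE_URL = "https://my-tuna-deployment.example.com"
ENDPOINT = f"{BASE_URL}/v1/chat/completions"

# Standard OpenAI chat-completions payload shape.
payload = {
    "model": "llama-3.1-8b-instruct",  # placeholder model ID
    "messages": [
        {"role": "user", "content": "Summarize what Tuna does in one sentence."}
    ],
    "max_tokens": 64,
}

# Serialize as the JSON body an HTTP POST to ENDPOINT would carry.
body = json.dumps(payload)
print(ENDPOINT)
```

With a live deployment, POSTing `body` to `ENDPOINT` (with an `Authorization: Bearer <key>` header if the backend requires one) returns a standard chat-completion response, so existing OpenAI SDKs work by overriding the base URL.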

Resource information

Data source
bigquery-gharchive
Category
development
Created
2026/3/9
Updated
2026/3/14
