Build AI assistants
that actually work
Power OpenClaw, Hermes Agent, or your own chatbot with Kyma. One API key, active models, streaming, tool use. Free to start.
Agent-Ready Models
Qwen 3.6 Plus for strong general reasoning, Kimi K2.5 for tool use, and DeepSeek R1 for deeper planning workflows.
Streaming & Tool Use
Full streaming support, function calling, JSON mode. Everything your agent framework needs, OpenAI-compatible.
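As a rough illustration of what function calling involves on the client side, the sketch below defines one tool in the OpenAI-style `tools` schema and routes a tool call back to a local Python function. The `get_weather` tool, its canned reply, and the `dispatch_tool_call` helper are hypothetical. The tool-call shape assumes Kyma mirrors the OpenAI chat completions format, which the OpenAI-compatible claim suggests but this page does not spell out.

```python
import json

# Hypothetical tool: a real agent would query a weather service here.
def get_weather(city: str) -> str:
    return f"Sunny in {city}"

# Tool description in the OpenAI-style function-calling schema
# (assumed to be accepted as-is by an OpenAI-compatible endpoint).
TOOLS = [
    {
        "type": "function",
        "function": {
            "name": "get_weather",
            "description": "Get the current weather for a city.",
            "parameters": {
                "type": "object",
                "properties": {"city": {"type": "string"}},
                "required": ["city"],
            },
        },
    }
]

# Maps tool names to the local callables that implement them.
REGISTRY = {"get_weather": get_weather}

def dispatch_tool_call(tool_call: dict) -> dict:
    """Run one tool call and return the 'tool' message to append to the chat."""
    fn = tool_call["function"]
    handler = REGISTRY.get(fn["name"])
    if handler is None:
        content = f"Unknown tool: {fn['name']}"
    else:
        # Arguments arrive as a JSON-encoded string.
        args = json.loads(fn["arguments"] or "{}")
        content = handler(**args)
    return {"role": "tool", "tool_call_id": tool_call["id"], "content": content}
```

In practice you would pass `tools=TOOLS` to `client.chat.completions.create(...)`, convert each returned tool call to a dict, append the result of `dispatch_tool_call` to `messages`, and call the model again.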
100+ Languages
Qwen models excel at Vietnamese, Chinese, Japanese and 100+ languages. Build multilingual assistants effortlessly.
Best Models for Assistants
Ranked by conversational quality, reasoning depth, and real user adoption.
qwen-3.6-plus (#1 on Kyma)
Alibaba's flagship. 131K context with strong reasoning and multilingual performance. Great for multi-step agent workflows.
llama-3.3-70b (Most popular)
Meta's best open model. Fast on Groq, great for general assistant tasks and conversations.
gemma-4-31b (Lowest cost)
Google's open model. FREE inference, lowest cost per token. Supports image input.
deepseek-r1 (Best reasoning)
Top reasoning model with chain-of-thought. 96% cheaper than OpenAI o1. Ideal for complex multi-agent planning.
gemini-2.5-flash (1M context)
Google's 1M token context. Perfect for processing entire codebases, documents, or conversation history.
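One way to use this ranking in code is a small lookup that maps what a request needs most to a model ID. This is only a sketch: the model IDs and their strengths come from the list above, while the priority labels and the `pick_model` helper are made up for illustration.

```python
# Model IDs and strengths as listed on this page; the priority keys are hypothetical.
MODEL_FOR = {
    "quality": "qwen-3.6-plus",          # general reasoning, multilingual
    "speed": "llama-3.3-70b",            # fast general assistant tasks
    "cost": "gemma-4-31b",               # lowest cost per token
    "reasoning": "deepseek-r1",          # chain-of-thought planning
    "long_context": "gemini-2.5-flash",  # 1M token context
}

def pick_model(priority: str = "quality") -> str:
    """Return the model ID for a priority, falling back to the general default."""
    return MODEL_FOR.get(priority, MODEL_FOR["quality"])
```

The returned string can be passed directly as the `model` argument of any OpenAI-compatible client.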
Setup in 30 Seconds
1. Get your free API key → 2. Pick your tool below → 3. Paste the config.
OpenClaw
Settings → Providers → OpenAI Compatible
Base URL: https://kymaapi.com/v1
API Key: ky-xxxxx (your Kyma key)
Model: qwen-3.6-plus
Tip: OpenClaw works best with qwen-3.6-plus or llama-3.3-70b for general tasks.
Hermes Agent
# .hermes/config.yaml
provider:
  type: openai-compatible
  base_url: https://kymaapi.com/v1
  api_key: ky-xxxxx  # your Kyma key
  model: qwen-3.6-plus
Python
from openai import OpenAI

client = OpenAI(
    base_url="https://kymaapi.com/v1",
    api_key="ky-xxxxx",  # your Kyma key
)

# Multi-turn conversation
messages = [
    {"role": "system", "content": "You are a helpful assistant."},
    {"role": "user", "content": "Summarize this document..."},
]

response = client.chat.completions.create(
    model="qwen-3.6-plus",
    messages=messages,
    stream=True,
)

# Print tokens as they arrive
for chunk in response:
    if chunk.choices and chunk.choices[0].delta.content:
        print(chunk.choices[0].delta.content, end="", flush=True)

LangChain
from langchain_openai import ChatOpenAI
llm = ChatOpenAI(
    base_url="https://kymaapi.com/v1",
    api_key="ky-xxxxx",  # your Kyma key
    model="qwen-3.6-plus",
    streaming=True,
)

response = llm.invoke("Plan a 3-step workflow...")
print(response.content)

Node.js
import OpenAI from "openai";

const client = new OpenAI({
  baseURL: "https://kymaapi.com/v1",
  apiKey: "ky-xxxxx", // your Kyma key
});

// Streaming assistant
const stream = await client.chat.completions.create({
  model: "qwen-3.6-plus",
  messages: [
    { role: "system", content: "You are a helpful assistant." },
    { role: "user", content: "Help me plan..." },
  ],
  stream: true,
});

// Print tokens as they arrive
for await (const chunk of stream) {
  process.stdout.write(chunk.choices[0]?.delta?.content ?? "");
}

# Any OpenAI-compatible chatbot framework:
Base URL: https://kymaapi.com/v1
API Key:  ky-xxxxx
Model:    qwen-3.6-plus (best quality)
          or gemma-4-31b (lowest cost)
          or llama-3.3-70b (fastest)
# Supports: streaming, function calling,
# tool use, JSON mode, system prompts

What People Build
🤖 Personal AI Assistant
Multi-agent teams with OpenClaw — planning, research, coding, all orchestrated by AI.
💬 Customer Support Bot
Multilingual chatbot that understands context, handles complex queries, and escalates when needed.
📊 Data Analysis Agent
Text-to-SQL pipeline that converts questions into database queries. Popular with MiniMax M2.5.
✍️ Content Creation
Audio drama scripts, blog posts, social media — powered by Gemini 2.5 Flash's 1M context window.
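To make the Text-to-SQL use case above a little more concrete, here is a minimal sketch of the two pieces around the model call: building the chat messages from a schema and a question, and checking the returned SQL before it touches the database. The helper names, the prompt wording, and the use of SQLite are all assumptions for illustration. The `is_read_only` check is a deliberately crude guard, not a security boundary.

```python
import sqlite3

def build_sql_messages(schema: str, question: str) -> list[dict]:
    """Chat messages asking the model for a single read-only SQL query."""
    system = (
        "Translate the question into one SQLite SELECT statement. "
        "Reply with SQL only.\nSchema:\n" + schema
    )
    return [
        {"role": "system", "content": system},
        {"role": "user", "content": question},
    ]

def is_read_only(sql: str) -> bool:
    """Accept a single statement that starts with SELECT; reject everything else."""
    statement = sql.strip().rstrip(";").strip()
    # A remaining ';' suggests stacked statements, so refuse (conservatively).
    return statement.lower().startswith("select") and ";" not in statement

def run_query(conn: sqlite3.Connection, sql: str) -> list[tuple]:
    """Execute model-generated SQL only if it passes the read-only check."""
    if not is_read_only(sql):
        raise ValueError("Refusing to run non-SELECT SQL")
    return conn.execute(sql).fetchall()
```

The messages from `build_sql_messages` would go to `client.chat.completions.create(...)`, and the text of the reply would be passed to `run_query`. Connecting with a read-only database user is a stronger safeguard than string checks alone.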
Build your AI assistant — free
$0.50 in free credits on signup. No credit card required.
Get Free API Key