AI Models

20 models Free & Paid Updated: 5 hours ago

Google: Gemma 4 26B A4B

Gemma 4 26B A4B IT is an instruction-tuned Mixture-of-Experts (MoE) model from Google DeepMind. Despite 25.2B total parameters, only 3.8B activate per token during inference — delivering near-31B quality at a fraction of the compute cost. Supports multimodal input including…

by google |Apr 2026 |262K context |$0.1300/M input |$0.4000/M output

Google: Gemma 4 31B

Gemma 4 31B Instruct is Google DeepMind's 30.7B dense multimodal model supporting text and image input with text output. Features a 256K token context window, configurable thinking/reasoning mode, native function calling, and multilingual support across 140+ languages. Strong on coding,…

by google |Apr 2026 |262K context |$0.1400/M input |$0.4000/M output

Qwen: Qwen3.6 Plus (free)

Qwen 3.6 Plus builds on a hybrid architecture that combines efficient linear attention with sparse mixture-of-experts routing, enabling strong scalability and high-performance inference. Compared to the 3.5 series, it delivers major gains in agentic coding, front-end development, and overall reasoning,…

by qwen |Apr 2026 |1M context |Free input |Free output

Z.ai: GLM 5V Turbo

GLM-5V-Turbo is Z.ai’s first native multimodal agent foundation model, built for vision-based coding and agent-driven tasks. It natively handles image, video, and text inputs, excels at long-horizon planning, complex coding, and task execution, and works seamlessly with agents to complete…

by z-ai |Apr 2026 |203K context |$1.20/M input |$4.00/M output

Arcee AI: Trinity Large Thinking

Trinity Large Thinking is a powerful open-source reasoning model from the team at Arcee AI. It shows strong performance in PinchBench, agentic workloads, and reasoning tasks. It is free in OpenClaw for the first five days. Launch video:…

by arcee-ai |Apr 2026 |262K context |$0.2200/M input |$0.8500/M output

xAI: Grok 4.20 Multi-Agent

Grok 4.20 Multi-Agent is a variant of xAI’s Grok 4.20 designed for collaborative, agent-based workflows. Multiple agents operate in parallel to conduct deep research, coordinate tool use, and synthesize information across complex tasks. Reasoning effort behavior: - low / medium:…

by x-ai |Mar 2026 |2M context |$2.00/M input |$6.00/M output

xAI: Grok 4.20

Grok 4.20 is xAI's newest flagship model with industry-leading speed and agentic tool calling capabilities. It combines the lowest hallucination rate on the market with strict prompt adherence, delivering consistently precise and truthful responses. Reasoning can be enabled/disabled using the…

by x-ai |Mar 2026 |2M context |$2.00/M input |$6.00/M output

Google: Lyria 3 Pro Preview

Full-length songs are priced at $0.08 per song. Lyria 3 is Google's family of music generation models, available through the Gemini API. With Lyria 3, you can generate high-quality, 48kHz stereo audio from text prompts or from images. These models…

by google |Mar 2026 |1M context |Free input |Free output

Google: Lyria 3 Clip Preview

30-second clips are priced at $0.04 per clip. Lyria 3 is Google's family of music generation models, available through the Gemini API. With Lyria 3, you can generate high-quality, 48kHz stereo audio from text prompts or from images.…

by google |Mar 2026 |1M context |Free input |Free output

Kwaipilot: KAT-Coder-Pro V2

KAT-Coder-Pro V2 is the latest high-performance model in KwaiKAT’s KAT-Coder series, designed for complex enterprise-grade software engineering and SaaS integration. It builds on the agentic coding strengths of earlier versions, with a focus on large-scale production environments, multi-system coordination, and…

by kwaipilot |Mar 2026 |256K context |$0.3000/M input |$1.20/M output

Reka Edge

Reka Edge is an extremely efficient 7B multimodal vision-language model that accepts image/video+text inputs and generates text outputs. This model is optimized specifically to deliver industry-leading performance in image understanding, video analysis, object detection, and agentic tool-use.

by rekaai |Mar 2026 |16K context |$0.1000/M input |$0.1000/M output

Xiaomi: MiMo-V2-Omni

MiMo-V2-Omni is a frontier omni-modal model that natively processes image, video, and audio inputs within a unified architecture. It combines strong multimodal perception with agentic capability (visual grounding, multi-step planning, tool use, and code execution), making it well-suited…

by xiaomi |Mar 2026 |262K context |$0.4000/M input |$2.00/M output

Xiaomi: MiMo-V2-Pro

MiMo-V2-Pro is Xiaomi's flagship foundation model, featuring over 1T total parameters and a 1M context length, deeply optimized for agentic scenarios. It is highly adaptable to general agent frameworks like OpenClaw. It ranks among the global top tier in the…

by xiaomi |Mar 2026 |1M context |$1.00/M input |$3.00/M output

MiniMax: MiniMax M2.7

MiniMax-M2.7 is a next-generation large language model designed for autonomous, real-world productivity and continuous improvement. Built to actively participate in its own evolution, M2.7 integrates advanced agentic capabilities through multi-agent collaboration, enabling it to plan, execute, and refine complex tasks…

by minimax |Mar 2026 |205K context |$0.3000/M input |$1.20/M output

OpenAI: GPT-5.4 Nano

GPT-5.4 nano is the most lightweight and cost-efficient variant of the GPT-5.4 family, optimized for speed-critical and high-volume tasks. It supports text and image inputs and is designed for low-latency use cases such as classification, data extraction, ranking, and sub-agent…

by openai |Mar 2026 |400K context |$0.2000/M input |$1.25/M output

OpenAI: GPT-5.4 Mini

GPT-5.4 mini brings the core capabilities of GPT-5.4 to a faster, more efficient model optimized for high-throughput workloads. It supports text and image inputs with strong performance across reasoning, coding, and tool use, while reducing latency and cost for large-scale…

by openai |Mar 2026 |400K context |$0.7500/M input |$4.50/M output

Mistral: Mistral Small 4

Mistral Small 4 is the next major release in the Mistral Small family, unifying the capabilities of several flagship Mistral models into a single system. It combines strong reasoning from Magistral, multimodal understanding from Pixtral, and agentic coding capabilities from…

by mistralai |Mar 2026 |262K context |$0.1500/M input |$0.6000/M output

Z.ai: GLM 5 Turbo

GLM-5 Turbo is a new model from Z.ai designed for fast inference and strong performance in agent-driven environments such as OpenClaw scenarios. It is deeply optimized for real-world agent workflows involving long execution chains, with improved complex instruction decomposition, tool…

by z-ai |Mar 2026 |203K context |$1.20/M input |$4.00/M output

NVIDIA: Nemotron 3 Super (free)

NVIDIA Nemotron 3 Super is a 120B-parameter open hybrid MoE model, activating just 12B parameters for maximum compute efficiency and accuracy in complex multi-agent applications. Built on a hybrid Mamba-Transformer Mixture-of-Experts architecture with multi-token prediction (MTP), it delivers over 50%…

by nvidia |Mar 2026 |262K context |Free input |Free output

NVIDIA: Nemotron 3 Super

by nvidia |Mar 2026 |262K context |$0.1000/M input |$0.5000/M output

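The per-token prices in the listings above can be turned into a rough per-request cost estimate. The sketch below assumes that "/M" means US dollars per one million tokens and that input and output tokens are billed separately at the listed rates; the function name `estimate_cost` and the token counts are hypothetical, chosen only for illustration.

```python
from decimal import Decimal

TOKENS_PER_MILLION = Decimal(1_000_000)


def estimate_cost(input_tokens, output_tokens, input_price_per_m, output_price_per_m):
    """Estimate the cost in USD of one request.

    Prices are passed as strings (e.g. "0.13") so that Decimal can
    represent them exactly, avoiding binary floating-point rounding.
    """
    input_cost = Decimal(input_price_per_m) * Decimal(input_tokens) / TOKENS_PER_MILLION
    output_cost = Decimal(output_price_per_m) * Decimal(output_tokens) / TOKENS_PER_MILLION
    return input_cost + output_cost


# Illustrative request: 10,000 input tokens and 2,000 output tokens,
# priced at the rates listed for Gemma 4 26B A4B ($0.13 in, $0.40 out per million).
gemma_cost = estimate_cost(10_000, 2_000, "0.13", "0.40")

# A model listed as free on both sides costs nothing under this model.
free_cost = estimate_cost(10_000, 2_000, "0", "0")

print(f"Gemma 4 26B A4B: ${gemma_cost}")
print(f"Free model: ${free_cost}")
```

This estimate ignores anything the listing does not state, such as per-request fees, cached-token discounts, or the flat per-song and per-clip pricing mentioned for the Lyria 3 models.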