GPT-5.5 Pro is OpenAI’s high-capability model optimized for deep reasoning and accuracy on complex, high-stakes workloads. It features a 1M+ token context window (922K input, 128K output) with support for...
AI Models
GPT-5.5 is OpenAI’s frontier model designed for complex professional workloads, building on GPT-5.4 with stronger reasoning, higher reliability, and improved token efficiency on hard tasks. It features a 1M+ token...
DeepSeek V4 Pro is a large-scale Mixture-of-Experts model from DeepSeek with 1.6T total parameters and 49B activated parameters, supporting a 1M-token context window. It is designed for advanced reasoning, coding,...
DeepSeek V4 Flash is an efficiency-optimized Mixture-of-Experts model from DeepSeek with 284B total parameters and 13B activated parameters, supporting a 1M-token context window. It is designed for fast inference and...
Ling-2.6-1T is an instant (instruct) model from inclusionAI and the company’s trillion-parameter flagship, designed for real-world agents that require fast execution and high efficiency at scale. It uses a “fast...
Hy3 preview is a high-efficiency Mixture-of-Experts model from Tencent designed for agentic workflows and production use. It supports configurable reasoning levels across disabled, low, and high modes, allowing it to...
MiMo-V2.5-Pro is Xiaomi’s flagship model, delivering strong performance in general agentic capabilities, complex software engineering, and long-horizon tasks, with top rankings on benchmarks such as ClawEval, GDPVal, and SWE-bench Pro....
MiMo-V2.5 is a native omnimodal model by Xiaomi. It delivers Pro-level agentic performance at roughly half the inference cost, while surpassing MiMo-V2-Omni in multimodal perception across image and video understanding...
[GPT-5.4](https://openrouter.ai/openai/gpt-5.4) Image 2 combines OpenAI's GPT-5.4 model with state-of-the-art image generation capabilities from GPT Image 2. It enables rich multimodal workflows, allowing users to seamlessly move between reasoning, coding, and...
Ling-2.6-flash is an instant (instruct) model from inclusionAI with 104B total parameters and 7.4B active parameters, designed for real-world agents that require fast responses, strong execution, and high token efficiency....
This model always redirects to the latest model in the Claude Opus family.
The Pareto Router maintains a tiered shortlist of strong coding models, ranked by [Artificial Analysis](https://artificialanalysis.ai/) coding percentiles. Set min_coding_score between 0 and 1 on the [pareto-router plugin](https://openrouter.ai/docs/guides/routing/routers/pareto-router#the-min_coding_score-parameter) to control how...
Kimi K2.6 is Moonshot AI's next-generation multimodal model, designed for long-horizon coding, coding-driven UI/UX generation, and multi-agent orchestration. It handles complex end-to-end coding tasks across Python, Rust, and Go, and...
Opus 4.7 is the next generation of Anthropic's Opus family, built for long-running, asynchronous agents. Building on the coding and agentic strengths of Opus 4.6, it delivers stronger performance on...
GLM-5.1 delivers a major leap in coding capability, with particularly significant gains in handling long-horizon tasks. Unlike previous models built around minute-level interactions, GLM-5.1 can work independently and continuously on...
Gemma 4 26B A4B IT is an instruction-tuned Mixture-of-Experts (MoE) model from Google DeepMind. Despite 25.2B total parameters, only 3.8B activate per token during inference — delivering near-31B quality at...
Gemma 4 31B Instruct is Google DeepMind's 30.7B dense multimodal model supporting text and image input with text output. Features a 256K token context window, configurable thinking/reasoning mode, native function...
Qwen 3.6 Plus builds on a hybrid architecture that combines efficient linear attention with sparse mixture-of-experts routing, enabling strong scalability and high-performance inference. Compared to the 3.5 series, it delivers...
GLM-5V-Turbo is Z.ai’s first native multimodal agent foundation model, built for vision-based coding and agent-driven tasks. It natively handles image, video, and text inputs, excels at long-horizon planning, complex coding,...
Trinity Large Thinking is a powerful open-source reasoning model from the Arcee AI team. It shows strong performance in PinchBench, agentic workloads, and reasoning tasks. Launch video: https://youtu.be/Gc82AXLa0Rg?si=4RLn6WBz33qT--B7...
Grok 4.20 Multi-Agent is a variant of xAI’s Grok 4.20 designed for collaborative, agent-based workflows. Multiple agents operate in parallel to conduct deep research, coordinate tool use, and synthesize information...
Grok 4.20 is a reasoning model from xAI with industry-leading speed and agentic tool calling capabilities. It combines the lowest hallucination rate on the market with strict prompt adherence, delivering...
Full-length songs are priced at $0.08 per song. Lyria 3 is Google's family of music generation models, available through the Gemini API. With Lyria 3, you can generate high-quality, 48kHz...
30-second clips are priced at $0.04 per clip. Lyria 3 is Google's family of music generation models, available through the Gemini API. With Lyria 3, you can generate...
KAT-Coder-Pro V2 is the latest high-performance model in KwaiKAT’s KAT-Coder series, designed for complex enterprise-grade software engineering and SaaS integration. It builds on the agentic coding strengths of earlier versions,...
Reka Edge is an extremely efficient 7B multimodal vision-language model that accepts image/video+text inputs and generates text outputs. This model is optimized specifically to deliver industry-leading performance in image understanding,...
MiniMax-M2.7 is a next-generation large language model designed for autonomous, real-world productivity and continuous improvement. Built to actively participate in its own evolution, M2.7 integrates advanced agentic capabilities through multi-agent...
GPT-5.4 nano is the most lightweight and cost-efficient variant of the GPT-5.4 family, optimized for high-volume tasks where speed matters. It supports text and image inputs and is designed for low-latency...
GPT-5.4 mini brings the core capabilities of GPT-5.4 to a faster, more efficient model optimized for high-throughput workloads. It supports text and image inputs with strong performance across reasoning, coding,...
Mistral Small 4 is the next major release in the Mistral Small family, unifying the capabilities of several flagship Mistral models into a single system. It combines strong reasoning from...
GLM-5 Turbo is a new model from Z.ai designed for fast inference and strong performance in agent-driven environments such as OpenClaw scenarios. It is deeply optimized for real-world agent workflows...
NVIDIA Nemotron 3 Super is a 120B-parameter open hybrid MoE model, activating just 12B parameters for maximum compute efficiency and accuracy in complex multi-agent applications. Built on a hybrid Mamba-Transformer...
Seed-2.0-Lite is a versatile, cost-efficient enterprise workhorse that delivers strong multimodal and agentic capabilities while significantly reducing latency, making it a practical default choice for most production workloads across...
Qwen3.5-9B is a multimodal foundation model from the Qwen3.5 family, designed to deliver strong reasoning, coding, and visual understanding in an efficient 9B-parameter architecture. It uses a unified vision-language design...
GPT-5.4 Pro is OpenAI's most advanced model, building on GPT-5.4's unified architecture with enhanced reasoning capabilities for complex, high-stakes tasks. It features a 1M+ token context window (922K input, 128K...
GPT-5.4 is OpenAI’s latest frontier model, unifying the Codex and GPT lines into a single system. It features a 1M+ token context window (922K input, 128K output) with support for...
Mercury 2 is an extremely fast reasoning LLM, and the first reasoning diffusion LLM (dLLM). Instead of generating tokens sequentially, Mercury 2 produces and refines multiple tokens in parallel, achieving...
GPT-5.3 Chat is an update to ChatGPT's most-used model that makes everyday conversations smoother, more useful, and more directly helpful. It delivers more accurate answers with better contextualization and significantly...
Gemini 3.1 Flash Lite Preview is Google's high-efficiency model optimized for high-volume use cases. It outperforms Gemini 2.5 Flash Lite on overall quality and approaches Gemini 2.5 Flash performance across...
Seed-2.0-mini targets latency-sensitive, high-concurrency, and cost-sensitive scenarios, emphasizing fast response and flexible inference deployment. It delivers performance comparable to ByteDance-Seed-1.6, supports a 256K context, four reasoning effort modes (minimal/low/medium/high), multimodal understanding,...
Gemini 3.1 Flash Image Preview, a.k.a. "Nano Banana 2," is Google’s latest state-of-the-art image generation and editing model, delivering Pro-level visual quality at Flash speed. It combines...
The Qwen3.5 Series 35B-A3B is a native vision-language model designed with a hybrid architecture that integrates a linear attention mechanism with a sparse mixture-of-experts model, achieving higher inference efficiency. Its overall...
The Qwen3.5 27B native vision-language Dense model incorporates a linear attention mechanism, delivering fast response times while balancing inference speed and performance. Its overall capabilities are comparable to those of...
The Qwen3.5 122B-A10B native vision-language model is built on a hybrid architecture that integrates a linear attention mechanism with a mixture-of-experts model, achieving higher inference efficiency. In terms of...
The Qwen3.5 native vision-language Flash model is built on a hybrid architecture that integrates a linear attention mechanism with a sparse mixture-of-experts model, achieving higher inference efficiency. Compared to the...
LFM2-24B-A2B is the largest model in the LFM2 family of hybrid architectures designed for efficient on-device deployment. Built as a 24B parameter Mixture-of-Experts model with only 2B active parameters per...
Gemini 3.1 Pro Preview Custom Tools is a variant of Gemini 3.1 Pro that improves tool selection behavior by preventing overuse of a general bash tool when more efficient third-party...







