AI Models

340 models Free & Paid Updated: 7 hours ago

OpenAI: GPT-5.5 Pro

GPT-5.5 Pro is OpenAI’s high-capability model optimized for deep reasoning and accuracy on complex, high-stakes workloads. It features a 1M+ token context window (922K input, 128K output) with support for...

by openai | April 2026 | 1.1M context | $30.00/M input | $180.00/M output

OpenAI: GPT-5.5

GPT-5.5 is OpenAI’s frontier model designed for complex professional workloads, building on GPT-5.4 with stronger reasoning, higher reliability, and improved token efficiency on hard tasks. It features a 1M+ token...

by openai | April 2026 | 1.1M context | $5.00/M input | $30.00/M output

deepseek: DeepSeek V4 Pro

DeepSeek V4 Pro is a large-scale Mixture-of-Experts model from DeepSeek with 1.6T total parameters and 49B activated parameters, supporting a 1M-token context window. It is designed for advanced reasoning, coding,...

by deepseek | April 2026 | 1M context | $0.4350/M input | $0.8700/M output

deepseek: DeepSeek V4 Flash

DeepSeek V4 Flash is an efficiency-optimized Mixture-of-Experts model from DeepSeek with 284B total parameters and 13B activated parameters, supporting a 1M-token context window. It is designed for fast inference and...

by deepseek | April 2026 | 1M context | $0.0900/M input | $0.1800/M output

inclusionAI: Ling-2.6-1T

Ling-2.6-1T is an instant (instruct) model from inclusionAI and the company’s trillion-parameter flagship, designed for real-world agents that require fast execution and high efficiency at scale. It uses a “fast...

by inclusionai | April 2026 | 262K context | $0.0750/M input | $0.6250/M output

Tencent: Hy3 preview

Hy3 preview is a high-efficiency Mixture-of-Experts model from Tencent designed for agentic workflows and production use. It supports configurable reasoning levels across disabled, low, and high modes, allowing it to...

by tencent | April 2026 | 262K context | $0.0630/M input | $0.2100/M output

Xiaomi: MiMo-V2.5-Pro

MiMo-V2.5-Pro is Xiaomi’s flagship model, delivering strong performance in general agentic capabilities, complex software engineering, and long-horizon tasks, with top rankings on benchmarks such as ClawEval, GDPVal, and SWE-bench Pro....

by Xiaomi | April 2026 | 1M context | $0.4350/M input | $0.8700/M output

Xiaomi: MiMo-V2.5

MiMo-V2.5 is a native omnimodal model by Xiaomi. It delivers Pro-level agentic performance at roughly half the inference cost, while surpassing MiMo-V2-Omni in multimodal perception across image and video understanding...

by Xiaomi | April 2026 | 1M context | $0.1050/M input | $0.2800/M output

OpenAI: GPT-5.4 Image 2

[GPT-5.4](https://openrouter.ai/openai/gpt-5.4) Image 2 combines OpenAI's GPT-5.4 model with state-of-the-art image generation capabilities from GPT Image 2. It enables rich multimodal workflows, allowing users to seamlessly move between reasoning, coding, and...

by openai | April 2026 | 272K context | $8.00/M input | $15.00/M output

inclusionAI: Ling-2.6-flash

Ling-2.6-flash is an instant (instruct) model from inclusionAI with 104B total parameters and 7.4B active parameters, designed for real-world agents that require fast responses, strong execution, and high token efficiency....

by inclusionai | April 2026 | 262K context | $0.0100/M input | $0.0300/M output

Anthropic: Claude Opus Latest

This model always redirects to the latest model in the Claude Opus family.

by ~anthropic | April 2026 | 1M context | $5.00/M input | $25.00/M output

Pareto Code Router

The Pareto Router maintains a tiered shortlist of strong coding models, ranked by [Artificial Analysis](https://artificialanalysis.ai/) coding percentiles. Set min_coding_score between 0 and 1 on the [pareto-router plugin](https://openrouter.ai/docs/guides/routing/routers/pareto-router#the-min_coding_score-parameter) to control how...

by openrouter | April 2026 | 2M context | Free input | Free output

MoonshotAI: Kimi K2.6

Kimi K2.6 is Moonshot AI's next-generation multimodal model, designed for long-horizon coding, coding-driven UI/UX generation, and multi-agent orchestration. It handles complex end-to-end coding tasks across Python, Rust, and Go, and...

by moonshotai | April 2026 | 262K context | $0.6600/M input | $3.41/M output

Anthropic: Claude Opus 4.7

Opus 4.7 is the next generation of Anthropic's Opus family, built for long-running, asynchronous agents. Building on the coding and agentic strengths of Opus 4.6, it delivers stronger performance on...

by anthropic | April 2026 | 1M context | $5.00/M input | $25.00/M output

Z.ai: GLM 5.1

GLM-5.1 delivers a major leap in coding capability, with particularly significant gains in handling long-horizon tasks. Unlike previous models built around minute-level interactions, GLM-5.1 can work independently and continuously on...

by z-ai | April 2026 | 203K context | $0.9660/M input | $3.04/M output

Google: Gemma 4 26B A4B

Gemma 4 26B A4B IT is an instruction-tuned Mixture-of-Experts (MoE) model from Google DeepMind. Despite 25.2B total parameters, only 3.8B activate per token during inference, delivering near-31B quality at...

by Google | April 2026 | 262K context | $0.0600/M input | $0.3300/M output

Google: Gemma 4 26B A4B (free)

by Google | April 2026 | 262K context | Free input | Free output

Google: Gemma 4 31b

Gemma 4 31B Instruct is Google DeepMind's 30.7B dense multimodal model supporting text and image input with text output. Features a 256K token context window, configurable thinking/reasoning mode, native function...

by Google | April 2026 | 262K context | $0.1200/M input | $0.3500/M output

Google: Gemma 4 31b (free)

by Google | April 2026 | 262K context | Free input | Free output

Qwen: Qwen3.6 Plus

Qwen 3.6 Plus builds on a hybrid architecture that combines efficient linear attention with sparse mixture-of-experts routing, enabling strong scalability and high-performance inference. Compared to the 3.5 series, it delivers...

by qwen | April 2026 | 1M context | $0.3250/M input | $1.95/M output

Z.ai: GLM 5V Turbo

GLM-5V-Turbo is Z.ai’s first native multimodal agent foundation model, built for vision-based coding and agent-driven tasks. It natively handles image, video, and text inputs, excels at long-horizon planning, complex coding,...

by z-ai | April 2026 | 203K context | $1.20/M input | $4.00/M output

Arcee AI: Trinity Large Thinking

Trinity Large Thinking is a powerful open-source reasoning model from the Arcee AI team. It shows strong performance in PinchBench, agentic workloads, and reasoning tasks. Launch video: https://youtu.be/Gc82AXLa0Rg?si=4RLn6WBz33qT--B7...

by arcee-ai | April 2026 | 262K context | $0.2500/M input | $0.8000/M output

xAI: Grok 4.20 Multi-Agent

Grok 4.20 Multi-Agent is a variant of xAI’s Grok 4.20 designed for collaborative, agent-based workflows. Multiple agents operate in parallel to conduct deep research, coordinate tool use, and synthesize information...

by x-ai | March 2026 | 2M context | $1.25/M input | $2.50/M output

xAI: Grok 4.20

Grok 4.20 is a reasoning model from xAI with industry-leading speed and agentic tool calling capabilities. It combines the lowest hallucination rate on the market with strict prompt adherence, delivering...

by x-ai | March 2026 | 2M context | $1.25/M input | $2.50/M output

Google: Lyria 3 Pro Preview

Full-length songs are priced at $0.08 per song. Lyria 3 is Google's family of music generation models, available through the Gemini API. With Lyria 3, you can generate high-quality, 48kHz...

by Google | March 2026 | 1M context | Free input | Free output

Google: Lyria 3 Clip Preview

30 second duration clips are priced at $0.04 per clip. Lyria 3 is Google's family of music generation models, available through the Gemini API. With Lyria 3, you can generate...

by Google | March 2026 | 1M context | Free input | Free output

Kwaipilot: KAT-Coder-Pro V2

KAT-Coder-Pro V2 is the latest high-performance model in KwaiKAT’s KAT-Coder series, designed for complex enterprise-grade software engineering and SaaS integration. It builds on the agentic coding strengths of earlier versions,...

by kwaipilot | March 2026 | 256K context | $0.3000/M input | $1.20/M output

Reka Edge

Reka Edge is an extremely efficient 7B multimodal vision-language model that accepts image/video+text inputs and generates text outputs. This model is optimized specifically to deliver industry-leading performance in image understanding,...

by rekaai | March 2026 | 16K context | $0.1000/M input | $0.1000/M output

MiniMax: MiniMax M2.7

MiniMax-M2.7 is a next-generation large language model designed for autonomous, real-world productivity and continuous improvement. Built to actively participate in its own evolution, M2.7 integrates advanced agentic capabilities through multi-agent...

by minimax | March 2026 | 205K context | $0.1800/M input | $0.7200/M output

OpenAI: GPT-5.4 Nano

GPT-5.4 nano is the most lightweight and cost-efficient version in the GPT-5.4 family, optimized for high-volume tasks where speed matters. It supports text and image inputs and is designed for low-latency...

by openai | March 2026 | 400K context | $0.2000/M input | $1.25/M output

OpenAI: GPT-5.4 Mini

GPT-5.4 mini brings the core capabilities of GPT-5.4 to a faster, more efficient model optimized for high-throughput workloads. It supports text and image inputs with strong performance across reasoning, coding,...

by openai | March 2026 | 400K context | $0.7500/M input | $4.50/M output

Mistral: Mistral Small 4

Mistral Small 4 is the next major release in the Mistral Small family, unifying the capabilities of several flagship Mistral models into a single system. It combines strong reasoning from...

by mistralai | March 2026 | 262K context | $0.1500/M input | $0.6000/M output

Z.ai: GLM 5 Turbo

GLM-5 Turbo is a new model from Z.ai designed for fast inference and strong performance in agent-driven environments such as OpenClaw scenarios. It is deeply optimized for real-world agent workflows...

by z-ai | March 2026 | 262K context | $1.20/M input | $4.00/M output

NVIDIA: Nemotron 3 Super (free)

NVIDIA Nemotron 3 Super is a 120B-parameter open hybrid MoE model, activating just 12B parameters for maximum compute efficiency and accuracy in complex multi-agent applications. Built on a hybrid Mamba-Transformer...

by nvidia | March 2026 | 1M context | Free input | Free output

NVIDIA: Nemotron 3 Super

by nvidia | March 2026 | 1M context | $0.0850/M input | $0.4000/M output

ByteDance Seed: Seed-2.0-Lite

Seed-2.0-Lite is a versatile, cost-efficient enterprise workhorse that delivers strong multimodal and agentic capabilities while significantly reducing latency, making it a practical default choice for most production workloads across...

by ByteDance Seed | March 2026 | 262K context | $0.2500/M input | $2.00/M output

Qwen: Qwen3.5-9B

Qwen3.5-9B is a multimodal foundation model in the Qwen3.5 family, designed to deliver strong reasoning, coding, and visual understanding in an efficient 9B-parameter architecture. It uses a unified vision-language design...

by qwen | March 2026 | 262K context | $0.1000/M input | $0.1500/M output

OpenAI: GPT-5.4 Pro

GPT-5.4 Pro is OpenAI's most advanced model, building on GPT-5.4's unified architecture with enhanced reasoning capabilities for complex, high-stakes tasks. It features a 1M+ token context window (922K input, 128K...

by openai | March 2026 | 1.1M context | $30.00/M input | $180.00/M output

OpenAI: GPT-5.4

GPT-5.4 is OpenAI’s latest frontier model, unifying the Codex and GPT lines into a single system. It features a 1M+ token context window (922K input, 128K output) with support for...

by openai | March 2026 | 1.1M context | $2.50/M input | $15.00/M output

Inception: Mercury 2

Mercury 2 is an extremely fast reasoning LLM, and the first reasoning diffusion LLM (dLLM). Instead of generating tokens sequentially, Mercury 2 produces and refines multiple tokens in parallel, achieving...

by inception | March 2026 | 128K context | $0.2500/M input | $0.7500/M output

OpenAI: GPT-5.3 Chat

GPT-5.3 Chat is an update to ChatGPT's most-used model that makes everyday conversations smoother, more useful, and more directly helpful. It delivers more accurate answers with better contextualization and significantly...

by openai | March 2026 | 128K context | $1.75/M input | $14.00/M output

Google: Gemini 3.1 Flash Lite Preview

Gemini 3.1 Flash Lite Preview is Google's high-efficiency model optimized for high-volume use cases. It outperforms Gemini 2.5 Flash Lite on overall quality and approaches Gemini 2.5 Flash performance across...

by Google | March 2026 | 1M context | $0.2500/M input | $1.50/M output

ByteDance Seed: Seed-2.0-Mini

Seed-2.0-mini targets latency-sensitive, high-concurrency, and cost-sensitive scenarios, emphasizing fast response and flexible inference deployment. It delivers performance comparable to ByteDance-Seed-1.6, supports 256k context, four reasoning effort modes (minimal/low/medium/high), multimodal understanding,...

by ByteDance Seed | February 2026 | 262K context | $0.1000/M input | $0.4000/M output

Google: Nano Banana 2 (Gemini 3.1 Flash Image Preview)

Gemini 3.1 Flash Image Preview, a.k.a. "Nano Banana 2," is Google’s latest state of the art image generation and editing model, delivering Pro-level visual quality at Flash speed. It combines...

by Google | February 2026 | 131K context | $0.5000/M input | $3.00/M output

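Qwen: Qwen3.5-35B-A3B

The Qwen3.5 Series 35B-A3B is a native vision-language model designed with a hybrid architecture that integrates linear attention mechanisms and a sparse mixture-of-experts model, achieving higher inference efficiency. Its overall...

by qwen | February 2026 | 262K context | $0.1400/M input | $1.00/M output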

Qwen: Qwen3.5-27B

The Qwen3.5 27B native vision-language Dense model incorporates a linear attention mechanism, delivering fast response times while balancing inference speed and performance. Its overall capabilities are comparable to those of...

by qwen | February 2026 | 262K context | $0.1950/M input | $1.56/M output

Qwen: Qwen3.5-122B-A10B

The Qwen3.5 122B-A10B native vision-language model is built on a hybrid architecture that integrates a linear attention mechanism with a mixture-of-experts model, achieving higher inference efficiency. In terms of...

by qwen | February 2026 | 262K context | $0.2600/M input | $2.08/M output

Qwen: Qwen3.5-Flash

The Qwen3.5 native vision-language Flash model is built on a hybrid architecture that integrates a linear attention mechanism with a sparse mixture-of-experts model, achieving higher inference efficiency. Compared to the...

by qwen | February 2026 | 1M context | $0.0650/M input | $0.2600/M output

LiquidAI: LFM2-24B-A2B

LFM2-24B-A2B is the largest model in the LFM2 family of hybrid architectures designed for efficient on-device deployment. Built as a 24B parameter Mixture-of-Experts model with only 2B active parameters per...

by Liquid | February 2026 | 128K context | $0.0300/M input | $0.1200/M output

Google: Gemini 3.1 Pro Preview Custom Tools

Gemini 3.1 Pro Preview Custom Tools is a variant of Gemini 3.1 Pro that improves tool selection behavior by preventing overuse of a general bash tool when more efficient third-party...

by Google | February 2026 | 1M context | $2.00/M input | $12.00/M output
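
The per-million-token prices listed in the entries above can be turned into a rough per-request estimate with simple arithmetic: tokens times price, divided by one million, summed over input and output. The sketch below is illustrative only. The helper name `estimate_cost` and the token counts are hypothetical, the prices are copied from the GPT-5.5 and DeepSeek V4 Flash entries, and it assumes flat pricing with no caching discounts, tiers, or per-item charges (such as the per-song pricing noted for Lyria 3).

```python
from decimal import Decimal

MILLION = Decimal(1_000_000)


def estimate_cost(input_tokens, output_tokens, input_price_per_m, output_price_per_m):
    """Estimate the USD cost of one request from per-million-token prices.

    Hypothetical helper: prices are passed as strings so Decimal keeps them exact.
    """
    input_cost = Decimal(input_tokens) * Decimal(input_price_per_m)
    output_cost = Decimal(output_tokens) * Decimal(output_price_per_m)
    return (input_cost + output_cost) / MILLION


# Illustrative request: 10,000 input tokens and 2,000 output tokens.
gpt_55 = estimate_cost(10_000, 2_000, "5.00", "30.00")        # 0.11 USD
v4_flash = estimate_cost(10_000, 2_000, "0.0900", "0.1800")   # 0.00126 USD

print(f"GPT-5.5: ${gpt_55}")
print(f"DeepSeek V4 Flash: ${v4_flash}")
```

Because output tokens are priced several times higher than input tokens for most entries, the output length usually dominates the estimate for generation-heavy workloads.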
