AI Models

358 models | Free & Paid | Updated: 9 hours ago

Grok Build 0.1 is xAI’s fast coding model trained specifically for agentic software engineering workflows. It supports text and image inputs with text output, and is optimized for interactive coding...

May 2026 | 256K context | $1.00/M input | $2.00/M output

Gemini 3.5 Flash is Google's high-efficiency multimodal model, bringing near-Pro level coding and reasoning at Flash-tier cost and speed. It is highly optimized for coding proficiency and parallel agentic execution...

May 2026 | 1M context | $1.50/M input | $9.00/M output

Fast-mode variant of [Opus 4.7](/anthropic/claude-opus-4.7) - identical capabilities with higher output speed at premium 6x pricing. Learn more in Anthropic's docs: https://platform.claude.com/docs/en/build-with-claude/fast-mode

May 2026 | 1M context | $30.00/M input | $150.00/M output

Perceptron Mk1 (Mark One) is Perceptron's highest-quality vision-language model for video and embodied reasoning. It accepts image and video inputs paired with natural language queries, and produces detailed visual understanding...

May 2026 | 33K context | $0.1500/M input | $1.50/M output

Ring-2.6-1T is a 1T-parameter-scale thinking model with 63B active parameters, built for real-world agent workflows that require both strong capability and operational efficiency. It is optimized for coding agents, tool...

May 2026 | 262K context | $0.0750/M input | $0.6250/M output

Gemini 3.1 Flash Lite is Google’s GA high-efficiency multimodal model optimized for low-latency, high-volume workloads. It supports text, image, video, audio, and PDF inputs, and is designed for lightweight agentic...

May 2026 | 1M context | $0.2500/M input | $1.50/M output

CoBuddy is a code generation model from Baidu, optimized for coding tasks and AI Agent workflows. It features high inference throughput and low end-to-end latency, with native support for tool...

May 2026 | 131K context | Free input | Free output

GPT Chat Latest points to OpenAI's stable API alias `chat-latest` that always resolves to the latest Instant chat model used in ChatGPT. As OpenAI rolls out new Instant model updates...

May 2026 | 400K context | $5.00/M input | $30.00/M output

Grok 4.3 is a reasoning model from xAI. It accepts text and image inputs with text output, and is suited for agentic workflows, instruction-following tasks, and applications requiring high factual...

Apr 2026 | 1M context | $1.25/M input | $2.50/M output

Granite 4.1 8B is a dense, decoder-only 8-billion-parameter language model from IBM, part of the Granite 4.1 family. It supports a 131K-token context window and is designed for enterprise tasks...

Apr 2026 | 131K context | $0.0500/M input | $0.1000/M output

Mistral Medium 3.5 is a dense 128B instruction-following model from Mistral AI. It supports text and image inputs with text output, and is designed for agentic workflows, coding, and complex...

Apr 2026 | 262K context | $1.50/M input | $7.50/M output

Owl Alpha is a high-performance foundation model designed for agentic workloads. It natively supports tool use and long-context tasks, with strong performance in code generation, automated workflows, and complex instruction execution...

Apr 2026 | 1M context | Free input | Free output

NVIDIA Nemotron™ 3 Nano Omni is a 30B-A3B open multimodal model designed to function as a perception and context sub-agent in enterprise agent systems. It accepts text, image, video, and...

Apr 2026 | 256K context | Free input | Free output

Laguna XS.2 is the second-generation model in the XS size class from [Poolside](https://poolside.ai), their efficient coding agent series. It combines tool calling and reasoning capabilities with a compact footprint, offering...

Apr 2026 | 131K context | Free input | Free output

Laguna M.1 is the flagship coding agent model from [Poolside](https://poolside.ai), optimized for complex software engineering tasks. Designed for agentic coding workflows, it supports tool calling and reasoning, with a 128K...

Apr 2026 | 131K context | Free input | Free output

This model always redirects to the latest model in the Anthropic Claude Haiku family.

Apr 2026 | 200K context | $1.00/M input | $5.00/M output

This model always redirects to the latest model in the OpenAI GPT Mini family.

Apr 2026 | 400K context | $0.7500/M input | $4.50/M output

This model always redirects to the latest model in the Google Gemini Pro family.

Apr 2026 | 1M context | $2.00/M input | $12.00/M output

This model always redirects to the latest model in the MoonshotAI Kimi family.

Apr 2026 | 262K context | $0.7300/M input | $3.49/M output

This model always redirects to the latest model in the Google Gemini Flash family.

Apr 2026 | 1M context | $1.50/M input | $9.00/M output

This model always redirects to the latest model in the Anthropic Claude Sonnet family.

Apr 2026 | 1M context | $3.00/M input | $15.00/M output

This model always redirects to the latest model in the OpenAI GPT family.

Apr 2026 | 1.1M context | $5.00/M input | $30.00/M output

Qwen3.5 Plus (April 2026) is a large-scale multimodal language model from Alibaba. It accepts text, image, and video input and produces text output, with a 1M token context window. This...

Apr 2026 | 1M context | $0.3000/M input | $1.80/M output

Qwen3.6 Flash is a fast, efficient language model from Alibaba's Qwen 3.6 series. It supports text, image, and video input with a 1M token context window. Tiered pricing kicks in...

Apr 2026 | 1M context | $0.1875/M input | $1.13/M output

Qwen3.6-35B-A3B is an open-weight multimodal model from Alibaba Cloud with 35 billion total parameters and 3 billion active parameters per token. It uses a hybrid sparse mixture-of-experts architecture combining Gated...

Apr 2026 | 262K context | $0.1490/M input | $1.00/M output

Qwen3.6-Max-Preview is a proprietary frontier model from Alibaba Cloud built on a sparse mixture-of-experts architecture with approximately 1 trillion total parameters. It is optimized for agentic coding, tool use, and...

Apr 2026 | 262K context | $1.04/M input | $6.24/M output

Qwen3.6 27B is a dense 27-billion-parameter language model from the Qwen Team at Alibaba, released in April 2026. It features hybrid multimodal capabilities — accepting text, image, and video inputs...

Apr 2026 | 262K context | $0.3200/M input | $3.20/M output

GPT-5.5 Pro is OpenAI’s high-capability model optimized for deep reasoning and accuracy on complex, high-stakes workloads. It features a 1M+ token context window (922K input, 128K output) with support for...

Apr 2026 | 1.1M context | $30.00/M input | $180.00/M output

GPT-5.5 is OpenAI’s frontier model designed for complex professional workloads, building on GPT-5.4 with stronger reasoning, higher reliability, and improved token efficiency on hard tasks. It features a 1M+ token...

Apr 2026 | 1.1M context | $5.00/M input | $30.00/M output

DeepSeek V4 Pro is a large-scale Mixture-of-Experts model from DeepSeek with 1.6T total parameters and 49B activated parameters, supporting a 1M-token context window. It is designed for advanced reasoning, coding,...

Apr 2026 | 1M context | $0.4350/M input | $0.8700/M output

DeepSeek V4 Flash is an efficiency-optimized Mixture-of-Experts model from DeepSeek with 284B total parameters and 13B activated parameters, supporting a 1M-token context window. It is designed for fast inference and...

Apr 2026 | 1M context | $0.1120/M input | $0.2240/M output

DeepSeek V4 Flash is an efficiency-optimized Mixture-of-Experts model from DeepSeek with 284B total parameters and 13B activated parameters, supporting a 1M-token context window. It is designed for fast inference and...

Apr 2026 | 1M context | Free input | Free output

Ling-2.6-1T is an instant (instruct) model from inclusionAI and the company’s trillion-parameter flagship, designed for real-world agents that require fast execution and high efficiency at scale. It uses a “fast...

Apr 2026 | 262K context | $0.0750/M input | $0.6250/M output

Hy3 preview is a high-efficiency Mixture-of-Experts model from Tencent designed for agentic workflows and production use. It supports configurable reasoning levels across disabled, low, and high modes, allowing it to...

Apr 2026 | 262K context | $0.0660/M input | $0.2600/M output

MiMo-V2.5-Pro is Xiaomi’s flagship model, delivering strong performance in general agentic capabilities, complex software engineering, and long-horizon tasks, with top rankings on benchmarks such as ClawEval, GDPVal, and SWE-bench Pro....

Apr 2026 | 1M context | $1.00/M input | $3.00/M output

MiMo-V2.5 is a native omnimodal model by Xiaomi. It delivers Pro-level agentic performance at roughly half the inference cost, while surpassing MiMo-V2-Omni in multimodal perception across image and video understanding...

Apr 2026 | 1M context | $0.4000/M input | $2.00/M output

[GPT-5.4](https://openrouter.ai/openai/gpt-5.4) Image 2 combines OpenAI's GPT-5.4 model with state-of-the-art image generation capabilities from GPT Image 2. It enables rich multimodal workflows, allowing users to seamlessly move between reasoning, coding, and...

Apr 2026 | 272K context | $8.00/M input | $15.00/M output

Ling-2.6-flash is an instant (instruct) model from inclusionAI with 104B total parameters and 7.4B active parameters, designed for real-world agents that require fast responses, strong execution, and high token efficiency....

Apr 2026 | 262K context | $0.0100/M input | $0.0300/M output

This model always redirects to the latest model in the Claude Opus family.

Apr 2026 | 1M context | $5.00/M input | $25.00/M output

The Pareto Router maintains a tiered shortlist of strong coding models, ranked by [Artificial Analysis](https://artificialanalysis.ai/) coding percentiles. Set min_coding_score between 0 and 1 on the [pareto-router plugin](https://openrouter.ai/docs/guides/routing/routers/pareto-router#the-min_coding_score-parameter) to control how...

Apr 2026 | 2M context | Free input | Free output

Qianfan-OCR-Fast is a domain-specific multimodal large model purpose-built for OCR. By leveraging specialized OCR training data while preserving versatile multimodal intelligence, it provides a powerful performance upgrade over Qianfan-OCR.

Apr 2026 | 66K context | $0.6800/M input | $2.81/M output

Kimi K2.6 is Moonshot AI's next-generation multimodal model, designed for long-horizon coding, coding-driven UI/UX generation, and multi-agent orchestration. It handles complex end-to-end coding tasks across Python, Rust, and Go, and...

Apr 2026 | 262K context | $0.7300/M input | $3.49/M output

Opus 4.7 is the next generation of Anthropic's Opus family, built for long-running, asynchronous agents. Building on the coding and agentic strengths of Opus 4.6, it delivers stronger performance on...

Apr 2026 | 1M context | $5.00/M input | $25.00/M output

Fast-mode variant of [Opus 4.6](/anthropic/claude-opus-4.6) - identical capabilities with higher output speed at premium 6x pricing. Learn more in Anthropic's docs: https://platform.claude.com/docs/en/build-with-claude/fast-mode

Apr 2026 | 1M context | $30.00/M input | $150.00/M output

GLM-5.1 delivers a major leap in coding capability, with particularly significant gains in handling long-horizon tasks. Unlike previous models built around minute-level interactions, GLM-5.1 can work independently and continuously on...

Apr 2026 | 203K context | Free input | Free output

Gemma 4 26B A4B IT is an instruction-tuned Mixture-of-Experts (MoE) model from Google DeepMind. Despite 25.2B total parameters, only 3.8B activate per token during inference, delivering near-31B quality at...

Apr 2026 | 262K context | $0.0600/M input | $0.3300/M output

Gemma 4 26B A4B IT is an instruction-tuned Mixture-of-Experts (MoE) model from Google DeepMind. Despite 25.2B total parameters, only 3.8B activate per token during inference, delivering near-31B quality at...

Apr 2026 | 262K context | Free input | Free output

Gemma 4 31B Instruct is Google DeepMind's 30.7B dense multimodal model supporting text and image input with text output. Features a 256K token context window, configurable thinking/reasoning mode, native function...

Apr 2026 | 262K context | $0.1200/M input | $0.3700/M output

Gemma 4 31B Instruct is Google DeepMind's 30.7B dense multimodal model supporting text and image input with text output. Features a 256K token context window, configurable thinking/reasoning mode, native function...

Apr 2026 | 262K context | Free input | Free output

Qwen 3.6 Plus builds on a hybrid architecture that combines efficient linear attention with sparse mixture-of-experts routing, enabling strong scalability and high-performance inference. Compared to the 3.5 series, it delivers...

Apr 2026 | 1M context | $0.3250/M input | $1.95/M output
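
The per-million-token prices in the listings above convert to a per-request cost by simple proportion. The following is a minimal Python sketch, not part of any provider API: the `request_cost` helper and the request size (200,000 input tokens, 10,000 output tokens) are hypothetical and chosen only for illustration. It compares the listed Opus 4.7 prices with those of its fast-mode variant and checks the "6x pricing" figure quoted in the fast-mode description.

```python
from decimal import Decimal


def request_cost(input_tokens: int, output_tokens: int,
                 input_price_per_m: str, output_price_per_m: str) -> Decimal:
    """Return the USD cost of one request given per-million-token prices.

    Hypothetical helper for illustration; prices are passed as strings so
    that Decimal keeps them exact.
    """
    million = Decimal(1_000_000)
    total = (Decimal(input_tokens) * Decimal(input_price_per_m)
             + Decimal(output_tokens) * Decimal(output_price_per_m))
    return total / million


# Prices copied from the listings above (USD per million tokens).
OPUS_47 = ("5.00", "25.00")
OPUS_47_FAST = ("30.00", "150.00")

# Assumed request size, for illustration only.
INPUT_TOKENS = 200_000
OUTPUT_TOKENS = 10_000

standard = request_cost(INPUT_TOKENS, OUTPUT_TOKENS, *OPUS_47)
fast = request_cost(INPUT_TOKENS, OUTPUT_TOKENS, *OPUS_47_FAST)

print(f"standard: ${standard}")
print(f"fast:     ${fast}")
print(f"ratio:    {fast / standard}")
```

Under these assumed token counts the standard request costs $1.25 and the fast-mode request $7.50, a ratio of exactly 6. Because both the input and output prices are scaled by the same factor, the ratio does not depend on the mix of input and output tokens.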