AI Models

340 models 무료 & Paid Cập nhật: 6 hours trước

오픈AI: GPT-5.1 Chat

GPT-5.1 Chat (AKA Instant is the fast, lightweight member of the 5.1 가족, optimized for low-latency chat while retaining strong general intelligence. It uses adaptive reasoning to selectively “think” on...

~에 의해 개방하다 |11 월 2025 |128K context |$1.25/M input |$10.00/M output

128K tokens ⓘ

오픈AI: GPT-5.1-Codex

GPT-5.1-Codex is a specialized version of GPT-5.1 optimized for software engineering and coding workflows. It is designed for both interactive development sessions and long, independent execution of complex engineering tasks....

~에 의해 개방하다 |11 월 2025 |400K context |$1.25/M input |$10.00/M output

400K tokens ⓘ

오픈AI: GPT-5.1-Codex-Mini

GPT-5.1-Codex-Mini is a smaller and faster version of GPT-5.1-Codex

~에 의해 개방하다 |11 월 2025 |400K context |$0.2500/M input |$2.00/M output

400K tokens ⓘ

MoonshotAI: Kimi K2 Thinking

Kimi K2 Thinking is Moonshot AI’s most advanced open reasoning model to date, extending the K2 series into agentic, long-horizon reasoning. Built on the trillion-parameter Mixture-of-Experts (MoE) architecture introduced in...

~에 의해 moonshotai |11 월 2025 |262K context |$0.6000/M input |$2.50/M output

262K tokens ⓘ

Amazon: Nova Premier 1.0

Amazon Nova Premier is the most capable of Amazon’s multimodal models for complex reasoning tasks and for use as the best teacher for distilling custom models.

~에 의해 amazon |Oct 2025 |1M context |$2.50/M input |$12.50/M output

1M tokens ⓘ

Perplexity: Sonar Pro Search

Exclusively available on the OpenRouter API, Sonar Pro's new Pro Search mode is Perplexity's most advanced agentic search system. It is designed for deeper reasoning and analysis. Pricing is based...

~에 의해 perplexity |Oct 2025 |200K context |$3.00/M input |$15.00/M output

200K tokens ⓘ

미스트랄: Voxtral Small 24B 2507

Voxtral Small is an enhancement of Mistral Small 3, incorporating state-of-the-art audio input capabilities while retaining best-in-class text performance. It excels at speech transcription, translation and audio understanding. Input audio...

~에 의해 미스트랄 |Oct 2025 |32K context |$0.1000/M input |$0.3000/M output

오픈AI: gpt-oss-safeguard-20b

gpt-oss-safeguard-20b is a safety reasoning model from OpenAI built upon gpt-oss-20b. This open-weight, 21B-parameter Mixture-of-Experts (MoE) model offers lower latency for safety tasks like content classification, LLM filtering, and trust...

~에 의해 개방하다 |Oct 2025 |131K context |$0.0750/M input |$0.3000/M output

131K tokens ⓘ

엔비디아: Nemotron Nano 12B 2 VL (free)

NVIDIA Nemotron Nano 2 VL is a 12-billion-parameter open multimodal reasoning model designed for video understanding and document intelligence. It introduces a hybrid Transformer-Mamba architecture, combining transformer-level accuracy with Mamba’s...

~에 의해 엔비디아 |Oct 2025 |128K context |Miễn phí input |Miễn phí output

128K tokens ⓘ

MiniMax: MiniMax M2

MiniMax-M2 is a compact, high-efficiency large language model optimized for end-to-end coding and agentic workflows. With 10 billion activated parameters (230 billion total), it delivers near-frontier intelligence across general reasoning,...

~에 의해 미니맥스 |Oct 2025 |205K context |$0.2550/M input |$1.02/M output

205K tokens ⓘ

Qwen: Qwen3 VL 32B Instruct

Qwen3-VL-32B-Instruct is a large-scale multimodal vision-language model designed for high-precision understanding and reasoning across text, images, and video. With 32 billion parameters, it combines deep visual perception with advanced text...

~에 의해 qwen |Oct 2025 |262K context |$0.1040/M input |$0.4160/M output

262K tokens ⓘ

IBM: Granite 4.0 Micro

Granite-4.0-H-Micro is a 3B parameter from the Granite 4 family of models. These models are the latest in a series of models released by IBM. They are fine-tuned for long...

~에 의해 ibm-granite |Oct 2025 |131K context |$0.0170/M input |$0.1120/M output

131K tokens ⓘ

오픈AI: GPT-5 Image Mini

GPT-5 Image Mini combines OpenAI's advanced language capabilities, powered by [GPT-5 Mini](https://openrouter.ai/openai/gpt-5-mini), with GPT Image 1 Mini for efficient image generation. This natively multimodal model features superior instruction following, text...

~에 의해 개방하다 |Oct 2025 |400K context |$2.50/M input |$2.00/M output

400K tokens ⓘ

인류학: Claude Haiku 4.5

Claude Haiku 4.5 is Anthropic’s fastest and most efficient model, delivering near-frontier intelligence at a fraction of the cost and latency of larger Claude models. Matching Claude Sonnet 4’s performance...

~에 의해 인류의 |Oct 2025 |200K context |$1.00/M input |$5.00/M output

200K tokens ⓘ

Qwen: Qwen3 VL 8B Thinking

Qwen3-VL-8B-Thinking is the reasoning-optimized variant of the Qwen3-VL-8B multimodal model, designed for advanced visual and textual reasoning across complex scenes, documents, and temporal sequences. It integrates enhanced multimodal alignment and...

~에 의해 qwen |Oct 2025 |256K context |$0.1170/M input |$1.37/M output

256K tokens ⓘ

Qwen: Qwen3 VL 8B Instruct

Qwen3-VL-8B-Instruct is a multimodal vision-language model from the Qwen3-VL series, built for high-fidelity understanding and reasoning across text, images, and video. It features improved multimodal fusion with Interleaved-MRoPE for long-horizon...

~에 의해 qwen |Oct 2025 |256K context |$0.1170/M input |$0.4550/M output

256K tokens ⓘ

오픈AI: GPT-5 Image

[GPT-5](https://openrouter.ai/openai/gpt-5) Image combines OpenAI's GPT-5 model with state-of-the-art image generation capabilities. It offers major improvements in reasoning, code quality, and user experience while incorporating GPT Image 1's superior instruction following,...

~에 의해 개방하다 |Oct 2025 |400K context |$10.00/M input |$10.00/M output

400K tokens ⓘ

오픈AI: o3 Deep Research

o3-deep-research is OpenAI's advanced model for deep research, designed to tackle complex, multi-step research tasks. Note: This model always uses the 'web_search' tool which adds additional cost.

~에 의해 개방하다 |Oct 2025 |200K context |$10.00/M input |$40.00/M output

200K tokens ⓘ

오픈AI: o4 Mini Deep Research

o4-mini-deep-research is OpenAI's faster, more affordable deep research model—ideal for tackling complex, multi-step research tasks. Note: This model always uses the 'web_search' tool which adds additional cost.

~에 의해 개방하다 |Oct 2025 |200K context |$2.00/M input |$8.00/M output

200K tokens ⓘ

엔비디아: 야마 3.3 Nemotron Super 49B V1.5

Llama-3.3-Nemotron-Super-49B-v1.5 is a 49B-parameter, English-centric reasoning/chat model derived from Meta’s Llama-3.3-70B-Instruct with a 128K context. It’s post-trained for agentic workflows (RAG, tool calling) via SFT across math, code, science, and...

~에 의해 엔비디아 |Oct 2025 |131K context |$0.4000/M input |$0.4000/M output

131K tokens ⓘ

Google: Nano Banana (쌍둥이자리 2.5 Flash Image)

쌍둥이자리 2.5 Flash Image, a.k.a. "Nano Banana," is now generally available. It is a state of the art image generation model with contextual understanding. It is capable of image generation,...

~에 의해 google |Oct 2025 |33K context |$0.3000/M input |$2.50/M output

Qwen: Qwen3 VL 30B A3B Thinking

Qwen3-VL-30B-A3B-Thinking is a multimodal model that unifies strong text generation with visual understanding for images and videos. Its Thinking variant enhances reasoning in STEM, math, and complex tasks. It excels...

~에 의해 qwen |Oct 2025 |131K context |$0.1300/M input |$1.56/M output

131K tokens ⓘ

Qwen: Qwen3 VL 30B A3B Instruct

Qwen3-VL-30B-A3B-Instruct is a multimodal model that unifies strong text generation with visual understanding for images and videos. Its Instruct variant optimizes instruction-following for general multimodal tasks. It excels in perception...

~에 의해 qwen |Oct 2025 |262K context |$0.1300/M input |$0.5200/M output

262K tokens ⓘ

오픈AI: GPT-5 Pro

GPT-5 Pro is OpenAI’s most advanced model, offering major improvements in reasoning, code quality, and user experience. It is optimized for complex tasks that require step-by-step reasoning, instruction following, and...

~에 의해 개방하다 |Oct 2025 |400K context |$15.00/M input |$120.00/M output

400K tokens ⓘ

Z.ai: GLM 4.6

Compared with GLM-4.5, this generation brings several key improvements: Longer context window: The context window has been expanded from 128K to 200K tokens, enabling the model to handle more complex...

~에 의해 당신은 |9 월 2025 |203K context |$0.4300/M input |$1.74/M output

203K tokens ⓘ

인류학: Claude Sonnet 4.5

Claude Sonnet 4.5 is Anthropic’s most advanced Sonnet model to date, optimized for real-world agents and coding workflows. It delivers state-of-the-art performance on coding benchmarks such as SWE-bench Verified, with...

~에 의해 인류의 |9 월 2025 |1M context |$3.00/M input |$15.00/M output

1M tokens ⓘ

DeepSeek: DeepSeek V3.2 Exp

DeepSeek-V3.2-Exp is an experimental large language model released by DeepSeek as an intermediate step between V3.1 and future architectures. It introduces DeepSeek Sparse Attention (DSA), a fine-grained sparse attention mechanism...

~에 의해 깊은 탐색 |9 월 2025 |164K context |$0.2700/M input |$0.4100/M output

164K tokens ⓘ

TheDrummer: Cydonia 24B V4.1

Uncensored and creative writing model based on Mistral Small 3.2 24B with good recall, prompt adherence, and intelligence.

~에 의해 thedrummer |9 월 2025 |131K context |$0.3000/M input |$0.5000/M output

131K tokens ⓘ

Relace: Relace Apply 3

Relace Apply 3 is a specialized code-patching LLM that merges AI-suggested edits straight into your source files. It can apply updates from GPT-4o, 클로드, and others into your files at...

~에 의해 relace |9 월 2025 |256K context |$0.8500/M input |$1.25/M output

256K tokens ⓘ

Google: 쌍둥이자리 2.5 Flash Lite Preview 09-2025

쌍둥이자리 2.5 Flash-Lite is a lightweight reasoning model in the Gemini 2.5 가족, optimized for ultra-low latency and cost efficiency. It offers improved throughput, faster token generation, and better performance...

~에 의해 google |9 월 2025 |1M context |$0.1000/M input |$0.4000/M output

1M tokens ⓘ

Qwen: Qwen3 VL 235B A22B Thinking

Qwen3-VL-235B-A22B Thinking is a multimodal model that unifies strong text generation with visual understanding across images and video. The Thinking model is optimized for multimodal reasoning in STEM and math....

~에 의해 qwen |9 월 2025 |131K context |$0.2600/M input |$2.60/M output

131K tokens ⓘ

Qwen: Qwen3 VL 235B A22B Instruct

Qwen3-VL-235B-A22B Instruct is an open-weight multimodal model that unifies strong text generation with visual understanding across images and video. The Instruct model targets general vision-language use (VQA, document parsing, chart/table...

~에 의해 qwen |9 월 2025 |262K context |$0.2000/M input |$0.8800/M output

262K tokens ⓘ

Qwen: Qwen3 Max

Qwen3-Max is an updated release built on the Qwen3 series, offering major improvements in reasoning, instruction following, multilingual support, and long-tail knowledge coverage compared to the January 2025 버전. It...

~에 의해 qwen |9 월 2025 |262K context |$0.7800/M input |$3.90/M output

262K tokens ⓘ

Qwen: Qwen3 Coder Plus

Qwen3 Coder Plus is Alibaba's proprietary version of the Open Source Qwen3 Coder 480B A35B. It is a powerful coding agent model specializing in autonomous programming via tool calling and...

~에 의해 qwen |9 월 2025 |1M context |$0.6500/M input |$3.25/M output

1M tokens ⓘ

오픈AI: GPT-5 Codex

GPT-5-Codex is a specialized version of GPT-5 optimized for software engineering and coding workflows. It is designed for both interactive development sessions and long, independent execution of complex engineering tasks....

~에 의해 개방하다 |9 월 2025 |400K context |$1.25/M input |$10.00/M output

400K tokens ⓘ

DeepSeek: DeepSeek V3.1 Terminus

DeepSeek-V3.1 Terminus is an update to [DeepSeek V3.1](/deepseek/deepseek-chat-v3.1) that maintains the model's original capabilities while addressing issues reported by users, including language consistency and agent capabilities, further optimizing the model's...

~에 의해 깊은 탐색 |9 월 2025 |164K context |$0.2700/M input |$0.9500/M output

164K tokens ⓘ

Qwen: Qwen3 Coder Flash

Qwen3 Coder Flash is Alibaba's fast and cost efficient version of their proprietary Qwen3 Coder Plus. It is a powerful coding agent model specializing in autonomous programming via tool calling...

~에 의해 qwen |9 월 2025 |1M context |$0.1950/M input |$0.9750/M output

1M tokens ⓘ

Qwen: Qwen3 Next 80B A3B Thinking

Qwen3-Next-80B-A3B-Thinking is a reasoning-first chat model in the Qwen3-Next line that outputs structured “thinking” traces by default. It’s designed for hard multi-step problems; math proofs, code synthesis/debugging, logic, and agentic...

~에 의해 qwen |9 월 2025 |262K context |$0.0975/M input |$0.7800/M output

262K tokens ⓘ

Qwen: Qwen3 Next 80B A3B Instruct (free)

Qwen3-Next-80B-A3B-Instruct is an instruction-tuned chat model in the Qwen3-Next series optimized for fast, stable responses without “thinking” traces. It targets complex tasks across reasoning, code generation, knowledge QA, and multilingual...

~에 의해 qwen |9 월 2025 |262K context |Miễn phí input |Miễn phí output

262K tokens ⓘ

Qwen: Qwen3 Next 80B A3B Instruct

~에 의해 qwen |9 월 2025 |262K context |$0.0900/M input |$1.10/M output

262K tokens ⓘ

Qwen: Qwen Plus 0728 (thinking)

Qwen Plus 0728, based on the Qwen3 foundation model, is a 1 million context hybrid reasoning model with a balanced performance, speed, and cost combination.

~에 의해 qwen |9 월 2025 |1M context |$0.2600/M input |$0.7800/M output

1M tokens ⓘ

Qwen: Qwen Plus 0728

Qwen Plus 0728, based on the Qwen3 foundation model, is a 1 million context hybrid reasoning model with a balanced performance, speed, and cost combination.

~에 의해 qwen |9 월 2025 |1M context |$0.2600/M input |$0.7800/M output

1M tokens ⓘ

엔비디아: Nemotron Nano 9B V2 (free)

NVIDIA-Nemotron-Nano-9B-v2 is a large language model (LLM) trained from scratch by NVIDIA, and designed as a unified model for both reasoning and non-reasoning tasks. It responds to user queries and...

~에 의해 엔비디아 |9 월 2025 |128K context |Miễn phí input |Miễn phí output

128K tokens ⓘ

MoonshotAI: Kimi K2 0905

Kimi K2 0905 is the September update of [Kimi K2 0711](moonshotai/kimi-k2). It is a large-scale Mixture-of-Experts (MoE) language model developed by Moonshot AI, featuring 1 trillion total parameters with 32...

~에 의해 moonshotai |9 월 2025 |262K context |$0.6000/M input |$2.50/M output

262K tokens ⓘ

Qwen: Qwen3 30B A3B Thinking 2507

Qwen3-30B-A3B-Thinking-2507 is a 30B parameter Mixture-of-Experts reasoning model optimized for complex tasks requiring extended multi-step thinking. The model is designed specifically for “thinking mode,” where internal reasoning traces are separated...

~에 의해 qwen |8월 2025 |131K context |$0.1300/M input |$1.56/M output

131K tokens ⓘ

Nous: Hermes 4 70비

Hermes 4 70B is a hybrid reasoning model from Nous Research, built on Meta-Llama-3.1-70B. It introduces the same hybrid mode as the larger 405B release, allowing the model to either...

~에 의해 nousresearch |8월 2025 |131K context |$0.1300/M input |$0.4000/M output

131K tokens ⓘ

Nous: Hermes 4 405비

Hermes 4 is a large-scale reasoning model built on Meta-Llama-3.1-405B and released by Nous Research. It introduces a hybrid reasoning mode, where the model can choose to deliberate internally with...

~에 의해 nousresearch |8월 2025 |131K context |$1.00/M input |$3.00/M output

131K tokens ⓘ

DeepSeek: DeepSeek V3.1

DeepSeek-V3.1 is a large hybrid reasoning model (671B parameters, 37B active) that supports both thinking and non-thinking modes via prompt templates. It extends the DeepSeek-V3 base with a two-phase long-context...

~에 의해 깊은 탐색 |8월 2025 |164K context |$0.2100/M input |$0.7900/M output

164K tokens ⓘ

미스트랄: Mistral Medium 3.1

Mistral Medium 3.1 is an updated version of Mistral Medium 3, which is a high-performance enterprise-grade language model designed to deliver frontier-level capabilities at significantly reduced operational cost. It balances...

~에 의해 미스트랄 |8월 2025 |131K context |$0.4000/M input |$2.00/M output

131K tokens ⓘ

Z.ai: GLM 4.5V

GLM-4.5V is a vision-language foundation model for multimodal agent applications. Built on a Mixture-of-Experts (MoE) architecture with 106B parameters and 12B activated parameters, it achieves state-of-the-art results in video understanding,...

~에 의해 당신은 |8월 2025 |66K context |$0.6000/M input |$1.80/M output

66K tokens ⓘ

AI Models

계정

🔑 Lấy lại mật khẩu