AI Models

357 モデル Free & Paid Cập nhật: 38 minutes trước

GPT-5-Nano is the smallest and fastest variant in the GPT-5 system, optimized for developer tools, rapid interactions, and ultra-low latency environments. While limited in reasoning depth compared to its larger...

による |8月 2025 |400K コンテキスト |$0.0500/M入力 |$0.4000/M出力
400K トークン

gpt-oss-120b is an open-weight, 117B-parameter Mixture-of-Experts (教育省) language model from OpenAI designed for high-reasoning, agentic, and general-purpose production use cases. It activates 5.1B parameters per forward pass and is optimized...

による |8月 2025 |131K コンテキスト |$0.0390/M入力 |$0.1800/M出力
131K トークン

gpt-oss-120b is an open-weight, 117B-parameter Mixture-of-Experts (教育省) language model from OpenAI designed for high-reasoning, agentic, and general-purpose production use cases. It activates 5.1B parameters per forward pass and is optimized...

による |8月 2025 |131K コンテキスト |Miễn phí input |Miễn phí output
131K トークン

gpt-oss-20b is an open-weight 21B parameter model released by OpenAI under the Apache 2.0 ライセンス. It uses a Mixture-of-Experts (教育省) architecture with 3.6B active parameters per forward pass, optimized for...

による |8月 2025 |131K コンテキスト |Miễn phí input |Miễn phí output
131K トークン

gpt-oss-20b is an open-weight 21B parameter model released by OpenAI under the Apache 2.0 ライセンス. It uses a Mixture-of-Experts (教育省) architecture with 3.6B active parameters per forward pass, optimized for...

による |8月 2025 |131K コンテキスト |$0.0300/M入力 |$0.1400/M出力
131K トークン

Claude Opus 4.1 is an updated version of Anthropic’s flagship model, offering improved performance in coding, 推論, and agentic tasks. It achieves 74.5% on SWE-bench Verified and shows notable gains...

による |8月 2025 |200K コンテキスト |$15.00/M入力 |$75.00/M出力
200K トークン

Mistral's cutting-edge language model for coding released end of July 2025. Codestral specializes in low-latency, high-frequency tasks such as fill-in-the-middle (FIM), code correction and test generation. [Blog Post](https://mistral.ai/news/codestral-25-08)

による |8月 2025 |256K コンテキスト |$0.3000/M入力 |$0.9000/M出力
256K トークン

Qwen3-Coder-30B-A3B-Instruct is a 30.5B parameter Mixture-of-Experts (教育省) model with 128 experts (8 active per forward pass), designed for advanced code generation, repository-scale understanding, and agentic tool use. Built on the...

による |7月 2025 |160K コンテキスト |$0.0700/M入力 |$0.2700/M出力
160K トークン

Qwen3-30B-A3B-Instruct-2507 is a 30.5B-parameter mixture-of-experts language model from Qwen, with 3.3B active parameters per inference. It operates in non-thinking mode and is designed for high-quality instruction following, multilingual understanding, and...

による |7月 2025 |262K コンテキスト |$0.0900/M入力 |$0.3000/M出力
262K トークン

GLM-4.5 is our latest flagship foundation model, purpose-built for agent-based applications. It leverages a Mixture-of-Experts (教育省) architecture and supports a context length of up to 128k tokens. GLM-4.5 delivers significantly...

による |7月 2025 |131K コンテキスト |$0.6000/M入力 |$2.20/M出力
131K トークン

GLM-4.5-Air is the lightweight variant of our latest flagship model family, also purpose-built for agent-centric applications. Like GLM-4.5, it adopts the Mixture-of-Experts (教育省) architecture but with a more compact parameter...

による |7月 2025 |131K コンテキスト |Miễn phí input |Miễn phí output
131K トークン

GLM-4.5-Air is the lightweight variant of our latest flagship model family, also purpose-built for agent-centric applications. Like GLM-4.5, it adopts the Mixture-of-Experts (教育省) architecture but with a more compact parameter...

による |7月 2025 |131K コンテキスト |$0.1300/M入力 |$0.8500/M出力
131K トークン

Qwen3-235B-A22B-Thinking-2507 is a high-performance, open-weight Mixture-of-Experts (教育省) language model optimized for complex reasoning tasks. It activates 22B of its 235B parameters per forward pass and natively supports up to 262,144...

による |7月 2025 |262K コンテキスト |$0.1495/M入力 |$1.50/M出力
262K トークン

GLM 4 32B is a cost-effective foundation language model. It can efficiently perform complex tasks and has significantly enhanced capabilities in tool use, online search, and code-related intelligent tasks. It...

による |7月 2025 |128K コンテキスト |$0.1000/M入力 |$0.1000/M出力
128K トークン

Qwen3-Coder-480B-A35B-Instruct is a Mixture-of-Experts (教育省) code generation model developed by the Qwen team. It is optimized for agentic coding tasks such as function calling, ツールの使用, and long-context reasoning over...

による |7月 2025 |1M コンテキスト |Miễn phí input |Miễn phí output
1M トークン

Qwen3-Coder-480B-A35B-Instruct is a Mixture-of-Experts (教育省) code generation model developed by the Qwen team. It is optimized for agentic coding tasks such as function calling, ツールの使用, and long-context reasoning over...

による |7月 2025 |1M コンテキスト |$0.2200/M入力 |$1.80/M出力
1M トークン

UI-TARS-1.5 is a multimodal vision-language agent optimized for GUI-based environments, including desktop interfaces, web browsers, mobile systems, and games. Built by ByteDance, it builds upon the UI-TARS framework with reinforcement...

による |7月 2025 |128K コンテキスト |$0.1000/M入力 |$0.2000/M出力
128K トークン

ジェミニ 2.5 Flash-Lite is a lightweight reasoning model in the Gemini 2.5 家族, optimized for ultra-low latency and cost efficiency. It offers improved throughput, faster token generation, and better performance...

による |7月 2025 |1M コンテキスト |$0.1000/M入力 |$0.4000/M出力
1M トークン

Qwen3-235B-A22B-Instruct-2507 is a multilingual, instruction-tuned mixture-of-experts language model based on the Qwen3-235B architecture, with 22B active parameters per forward pass. It is optimized for general-purpose text generation, including instruction following,...

による |7月 2025 |262K コンテキスト |$0.0710/M入力 |$0.1000/M出力
262K トークン

Switchpoint AI's router instantly analyzes your request and directs it to the optimal AI from an ever-evolving library. As the world of LLMs advances, our router gets smarter, ensuring you...

による |7月 2025 |131K コンテキスト |$0.8500/M入力 |$3.40/M出力
131K トークン

Kimi K2 Instruct is a large-scale Mixture-of-Experts (教育省) language model developed by Moonshot AI, featuring 1 trillion total parameters with 32 billion active per forward pass. It is optimized for...

による |7月 2025 |131K コンテキスト |$0.5700/M入力 |$2.30/M出力
131K トークン

Devstral Medium is a high-performance code generation and agentic reasoning model developed jointly by Mistral AI and All Hands AI. Positioned as a step up from Devstral Small, it achieves...

による |7月 2025 |131K コンテキスト |$0.4000/M入力 |$2.00/M出力
131K トークン

Devstral Small 1.1 is a 24B parameter open-weight language model for software engineering agents, developed by Mistral AI in collaboration with All Hands AI. Finetuned from Mistral Small 3.1 and...

による |7月 2025 |131K コンテキスト |$0.1000/M入力 |$0.3000/M出力
131K トークン

Venice Uncensored Dolphin Mistral 24B Venice Edition is a fine-tuned variant of Mistral-Small-24B-Instruct-2501, developed by dphn.ai in collaboration with Venice.ai. This model is designed as an “uncensored” instruct-tuned LLM, preserving...

による |7月 2025 |33K コンテキスト |Miễn phí input |Miễn phí output

Hunyuan-A13B is a 13B active parameter Mixture-of-Experts (教育省) language model developed by Tencent, with a total parameter count of 80B and support for reasoning via Chain-of-Thought. It offers competitive benchmark...

による |7月 2025 |131K コンテキスト |$0.1400/M入力 |$0.5700/M出力
131K トークン

Morph's high-accuracy apply model for complex code edits. ~4,500 tokens/sec with 98% accuracy for precise code transformations. The model requires the prompt to be in the following format: {instruction} {initial_code}...

による |7月 2025 |262K コンテキスト |$0.9000/M入力 |$1.90/M出力
262K トークン

Morph's fastest apply model for code edits. ~10,500 tokens/sec with 96% accuracy for rapid code transformations. The model requires the prompt to be in the following format: {instruction} {initial_code} {edit_snippet}...

による |7月 2025 |82K コンテキスト |$0.8000/M入力 |$1.20/M出力
82K トークン

ERNIE-4.5-VL-424B-A47B is a multimodal Mixture-of-Experts (教育省) model from Baidu’s ERNIE 4.5 series, featuring 424B total parameters with 47B active per token. It is trained jointly on text and image data...

による |Jun 2025 |131K コンテキスト |$0.4200/M入力 |$1.25/M出力
131K トークン

ERNIE-4.5-300B-A47B is a 300B parameter Mixture-of-Experts (教育省) language model developed by Baidu as part of the ERNIE 4.5 series. It activates 47B parameters per token and supports text generation in...

による |Jun 2025 |131K コンテキスト |$0.2800/M入力 |$1.10/M出力
131K トークン

Mistral-Small-3.2-24B-Instruct-2506 is an updated 24B parameter model from Mistral optimized for instruction following, repetition reduction, and improved function calling. Compared to the 3.1 release, バージョン 3.2 significantly improves accuracy on...

による |Jun 2025 |128K コンテキスト |$0.0750/M入力 |$0.2000/M出力
128K トークン

MiniMax-M1 is a large-scale, open-weight reasoning model designed for extended context and high-efficiency inference. It leverages a hybrid Mixture-of-Experts (教育省) architecture paired with a custom "lightning attention" mechanism, allowing it...

による |Jun 2025 |1M コンテキスト |$0.4000/M入力 |$2.20/M出力
1M トークン

ジェミニ 2.5 Flash is Google's state-of-the-art workhorse model, specifically designed for advanced reasoning, コーディング, mathematics, and scientific tasks. It includes built-in "thinking" capabilities, enabling it to provide responses with greater...

による |Jun 2025 |1M コンテキスト |$0.3000/M入力 |$2.50/M出力
1M トークン

ジェミニ 2.5 Pro is Google’s state-of-the-art AI model designed for advanced reasoning, コーディング, mathematics, and scientific tasks. It employs “thinking” capabilities, enabling it to reason through responses with enhanced accuracy...

による |Jun 2025 |1M コンテキスト |$1.25/M入力 |$10.00/M出力
1M トークン

The o-series of models are trained with reinforcement learning to think before they answer and perform complex reasoning. The o3-pro model uses more compute to think harder and provide consistently...

による |Jun 2025 |200K コンテキスト |$20.00/M入力 |$80.00/M出力
200K トークン

ジェミニ 2.5 Pro is Google’s state-of-the-art AI model designed for advanced reasoning, コーディング, mathematics, and scientific tasks. It employs “thinking” capabilities, enabling it to reason through responses with enhanced accuracy...

による |Jun 2025 |1M コンテキスト |$1.25/M入力 |$10.00/M出力
1M トークン

May 28th update to the [original DeepSeek R1](/deepseek/deepseek-r1) Performance on par with [OpenAI o1](/openai/o1), but open-sourced and with fully open reasoning tokens. It's 671B parameters in size, with 37B active...

による |5月 2025 |164K コンテキスト |$0.5000/M入力 |$2.15/M出力
164K トークン

Claude Opus 4 is benchmarked as the world’s best coding model, at time of release, bringing sustained performance on complex, long-running tasks and agent workflows. It sets new benchmarks in...

による |5月 2025 |200K コンテキスト |$15.00/M入力 |$75.00/M出力
200K トークン

Claude Sonnet 4 significantly enhances the capabilities of its predecessor, Sonnet 3.7, excelling in both coding and reasoning tasks with improved precision and controllability. Achieving state-of-the-art performance on SWE-bench (72.7%),...

による |5月 2025 |1M コンテキスト |$3.00/M入力 |$15.00/M出力
1M トークン

Gemma 3n E4B-it is optimized for efficient execution on mobile and low-resource devices, such as phones, laptops, and tablets. It supports multimodal inputs—including text, visual data, and audio—enabling diverse tasks...

による |5月 2025 |33K コンテキスト |$0.0600/M入力 |$0.1200/M出力

Mistral Medium 3 is a high-performance enterprise-grade language model designed to deliver frontier-level capabilities at significantly reduced operational cost. It balances state-of-the-art reasoning and multimodal performance with 8× lower cost...

による |5月 2025 |131K コンテキスト |$0.4000/M入力 |$2.00/M出力
131K トークン

ジェミニ 2.5 Pro is Google’s state-of-the-art AI model designed for advanced reasoning, コーディング, mathematics, and scientific tasks. It employs “thinking” capabilities, enabling it to reason through responses with enhanced accuracy...

による |5月 2025 |1M コンテキスト |$1.25/M入力 |$10.00/M出力
1M トークン

Spotlight is a 7‑billion‑parameter vision‑language model derived from Qwen 2.5‑VL and fine‑tuned by Arcee AI for tight image‑text grounding tasks. It offers a 32 k‑token context window, enabling rich multimodal...

による |5月 2025 |131K コンテキスト |$0.1800/M入力 |$0.1800/M出力
131K トークン

Maestro Reasoning is Arcee's flagship analysis model: ある 32 B‑parameter derivative of Qwen 2.5‑32 B tuned with DPO and chain‑of‑thought RL for step‑by‑step logic. Compared to the earlier 7 B...

による |5月 2025 |131K コンテキスト |$0.9000/M入力 |$3.30/M出力
131K トークン

Virtuoso‑Large is Arcee's top‑tier general‑purpose LLM at 72 Bパラメータ, tuned to tackle cross‑domain reasoning, creative writing and enterprise QA. Unlike many 70 B peers, it retains the 128 k...

による |5月 2025 |131K コンテキスト |$0.7500/M入力 |$1.20/M出力
131K トークン

Coder‑Large is a 32 B‑parameter offspring of Qwen 2.5‑Instruct that has been further trained on permissively‑licensed GitHub, CodeSearchNet and synthetic bug‑fix corpora. It supports a 32k context window, enabling multi‑file...

による |5月 2025 |33K コンテキスト |$0.5000/M入力 |$0.8000/M出力

Llama Guard 4 is a Llama 4 Scout-derived multimodal pretrained model, fine-tuned for content safety classification. Similar to previous versions, it can be used to classify content in both LLM...

による |4月 2025 |164K コンテキスト |$0.1800/M入力 |$0.1800/M出力
164K トークン

Qwen3, the latest generation in the Qwen large language model series, features both dense and mixture-of-experts (教育省) architectures to excel in reasoning, multilingual support, and advanced agent tasks. Its unique...

による |4月 2025 |131K コンテキスト |$0.0900/M入力 |$0.4500/M出力
131K トークン

Qwen3-8B is a dense 8.2B parameter causal language model from the Qwen3 series, designed for both reasoning-heavy tasks and efficient dialogue. It supports seamless switching between "thinking" mode for math,...

による |4月 2025 |131K コンテキスト |$0.0500/M入力 |$0.4000/M出力
131K トークン

Qwen3-14B is a dense 14.8B parameter causal language model from the Qwen3 series, designed for both complex reasoning and efficient dialogue. It supports seamless switching between a "thinking" mode for...

による |4月 2025 |132K コンテキスト |$0.1000/M入力 |$0.2400/M出力
132K トークン

Qwen3-32B is a dense 32.8B parameter causal language model from the Qwen3 series, optimized for both complex reasoning and efficient dialogue. It supports seamless switching between a "thinking" mode for...

による |4月 2025 |131K コンテキスト |$0.0800/M入力 |$0.2800/M出力
131K トークン