AI Models

340 Models | Free & Paid | Updated: 7 hours ago

AI21: Jamba Large 1.7

Jamba Large 1.7 is the latest model in the Jamba open family, offering improvements in grounding, instruction following, and overall efficiency. Built on a hybrid SSM-Transformer architecture with a 256K context...

by ai21 | Aug 2025 | 256K context | $2.00/M input | $8.00/M output

OpenAI: GPT-5 Chat

GPT-5 Chat is designed for advanced, natural, multimodal, and context-aware conversations for enterprise applications.

by openai | Aug 2025 | 128K context | $1.25/M input | $10.00/M output

OpenAI: GPT-5

GPT-5 is OpenAI’s most advanced model, offering major improvements in reasoning, code quality, and user experience. It is optimized for complex tasks that require step-by-step reasoning, instruction following, and accuracy...

by openai | Aug 2025 | 400K context | $1.25/M input | $10.00/M output

OpenAI: GPT-5 Mini

GPT-5 Mini is a compact version of GPT-5, designed to handle lighter-weight reasoning tasks. It provides the same instruction-following and safety-tuning benefits as GPT-5, but with reduced latency and cost....

by openai | Aug 2025 | 400K context | $0.2500/M input | $2.00/M output

OpenAI: GPT-5 Nano

GPT-5-Nano is the smallest and fastest variant in the GPT-5 system, optimized for developer tools, rapid interactions, and ultra-low latency environments. While limited in reasoning depth compared to its larger...

by openai | Aug 2025 | 400K context | $0.0500/M input | $0.4000/M output

OpenAI: gpt-oss-120b (free)

gpt-oss-120b is an open-weight, 117B-parameter Mixture-of-Experts (MoE) language model from OpenAI designed for high-reasoning, agentic, and general-purpose production use cases. It activates 5.1B parameters per forward pass and is optimized...

by openai | Aug 2025 | 131K context | free input | free output

OpenAI: gpt-oss-120b

by openai | Aug 2025 | 131K context | $0.0300/M input | $0.1500/M output

OpenAI: gpt-oss-20b

gpt-oss-20b is an open-weight 21B parameter model released by OpenAI under the Apache 2.0 license. It uses a Mixture-of-Experts (MoE) architecture with 3.6B active parameters per forward pass, optimized for...

by openai | Aug 2025 | 131K context | $0.0290/M input | $0.1400/M output

OpenAI: gpt-oss-20b (free)

by openai | Aug 2025 | 131K context | free input | free output

Anthropic: Claude Opus 4.1

Claude Opus 4.1 is an updated version of Anthropic’s flagship model, offering improved performance in coding, reasoning, and agentic tasks. It achieves 74.5% on SWE-bench Verified and shows notable gains...

by anthropic | Aug 2025 | 200K context | $15.00/M input | $75.00/M output

Mistral: Codestral 2508

Mistral's cutting-edge language model for coding, released at the end of July 2025. Codestral specializes in low-latency, high-frequency tasks such as fill-in-the-middle (FIM), code correction, and test generation. [Blog Post](https://mistral.ai/news/codestral-25-08)

by mistralai | Aug 2025 | 256K context | $0.3000/M input | $0.9000/M output

Qwen: Qwen3 Coder 30B A3B Instruct

Qwen3-Coder-30B-A3B-Instruct is a 30.5B parameter Mixture-of-Experts (MoE) model with 128 experts (8 active per forward pass), designed for advanced code generation, repository-scale understanding, and agentic tool use. Built on the...

by qwen | Jul 2025 | 160K context | $0.0700/M input | $0.2700/M output

Qwen: Qwen3 30B A3B Instruct 2507

Qwen3-30B-A3B-Instruct-2507 is a 30.5B-parameter mixture-of-experts language model from Qwen, with 3.3B active parameters per inference. It operates in non-thinking mode and is designed for high-quality instruction following, multilingual understanding, and...

by qwen | Jul 2025 | 131K context | $0.0482/M input | $0.1931/M output

Z.ai: GLM 4.5

GLM-4.5 is our latest flagship foundation model, purpose-built for agent-based applications. It leverages a Mixture-of-Experts (MoE) architecture and supports a context length of up to 128k tokens. GLM-4.5 delivers significantly...

by z-ai | Jul 2025 | 131K context | $0.6000/M input | $2.20/M output

Z.ai: GLM 4.5 Air

GLM-4.5-Air is the lightweight variant of our latest flagship model family, also purpose-built for agent-centric applications. Like GLM-4.5, it adopts the Mixture-of-Experts (MoE) architecture but with a more compact parameter...

by z-ai | Jul 2025 | 131K context | $0.1300/M input | $0.8500/M output

Qwen: Qwen3 235B A22B Thinking 2507

Qwen3-235B-A22B-Thinking-2507 is a high-performance, open-weight Mixture-of-Experts (MoE) language model optimized for complex reasoning tasks. It activates 22B of its 235B parameters per forward pass and natively supports up to 262,144...

by qwen | Jul 2025 | 262K context | $0.1495/M input | $1.50/M output

Qwen: Qwen3 Coder 480B A35B (free)

Qwen3-Coder-480B-A35B-Instruct is a Mixture-of-Experts (MoE) code generation model developed by the Qwen team. It is optimized for agentic coding tasks such as function calling, tool use, and long-context reasoning over...

by qwen | Jul 2025 | 1M context | free input | free output

Qwen: Qwen3 Coder 480B A35B

by qwen | Jul 2025 | 1M context | $0.2200/M input | $1.80/M output

ByteDance: UI-TARS 7B

UI-TARS-1.5 is a multimodal vision-language agent optimized for GUI-based environments, including desktop interfaces, web browsers, mobile systems, and games. Built by ByteDance, it builds upon the UI-TARS framework with reinforcement...

by bytedance | Jul 2025 | 128K context | $0.1000/M input | $0.2000/M output

Google: Gemini 2.5 Flash Lite

Gemini 2.5 Flash-Lite is a lightweight reasoning model in the Gemini 2.5 family, optimized for ultra-low latency and cost efficiency. It offers improved throughput, faster token generation, and better performance...

by google | Jul 2025 | 1M context | $0.1000/M input | $0.4000/M output

Qwen: Qwen3 235B A22B Instruct 2507

Qwen3-235B-A22B-Instruct-2507 is a multilingual, instruction-tuned mixture-of-experts language model based on the Qwen3-235B architecture, with 22B active parameters per forward pass. It is optimized for general-purpose text generation, including instruction following,...

by qwen | Jul 2025 | 262K context | $0.0900/M input | $0.1000/M output

Switchpoint Router

Switchpoint AI's router instantly analyzes your request and directs it to the optimal AI from an ever-evolving library. As the world of LLMs advances, our router gets smarter, ensuring you...

by switchpoint | Jul 2025 | 131K context | $0.8500/M input | $3.40/M output

MoonshotAI: Kimi K2 0711

Kimi K2 Instruct is a large-scale Mixture-of-Experts (MoE) language model developed by Moonshot AI, featuring 1 trillion total parameters with 32 billion active per forward pass. It is optimized for...

by moonshotai | Jul 2025 | 131K context | $0.5700/M input | $2.30/M output

Venice: Uncensored (free)

Venice Uncensored Dolphin Mistral 24B Venice Edition is a fine-tuned variant of Mistral-Small-24B-Instruct-2501, developed by dphn.ai in collaboration with Venice.ai. This model is designed as an “uncensored” instruct-tuned LLM, preserving...

by cognitivecomputations | Jul 2025 | 33K context | free input | free output

Tencent: Hunyuan A13B Instruct

Hunyuan-A13B is a 13B active parameter Mixture-of-Experts (MoE) language model developed by Tencent, with a total parameter count of 80B and support for reasoning via Chain-of-Thought. It offers competitive benchmark...

by tencent | Jul 2025 | 131K context | $0.1400/M input | $0.5700/M output

Morph: Morph V3 Large

Morph's high-accuracy apply model for complex code edits. ~4,500 tokens/sec with 98% accuracy for precise code transformations. The model requires the prompt to be in the following format: {instruction} {initial_code}...

by morph | Jul 2025 | 262K context | $0.9000/M input | $1.90/M output

Morph: Morph V3 Fast

Morph's fastest apply model for code edits. ~10,500 tokens/sec with 96% accuracy for rapid code transformations. The model requires the prompt to be in the following format: {instruction} {initial_code} {edit_snippet}...

by morph | Jul 2025 | 82K context | $0.8000/M input | $1.20/M output

Baidu: ERNIE 4.5 VL 424B A47B

ERNIE-4.5-VL-424B-A47B is a multimodal Mixture-of-Experts (MoE) model from Baidu’s ERNIE 4.5 series, featuring 424B total parameters with 47B active per token. It is trained jointly on text and image data...

by baidu | Jun 2025 | 131K context | $0.4200/M input | $1.25/M output

Mistral: Mistral Small 3.2 24B

Mistral-Small-3.2-24B-Instruct-2506 is an updated 24B parameter model from Mistral optimized for instruction following, repetition reduction, and improved function calling. Compared to the 3.1 release, version 3.2 significantly improves accuracy on...

by mistralai | Jun 2025 | 128K context | $0.0750/M input | $0.2000/M output

MiniMax: MiniMax M1

MiniMax-M1 is a large-scale, open-weight reasoning model designed for extended context and high-efficiency inference. It leverages a hybrid Mixture-of-Experts (MoE) architecture paired with a custom "lightning attention" mechanism, allowing it...

by minimax | Jun 2025 | 1M context | $0.4000/M input | $2.20/M output

Google: Gemini 2.5 Flash

Gemini 2.5 Flash is Google's state-of-the-art workhorse model, specifically designed for advanced reasoning, coding, mathematics, and scientific tasks. It includes built-in "thinking" capabilities, enabling it to provide responses with greater...

by google | Jun 2025 | 1M context | $0.3000/M input | $2.50/M output

Google: Gemini 2.5 Pro

Gemini 2.5 Pro is Google’s state-of-the-art AI model designed for advanced reasoning, coding, mathematics, and scientific tasks. It employs “thinking” capabilities, enabling it to reason through responses with enhanced accuracy...

by google | Jun 2025 | 1M context | $1.25/M input | $10.00/M output

OpenAI: o3 Pro

The o-series of models are trained with reinforcement learning to think before they answer and perform complex reasoning. The o3-pro model uses more compute to think harder and provide consistently...

by openai | Jun 2025 | 200K context | $20.00/M input | $80.00/M output

Google: Gemini 2.5 Pro Preview 06-05

by google | Jun 2025 | 1M context | $1.25/M input | $10.00/M output

DeepSeek: R1 0528

May 28th update to the [original DeepSeek R1](/deepseek/deepseek-r1). Performance on par with [OpenAI o1](/openai/o1), but open-sourced and with fully open reasoning tokens. It's 671B parameters in size, with 37B active...

by deepseek | May 2025 | 164K context | $0.5000/M input | $2.15/M output

Anthropic: Claude Opus 4

Claude Opus 4 is benchmarked as the world’s best coding model at time of release, bringing sustained performance on complex, long-running tasks and agent workflows. It sets new benchmarks in...

by anthropic | May 2025 | 200K context | $15.00/M input | $75.00/M output

Anthropic: Claude Sonnet 4

Claude Sonnet 4 significantly enhances the capabilities of its predecessor, Sonnet 3.7, excelling in both coding and reasoning tasks with improved precision and controllability. Achieving state-of-the-art performance on SWE-bench (72.7%),...

by anthropic | May 2025 | 1M context | $3.00/M input | $15.00/M output

Google: Gemma 3n 4B

Gemma 3n E4B-it is optimized for efficient execution on mobile and low-resource devices, such as phones, laptops, and tablets. It supports multimodal inputs—including text, visual data, and audio—enabling diverse tasks...

by google | May 2025 | 33K context | $0.0600/M input | $0.1200/M output

Mistral: Mistral Medium 3

Mistral Medium 3 is a high-performance enterprise-grade language model designed to deliver frontier-level capabilities at significantly reduced operational cost. It balances state-of-the-art reasoning and multimodal performance with 8× lower cost...

by mistralai | May 2025 | 131K context | $0.4000/M input | $2.00/M output

Google: Gemini 2.5 Pro Preview 05-06

by google | May 2025 | 1M context | $1.25/M input | $10.00/M output

Arcee AI: Virtuoso Large

Virtuoso‑Large is Arcee's top‑tier general‑purpose LLM at 72 B parameters, tuned to tackle cross‑domain reasoning, creative writing and enterprise QA. Unlike many 70 B peers, it retains the 128 k...

by arcee-ai | May 2025 | 131K context | $0.7500/M input | $1.20/M output

Arcee AI: Coder Large

Coder‑Large is a 32 B‑parameter offspring of Qwen 2.5‑Instruct that has been further trained on permissively‑licensed GitHub, CodeSearchNet and synthetic bug‑fix corpora. It supports a 32k context window, enabling multi‑file...

by arcee-ai | May 2025 | 33K context | $0.5000/M input | $0.8000/M output

Meta: Llama Guard 4 12B

Llama Guard 4 is a Llama 4 Scout-derived multimodal pretrained model, fine-tuned for content safety classification. Similar to previous versions, it can be used to classify content in both LLM...

by meta-llama | Apr 2025 | 164K context | $0.1800/M input | $0.1800/M output

Qwen: Qwen3 30B A3B

Qwen3, the latest generation in the Qwen large language model series, features both dense and mixture-of-experts (MoE) architectures to excel in reasoning, multilingual support, and advanced agent tasks. Its unique...

by qwen | Apr 2025 | 131K context | $0.1200/M input | $0.5000/M output

Qwen: Qwen3 8B

Qwen3-8B is a dense 8.2B parameter causal language model from the Qwen3 series, designed for both reasoning-heavy tasks and efficient dialogue. It supports seamless switching between "thinking" mode for math,...

by qwen | Apr 2025 | 131K context | $0.1170/M input | $0.4550/M output

Qwen: Qwen3 14B

Qwen3-14B is a dense 14.8B parameter causal language model from the Qwen3 series, designed for both complex reasoning and efficient dialogue. It supports seamless switching between a "thinking" mode for...

by qwen | Apr 2025 | 132K context | $0.1000/M input | $0.2400/M output

Qwen: Qwen3 32B

Qwen3-32B is a dense 32.8B parameter causal language model from the Qwen3 series, optimized for both complex reasoning and efficient dialogue. It supports seamless switching between a "thinking" mode for...

by qwen | Apr 2025 | 131K context | $0.0800/M input | $0.2800/M output

Qwen: Qwen3 235B A22B

Qwen3-235B-A22B is a 235B parameter mixture-of-experts (MoE) model developed by Qwen, activating 22B parameters per forward pass. It supports seamless switching between a "thinking" mode for complex reasoning, math, and...

by qwen | Apr 2025 | 131K context | $0.4550/M input | $1.82/M output

OpenAI: o4 Mini High

OpenAI o4-mini-high is the same model as [o4-mini](/openai/o4-mini) with reasoning_effort set to high. OpenAI o4-mini is a compact reasoning model in the o-series, optimized for fast, cost-efficient performance while retaining...

by openai | Apr 2025 | 200K context | $1.10/M input | $4.40/M output

OpenAI: o3

o3 is a well-rounded and powerful model across domains. It sets a new standard for math, science, coding, and visual reasoning tasks. It also excels at technical writing and instruction-following....

by openai | Apr 2025 | 200K context | $2.00/M input | $8.00/M output
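
The prices in this listing are quoted per million tokens, separately for input and output. As a rough illustration of how those rates translate into the cost of a single request, here is a minimal Python sketch. It assumes that input and output tokens are billed independently at the listed rates with no other fees; the function name `request_cost` and the token counts are hypothetical, chosen only for illustration.

```python
from decimal import Decimal

# Assumption: listed prices are USD per one million tokens.
TOKENS_PER_MILLION = Decimal(1_000_000)


def request_cost(input_tokens: int, output_tokens: int,
                 input_price_per_m: str, output_price_per_m: str) -> Decimal:
    """Return the USD cost of one request from per-million-token prices.

    Prices are passed as strings so Decimal keeps them exact.
    """
    input_cost = Decimal(input_tokens) * Decimal(input_price_per_m) / TOKENS_PER_MILLION
    output_cost = Decimal(output_tokens) * Decimal(output_price_per_m) / TOKENS_PER_MILLION
    return input_cost + output_cost


# Hypothetical request: 10,000 input tokens and 2,000 output tokens.
# GPT-5 as listed: $1.25 input, $10.00 output per million tokens.
gpt5_cost = request_cost(10_000, 2_000, "1.25", "10.00")
# GPT-5 Nano as listed: $0.0500 input, $0.4000 output per million tokens.
nano_cost = request_cost(10_000, 2_000, "0.0500", "0.4000")

print(f"GPT-5:      ${gpt5_cost}")
print(f"GPT-5 Nano: ${nano_cost}")
```

Using `Decimal` rather than floats avoids rounding drift when many small per-request costs are summed, which matters at rates of a few cents per million tokens.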
