AI Models

340 models 무료 & Paid Cập nhật: 10 hours trước

오픈AI: o4 Mini

OpenAI o4-mini is a compact reasoning model in the o-series, optimized for fast, cost-efficient performance while retaining strong multimodal and agentic capabilities. It supports tool use and demonstrates competitive reasoning...

~에 의해 개방하다 |4월 2025 |200K context |$1.10/M input |$4.40/M output

200K tokens ⓘ

오픈AI: GPT-4.1

GPT-4.1 is a flagship large language model optimized for advanced instruction following, real-world software engineering, and long-context reasoning. It supports a 1 million token context window and outperforms GPT-4o and...

~에 의해 개방하다 |4월 2025 |1M context |$2.00/M input |$8.00/M output

1M tokens ⓘ

오픈AI: GPT-4.1 Mini

GPT-4.1 Mini is a mid-sized model delivering performance competitive with GPT-4o at substantially lower latency and cost. It retains a 1 million token context window and scores 45.1% on hard...

~에 의해 개방하다 |4월 2025 |1M context |$0.4000/M input |$1.60/M output

1M tokens ⓘ

오픈AI: GPT-4.1 Nano

For tasks that demand low latency, GPT‑4.1 nano is the fastest and cheapest model in the GPT-4.1 series. It delivers exceptional performance at a small size with its 1 million...

~에 의해 개방하다 |4월 2025 |1M context |$0.1000/M input |$0.4000/M output

1M tokens ⓘ

Meta: 야마 4 Maverick

야마 4 Maverick 17B Instruct (128이자형) is a high-capacity multimodal language model from Meta, built on a mixture-of-experts (MoE) architecture with 128 experts and 17 billion active parameters per forward...

~에 의해 meta-llama |4월 2025 |1M context |$0.1500/M input |$0.6000/M output

1M tokens ⓘ

Meta: 야마 4 Scout

야마 4 Scout 17B Instruct (16이자형) is a mixture-of-experts (MoE) language model developed by Meta, activating 17 billion parameters out of a total of 109B. It supports native multimodal input...

~에 의해 meta-llama |4월 2025 |10M context |$0.1000/M input |$0.3000/M output

10M tokens ⓘ

DeepSeek: DeepSeek V3 0324

DeepSeek V3, a 685B-parameter, mixture-of-experts model, is the latest iteration of the flagship chat model family from the DeepSeek team. It succeeds the [DeepSeek V3](/deepseek/deepseek-chat-v3) model and performs really well...

~에 의해 깊은 탐색 |3월 2025 |164K context |$0.2400/M input |$0.9000/M output

164K tokens ⓘ

오픈AI: o1-pro

The o1 series of models are trained with reinforcement learning to think before they answer and perform complex reasoning. The o1-pro model uses more compute to think harder and provide...

~에 의해 개방하다 |3월 2025 |200K context |$150.00/M input |$600.00/M output

200K tokens ⓘ

미스트랄: 미스트랄 스몰 3.1 24비

미스트랄 스몰 3.1 24B Instruct is an upgraded variant of Mistral Small 3 (2501), featuring 24 billion parameters with advanced multimodal capabilities. It provides state-of-the-art performance in text-based reasoning and...

~에 의해 미스트랄 |3월 2025 |128K context |$0.3510/M input |$0.5550/M output

128K tokens ⓘ

Google: Gemma 3 4비

Gemma 3 introduces multimodality, supporting vision-language input and text outputs. It handles context windows up to 128k tokens, understands over 140 languages, and offers improved math, reasoning, and chat capabilities,...

~에 의해 google |3월 2025 |131K context |$0.0500/M input |$0.1000/M output

131K tokens ⓘ

Google: Gemma 3 12비

~에 의해 google |3월 2025 |131K context |$0.0500/M input |$0.1500/M output

131K tokens ⓘ

Cohere: Command A

Command A is an open-weights 111B parameter model with a 256k context window focused on delivering great performance across agentic, multilingual, and coding use cases. Compared to other leading proprietary...

~에 의해 cohere |3월 2025 |256K context |$2.50/M input |$10.00/M output

256K tokens ⓘ

오픈AI: GPT-4o-mini Search Preview

GPT-4o mini Search Preview is a specialized model for web search in Chat Completions. It is trained to understand and execute web search queries.

~에 의해 개방하다 |3월 2025 |128K context |$0.1500/M input |$0.6000/M output

128K tokens ⓘ

오픈AI: GPT-4o Search Preview

GPT-4o Search Previewis a specialized model for web search in Chat Completions. It is trained to understand and execute web search queries.

~에 의해 개방하다 |3월 2025 |128K context |$2.50/M input |$10.00/M output

128K tokens ⓘ

Reka Flash 3

Reka Flash 3 is a general-purpose, instruction-tuned large language model with 21 billion parameters, developed by Reka. It excels at general chat, coding tasks, instruction-following, and function calling. Featuring a...

~에 의해 rekaai |3월 2025 |66K context |$0.1000/M input |$0.2000/M output

66K tokens ⓘ

Google: Gemma 3 27비

~에 의해 google |3월 2025 |131K context |$0.0800/M input |$0.1600/M output

131K tokens ⓘ

TheDrummer: Skyfall 36B V2

Skyfall 36B v2 is an enhanced iteration of Mistral Small 2501, specifically fine-tuned for improved creativity, nuanced writing, role-playing, and coherent storytelling.

~에 의해 thedrummer |3월 2025 |33K context |$0.5500/M input |$0.8000/M output

Perplexity: Sonar Reasoning Pro

Note: Sonar Pro pricing includes Perplexity search pricing. See [details here](https://docs.perplexity.ai/guides/pricing#detailed-pricing-breakdown-for-sonar-reasoning-pro-and-sonar-pro) Sonar Reasoning Pro is a premier reasoning model powered by DeepSeek R1 with Chain of Thought (CoT). Designed for...

~에 의해 perplexity |3월 2025 |128K context |$2.00/M input |$8.00/M output

128K tokens ⓘ

Perplexity: Sonar Pro

Note: Sonar Pro pricing includes Perplexity search pricing. See [details here](https://docs.perplexity.ai/guides/pricing#detailed-pricing-breakdown-for-sonar-reasoning-pro-and-sonar-pro) For enterprises seeking more advanced capabilities, the Sonar Pro API can handle in-depth, multi-step queries with added extensibility, like...

~에 의해 perplexity |3월 2025 |200K context |$3.00/M input |$15.00/M output

200K tokens ⓘ

Perplexity: Sonar Deep Research

Sonar Deep Research is a research-focused model designed for multi-step retrieval, synthesis, and reasoning across complex topics. It autonomously searches, reads, and evaluates sources, refining its approach as it gathers...

~에 의해 perplexity |3월 2025 |128K context |$2.00/M input |$8.00/M output

128K tokens ⓘ

미스트랄: Saba

Mistral Saba is a 24B-parameter language model specifically designed for the Middle East and South Asia, delivering accurate and contextually relevant responses while maintaining efficient performance. Trained on curated regional...

~에 의해 미스트랄 |2월 2025 |33K context |$0.2000/M input |$0.6000/M output

오픈AI: o3 Mini High

OpenAI o3-mini-high is the same model as [o3-mini](/openai/o3-mini) with reasoning_effort set to high. o3-mini is a cost-efficient language model optimized for STEM reasoning tasks, particularly excelling in science, mathematics, and...

~에 의해 개방하다 |2월 2025 |200K context |$1.10/M input |$4.40/M output

200K tokens ⓘ

아이온랩스: Aion-1.0

Aion-1.0 is a multi-model system designed for high performance across various tasks, including reasoning and coding. It is built on DeepSeek-R1, augmented with additional models and techniques such as Tree...

~에 의해 aion-labs |2월 2025 |131K context |$4.00/M input |$8.00/M output

131K tokens ⓘ

아이온랩스: Aion-1.0-Mini

Aion-1.0-Mini 32B parameter model is a distilled version of the DeepSeek-R1 model, designed for strong performance in reasoning domains such as mathematics, 코딩, and logic. It is a modified variant...

~에 의해 aion-labs |2월 2025 |131K context |$0.7000/M input |$1.40/M output

131K tokens ⓘ

아이온랩스: Aion-RP 1.0 (8비)

Aion-RP-Llama-3.1-8B ranks the highest in the character evaluation portion of the RPBench-Auto benchmark, a roleplaying-specific variant of Arena-Hard-Auto, where LLMs evaluate each other’s responses. It is a fine-tuned base model...

~에 의해 aion-labs |2월 2025 |33K context |$0.8000/M input |$1.60/M output

Qwen: Qwen2.5 VL 72B Instruct

Qwen2.5-VL is proficient in recognizing common objects such as flowers, birds, fish, and insects. It is also highly capable of analyzing texts, charts, icons, graphics, and layouts within images.

~에 의해 qwen |2월 2025 |131K context |$0.8000/M input |$1.00/M output

131K tokens ⓘ

Qwen: Qwen-Plus

Qwen-Plus, based on the Qwen2.5 foundation model, is a 131K context model with a balanced performance, speed, and cost combination.

~에 의해 qwen |2월 2025 |1M context |$0.2600/M input |$0.7800/M output

1M tokens ⓘ

오픈AI: o3 Mini

OpenAI o3-mini is a cost-efficient language model optimized for STEM reasoning tasks, particularly excelling in science, mathematics, and coding. This model supports the `reasoning_effort` parameter, which can be set to...

~에 의해 개방하다 |Jan 2025 |200K context |$1.10/M input |$4.40/M output

200K tokens ⓘ

미스트랄: 미스트랄 스몰 3

미스트랄 스몰 3 is a 24B-parameter language model optimized for low-latency performance across common AI tasks. Released under the Apache 2.0 license, it features both pre-trained and instruction-tuned versions designed...

~에 의해 미스트랄 |Jan 2025 |33K context |$0.0500/M input |$0.0800/M output

Perplexity: Sonar

Sonar is lightweight, affordable, fast, and simple to use — now featuring citations and the ability to customize sources. It is designed for companies seeking to integrate lightweight question-and-answer features...

~에 의해 perplexity |Jan 2025 |127K context |$1.00/M input |$1.00/M output

127K tokens ⓘ

DeepSeek: R1 Distill Llama 70B

DeepSeek R1 Distill Llama 70B is a distilled large language model based on [Llama-3.3-70B-Instruct](/meta-llama/llama-3.3-70b-instruct), using outputs from [DeepSeek R1](/deepseek/deepseek-r1). The model combines advanced distillation techniques to achieve high performance across...

~에 의해 깊은 탐색 |Jan 2025 |128K context |$0.8000/M input |$0.8000/M output

128K tokens ⓘ

DeepSeek: R1

DeepSeek R1 is here: Performance on par with [OpenAI o1](/openai/o1), but open-sourced and with fully open reasoning tokens. It's 671B parameters in size, with 37B active in an inference pass....

~에 의해 깊은 탐색 |Jan 2025 |164K context |$0.7000/M input |$2.50/M output

164K tokens ⓘ

MiniMax: MiniMax-01

MiniMax-01 is a combines MiniMax-Text-01 for text generation and MiniMax-VL-01 for image understanding. It has 456 billion parameters, with 45.9 billion parameters activated per inference, and can handle a context...

~에 의해 미니맥스 |Jan 2025 |1M context |$0.2000/M input |$1.10/M output

1M tokens ⓘ

Microsoft: Phi 4

[Microsoft Research](/microsoft) Phi-4 is designed to perform well in complex reasoning tasks and can operate efficiently in situations with limited memory or where quick responses are needed. At 14 billion...

~에 의해 microsoft |Jan 2025 |16K context |$0.0700/M input |$0.1400/M output

Sao10K: 야마 3.1 70B Hanami x1

This is [Sao10K](/sao10k)'s experiment over [Euryale v2.2](/sao10k/l3.1-euryale-70b).

~에 의해 sao10k |Jan 2025 |16K context |$3.00/M input |$3.00/M output

DeepSeek: DeepSeek V3

DeepSeek-V3 is the latest model from the DeepSeek team, building upon the instruction following and coding abilities of the previous versions. Pre-trained on nearly 15 trillion tokens, the reported evaluations...

~에 의해 깊은 탐색 |Dec 2024 |131K context |$0.2002/M input |$0.8001/M output

131K tokens ⓘ

Sao10K: 야마 3.3 Euryale 70B

Euryale L3.3 70B is a model focused on creative roleplay from [Sao10k](https://ko-fi.com/sao10k). It is the successor of [Euryale L3 70B v2.2](/models/sao10k/l3-euryale-70b).

~에 의해 sao10k |Dec 2024 |131K context |$0.6500/M input |$0.7500/M output

131K tokens ⓘ

오픈AI: o1

The latest and strongest model family from OpenAI, o1 is designed to spend more time thinking before responding. The o1 model series is trained with large-scale reinforcement learning to reason...

~에 의해 개방하다 |Dec 2024 |200K context |$15.00/M input |$60.00/M output

200K tokens ⓘ

Cohere: Command R7B (12-2024)

Command R7B (12-2024) is a small, fast update of the Command R+ model, delivered in December 2024. It excels at RAG, 도구 사용, agents, and similar tasks requiring complex reasoning...

~에 의해 cohere |Dec 2024 |128K context |$0.0375/M input |$0.1500/M output

128K tokens ⓘ

Meta: 야마 3.3 70B Instruct (free)

The Meta Llama 3.3 multilingual large language model (LLM) is a pretrained and instruction tuned generative model in 70B (text in/text out). The Llama 3.3 instruction tuned text only model...

~에 의해 meta-llama |Dec 2024 |131K context |Miễn phí input |Miễn phí output

131K tokens ⓘ

Meta: 야마 3.3 70B Instruct

The Meta Llama 3.3 multilingual large language model (LLM) is a pretrained and instruction tuned generative model in 70B (text in/text out). The Llama 3.3 instruction tuned text only model...

~에 의해 meta-llama |Dec 2024 |131K context |$0.1000/M input |$0.3200/M output

131K tokens ⓘ

Amazon: Nova Lite 1.0

Amazon Nova Lite 1.0 is a very low-cost multimodal model from Amazon that focused on fast processing of image, video, and text inputs to generate text output. Amazon Nova Lite...

~에 의해 amazon |Dec 2024 |300K context |$0.0600/M input |$0.2400/M output

300K tokens ⓘ

Amazon: Nova Micro 1.0

Amazon Nova Micro 1.0 is a text-only model that delivers the lowest latency responses in the Amazon Nova family of models at a very low cost. With a context length...

~에 의해 amazon |Dec 2024 |128K context |$0.0350/M input |$0.1400/M output

128K tokens ⓘ

Amazon: Nova Pro 1.0

Amazon Nova Pro 1.0 is a capable multimodal model from Amazon focused on providing a combination of accuracy, speed, and cost for a wide range of tasks. As of December...

~에 의해 amazon |Dec 2024 |300K context |$0.8000/M input |$3.20/M output

300K tokens ⓘ

오픈AI: GPT-4o (2024-11-20)

The 2024-11-20 version of GPT-4o offers a leveled-up creative writing ability with more natural, engaging, and tailored writing to improve relevance & readability. It’s also better at working with uploaded...

~에 의해 개방하다 |11 월 2024 |128K context |$2.50/M input |$10.00/M output

128K tokens ⓘ

미스트랄 라지 2407

This is Mistral AI's flagship model, 미스트랄 라지 2 (version mistral-large-2407). It's a proprietary weights-available model and excels at reasoning, code, JSON, chat, and more. Read the launch announcement [here](https://mistral.ai/news/mistral-large-2407/)....

~에 의해 미스트랄 |11 월 2024 |131K context |$2.00/M input |$6.00/M output

131K tokens ⓘ

Qwen2.5 Coder 32B Instruct

Qwen2.5-Coder is the latest series of Code-Specific Qwen large language models (formerly known as CodeQwen). Qwen2.5-Coder brings the following improvements upon CodeQwen1.5: - Significantly improvements in **code generation**, **code reasoning**...

~에 의해 qwen |11 월 2024 |128K context |$0.6600/M input |$1.00/M output

128K tokens ⓘ

TheDrummer: UnslopNemo 12B

UnslopNemo v4.1 is the latest addition from the creator of Rocinante, designed for adventure writing and role-play scenarios.

~에 의해 thedrummer |11 월 2024 |33K context |$0.4000/M input |$0.4000/M output

Magnum v4 72B

This is a series of models designed to replicate the prose quality of the Claude 3 models, specifically Sonnet(https://openrouter.ai/anthropic/claude-3.5-sonnet) and Opus(https://openrouter.ai/anthropic/claude-3-opus). The model is fine-tuned on top of [Qwen2.5 72B](https://openrouter.ai/qwen/qwen-2.5-72b-instruct).

~에 의해 anthracite-org |Oct 2024 |33K context |$3.00/M input |$5.00/M output

Qwen: Qwen2.5 7B Instruct

Qwen2.5 7B is the latest series of Qwen large language models. Qwen2.5 brings the following improvements upon Qwen2: - Significantly more knowledge and has greatly improved capabilities in coding and...

~에 의해 qwen |Oct 2024 |131K context |$0.0400/M input |$0.1000/M output

131K tokens ⓘ

AI Models

계정

🔑 Lấy lại mật khẩu