AI Models

356 models, free and paid. Updated 17 hours ago.

Amazon Nova Pro 1.0 is a capable multimodal model from Amazon focused on providing a combination of accuracy, speed, and cost for a wide range of tasks. As of December...

by |Dec 2024 |300K context |$0.8000/M input |$3.20/M output

The 2024-11-20 version of GPT-4o offers a leveled-up creative writing ability with more natural, engaging, and tailored writing to improve relevance & readability. It’s also better at working with uploaded...

by |Nov 2024 |128K context |$2.50/M input |$10.00/M output

Mistral Large 2 2411 is an update of [Mistral Large 2](/mistralai/mistral-large) released together with [Pixtral Large 2411](/mistralai/pixtral-large-2411). It provides a significant upgrade on the previous [Mistral Large 24.07](/mistralai/mistral-large-2407), with notable...

by |Nov 2024 |131K context |$2.00/M input |$6.00/M output

This is Mistral AI's flagship model, Mistral Large 2 (version `mistral-large-2407`). It's a proprietary weights-available model and excels at reasoning, code, JSON, chat, and more. Read the launch announcement [here](https://mistral.ai/news/mistral-large-2407/)....

by |Nov 2024 |131K context |$2.00/M input |$6.00/M output

Pixtral Large is a 124B parameter, open-weight, multimodal model built on top of [Mistral Large 2](/mistralai/mistral-large-2411). The model is able to understand documents, charts and natural images. The model is...

by |Nov 2024 |131K context |$2.00/M input |$6.00/M output

Qwen2.5-Coder is the latest series of code-specific Qwen large language models (formerly known as CodeQwen). Qwen2.5-Coder brings the following improvements upon CodeQwen1.5: - Significant improvements in **code generation**, **code reasoning**...

by |Nov 2024 |128K context |$0.6600/M input |$1.00/M output

UnslopNemo v4.1 is the latest addition from the creator of Rocinante, designed for adventure writing and role-play scenarios.

by |Nov 2024 |33K context |$0.4000/M input |$0.4000/M output

Claude 3.5 Haiku offers enhanced capabilities in speed, coding accuracy, and tool use. Engineered to excel in real-time applications, it delivers quick response times that are essential for dynamic...

by |Nov 2024 |200K context |$0.8000/M input |$4.00/M output

This is a series of models designed to replicate the prose quality of the Claude 3 models, specifically [Sonnet](https://openrouter.ai/anthropic/claude-3.5-sonnet) and [Opus](https://openrouter.ai/anthropic/claude-3-opus). The model is fine-tuned on top of [Qwen2.5 72B](https://openrouter.ai/qwen/qwen-2.5-72b-instruct).

by |Oct 2024 |33K context |$3.00/M input |$5.00/M output

Qwen2.5 7B is the latest series of Qwen large language models. Qwen2.5 brings the following improvements upon Qwen2: - Significantly more knowledge and has greatly improved capabilities in coding and...

by |Oct 2024 |131K context |$0.0400/M input |$0.1000/M output

Inflection 3 Productivity is optimized for following instructions. It is better for tasks requiring JSON output or precise adherence to provided guidelines. It has access to recent news. For emotional...

by |Oct 2024 |8K context |$2.50/M input |$10.00/M output

Inflection 3 Pi powers Inflection's [Pi](https://pi.ai) chatbot, including backstory, emotional intelligence, productivity, and safety. It has access to recent news, and excels in scenarios like customer support and roleplay. Pi...

by |Oct 2024 |8K context |$2.50/M input |$10.00/M output

Rocinante 12B is designed for engaging storytelling and rich prose. Early testers have reported: - Expanded vocabulary with unique and expressive word choices - Enhanced creativity for vivid narratives -...

by |Sep 2024 |33K context |$0.1700/M input |$0.4300/M output

Llama 3.2 1B is a 1-billion-parameter language model focused on efficiently performing natural language tasks, such as summarization, dialogue, and multilingual text analysis. Its smaller size allows it to operate...

by |Sep 2024 |131K context |$0.0270/M input |$0.2010/M output

Llama 3.2 11B Vision is a multimodal model with 11 billion parameters, designed to handle tasks combining visual and textual data. It excels in tasks such as image captioning and...

by |Sep 2024 |131K context |$0.2450/M input |$0.2450/M output

Llama 3.2 3B is a 3-billion-parameter multilingual large language model, optimized for advanced natural language processing tasks like dialogue generation, reasoning, and summarization. Designed with the latest transformer architecture, it...

by |Sep 2024 |131K context |Free input |Free output

Llama 3.2 3B is a 3-billion-parameter multilingual large language model, optimized for advanced natural language processing tasks like dialogue generation, reasoning, and summarization. Designed with the latest transformer architecture, it...

by |Sep 2024 |131K context |$0.0509/M input |$0.3350/M output

Qwen2.5 72B is the latest series of Qwen large language models. Qwen2.5 brings the following improvements upon Qwen2: - Significantly more knowledge and has greatly improved capabilities in coding and...

by |Sep 2024 |131K context |$0.3600/M input |$0.4000/M output

command-r-plus-08-2024 is an update of the [Command R+](/models/cohere/command-r-plus) with roughly 50% higher throughput and 25% lower latencies as compared to the previous Command R+ version, while keeping the hardware footprint...

by |Aug 2024 |128K context |$2.50/M input |$10.00/M output

command-r-08-2024 is an update of the [Command R](/models/cohere/command-r) with improved performance for multilingual retrieval-augmented generation (RAG) and tool use. More broadly, it is better at math, code and reasoning and...

by |Aug 2024 |128K context |$0.1500/M input |$0.6000/M output

Euryale L3.1 70B v2.2 is a model focused on creative roleplay from [Sao10k](https://ko-fi.com/sao10k). It is the successor of [Euryale L3 70B v2.1](/models/sao10k/l3-euryale-70b).

by |Aug 2024 |131K context |$0.8500/M input |$0.8500/M output

Hermes 3 is a generalist language model with many improvements over [Hermes 2](/models/nousresearch/nous-hermes-2-mistral-7b-dpo), including advanced agentic capabilities, much better roleplaying, reasoning, multi-turn conversation, long context coherence, and improvements across the...

by |Aug 2024 |131K context |$0.3000/M input |$0.3000/M output

Hermes 3 is a generalist language model with many improvements over Hermes 2, including advanced agentic capabilities, much better roleplaying, reasoning, multi-turn conversation, long context coherence, and improvements across the...

by |Aug 2024 |131K context |Free input |Free output

Hermes 3 is a generalist language model with many improvements over Hermes 2, including advanced agentic capabilities, much better roleplaying, reasoning, multi-turn conversation, long context coherence, and improvements across the...

by |Aug 2024 |131K context |$1.00/M input |$1.00/M output

Lunaris 8B is a versatile generalist and roleplaying model based on Llama 3. It's a strategic merge of multiple models, designed to balance creativity with improved logic and general knowledge....

by |Aug 2024 |8K context |$0.0400/M input |$0.0500/M output

The 2024-08-06 version of GPT-4o offers improved performance in structured outputs, with the ability to supply a JSON schema in the `response_format`. Read more [here](https://openai.com/index/introducing-structured-outputs-in-the-api/). GPT-4o ("o" for "omni") is...

by |Aug 2024 |128K context |$2.50/M input |$10.00/M output

Meta's latest class of model (Llama 3.1) launched with a variety of sizes & flavors. This 8B instruct-tuned version is fast and efficient. It has demonstrated strong performance compared to...

by |Jul 2024 |131K context |$0.0200/M input |$0.0500/M output

Meta's latest class of model (Llama 3.1) launched with a variety of sizes & flavors. This 70B instruct-tuned version is optimized for high-quality dialogue use cases. It has demonstrated strong...

by |Jul 2024 |131K context |$0.4000/M input |$0.4000/M output

A 12B parameter model with a 128k token context length built by Mistral in collaboration with NVIDIA. The model is multilingual, supporting English, French, German, Spanish, Italian, Portuguese, Chinese, Japanese,...

by |Jul 2024 |131K context |$0.0200/M input |$0.0300/M output

GPT-4o mini is OpenAI's newest model after [GPT-4 Omni](/models/openai/gpt-4o), supporting both text and image inputs with text outputs. As their most advanced small model, it is many multiples more affordable...

by |Jul 2024 |128K context |$0.1500/M input |$0.6000/M output

GPT-4o mini is OpenAI's newest model after [GPT-4 Omni](/models/openai/gpt-4o), supporting both text and image inputs with text outputs. As their most advanced small model, it is many multiples more affordable...

by |Jul 2024 |128K context |$0.1500/M input |$0.6000/M output

Gemma 2 27B by Google is an open model built from the same research and technology used to create the [Gemini models](/models?q=gemini). Gemma models are well-suited for a variety of...

by |Jul 2024 |8K context |$0.6500/M input |$0.6500/M output

Euryale 70B v2.1 is a model focused on creative roleplay from [Sao10k](https://ko-fi.com/sao10k). - Better prompt adherence. - Better anatomy / spatial awareness. - Adapts much better to unique and custom...

by |Jun 2024 |8K context |$1.48/M input |$1.48/M output

Hermes 2 Pro is an upgraded, retrained version of Nous Hermes 2, consisting of an updated and cleaned version of the OpenHermes 2.5 Dataset, as well as a newly introduced...

by |May 2024 |8K context |$0.1400/M input |$0.1400/M output

GPT-4o ("o" for "omni") is OpenAI's latest AI model, supporting both text and image inputs with text outputs. It maintains the intelligence level of [GPT-4 Turbo](/models/openai/gpt-4-turbo) while being twice as...

by |May 2024 |128K context |$5.00/M input |$15.00/M output

GPT-4o ("o" for "omni") is OpenAI's latest AI model, supporting both text and image inputs with text outputs. It maintains the intelligence level of [GPT-4 Turbo](/models/openai/gpt-4-turbo) while being twice as...

by |May 2024 |128K context |$2.50/M input |$10.00/M output

Meta's latest class of model (Llama 3) launched with a variety of sizes & flavors. This 8B instruct-tuned version was optimized for high-quality dialogue use cases. It has demonstrated strong...

by |Apr 2024 |8K context |$0.0400/M input |$0.0400/M output

Meta's latest class of model (Llama 3) launched with a variety of sizes & flavors. This 70B instruct-tuned version was optimized for high-quality dialogue use cases. It has demonstrated strong...

by |Apr 2024 |8K context |$0.5100/M input |$0.7400/M output

Mistral's official instruct fine-tuned version of [Mixtral 8x22B](/models/mistralai/mixtral-8x22b). It uses 39B active parameters out of 141B, offering unparalleled cost efficiency for its size. Its strengths include: - strong math, coding,...

by |Apr 2024 |66K context |$2.00/M input |$6.00/M output

WizardLM-2 8x22B is Microsoft AI's most advanced Wizard model. It demonstrates highly competitive performance compared to leading proprietary models, and it consistently outperforms all existing state-of-the-art open-source models. It is...

by |Apr 2024 |66K context |$0.6200/M input |$0.6200/M output

The latest GPT-4 Turbo model with vision capabilities. Vision requests can now use JSON mode and function calling. Training data: up to December 2023.

by |Apr 2024 |128K context |$10.00/M input |$30.00/M output

Claude 3 Haiku is Anthropic's fastest and most compact model for near-instant responsiveness. Quick and accurate targeted performance. See the launch announcement and benchmark results [here](https://www.anthropic.com/news/claude-3-haiku) #multimodal

by |Mar 2024 |200K context |$0.2500/M input |$1.25/M output

This is Mistral AI's flagship model, Mistral Large 2 (version `mistral-large-2407`). It's a proprietary weights-available model and excels at reasoning, code, JSON, chat, and more. Read the launch announcement [here](https://mistral.ai/news/mistral-large-2407/)....

by |Feb 2024 |128K context |$2.00/M input |$6.00/M output

The preview GPT-4 model with improved instruction following, JSON mode, reproducible outputs, parallel function calling, and more. Training data: up to Dec 2023. **Note:** heavily rate limited by OpenAI while...

by |Jan 2024 |128K context |$10.00/M input |$30.00/M output

GPT-3.5 Turbo is OpenAI's fastest model. It can understand and generate natural language or code, and is optimized for chat and traditional completion tasks. Training data up to Sep 2021.

by |Jan 2024 |4K context |$1.00/M input |$2.00/M output

Your prompt will be processed by a meta-model and routed to one of dozens of models (see below), optimizing for the best possible output. To see which model was used,...

by |Nov 2023 |2M context |Free input |Free output

The latest GPT-4 Turbo model with vision capabilities. Vision requests can now use JSON mode and function calling. Training data: up to April 2023.

by |Nov 2023 |128K context |$10.00/M input |$30.00/M output

This model is a variant of GPT-3.5 Turbo tuned for instructional prompts and omitting chat-related optimizations. Training data: up to Sep 2021.

by |Sep 2023 |4K context |$1.50/M input |$2.00/M output

A 7.3B parameter model that outperforms Llama 2 13B on all benchmarks, with optimizations for speed and context length.

by |Sep 2023 |4K context |$0.1100/M input |$0.1900/M output

This model offers four times the context length of gpt-3.5-turbo, allowing it to support approximately 20 pages of text in a single request at a higher cost. Training data: up...

by |Aug 2023 |16K context |$3.00/M input |$4.00/M output
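
The prices in this listing are quoted per million tokens ("/M"), separately for input and output. As a rough illustration of how those figures turn into a per-request cost, here is a minimal Python sketch. The function name `estimate_cost` and the token counts are hypothetical and not part of the listing; the sketch assumes billing is strictly linear in token count, which may not hold for every provider.

```python
from decimal import Decimal

# Prices in the listing are quoted per one million tokens.
TOKENS_PER_MILLION = Decimal(1_000_000)


def estimate_cost(input_tokens: int, output_tokens: int,
                  input_price_per_m: str, output_price_per_m: str) -> Decimal:
    """Estimate the USD cost of one request (hypothetical helper).

    Prices are passed as strings so Decimal can represent them exactly.
    Assumes cost scales linearly with the number of tokens.
    """
    input_cost = Decimal(input_price_per_m) * input_tokens / TOKENS_PER_MILLION
    output_cost = Decimal(output_price_per_m) * output_tokens / TOKENS_PER_MILLION
    return input_cost + output_cost


# Example with one of the listed price pairs: $2.50/M input, $10.00/M output,
# for an assumed request of 1,000 input tokens and 500 output tokens.
cost = estimate_cost(1_000, 500, "2.50", "10.00")
print(f"Estimated cost: ${cost}")
```

Using `Decimal` rather than floats avoids rounding drift when many small per-request costs are summed.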