AI Models

340 models Free & Paid Updated: 7 hours ago

Inflection 3 Productivity is optimized for following instructions. It is better for tasks requiring JSON output or precise adherence to provided guidelines. It has access to recent news. For emotional...

by inflection |Oct 2024 |8K context |$2.50/M input |$10.00/M output

Inflection: Inflection 3 Pi

Inflection 3 Pi powers Inflection's [Pi](https://pi.ai) chatbot, including backstory, emotional intelligence, productivity, and safety. It has access to recent news, and excels in scenarios like customer support and roleplay. Pi...

by inflection |Oct 2024 |8K context |$2.50/M input |$10.00/M output

TheDrummer: Rocinante 12B

Rocinante 12B is designed for engaging storytelling and rich prose. Early testers have reported: - Expanded vocabulary with unique and expressive word choices - Enhanced creativity for vivid narratives -...

by thedrummer |Sep 2024 |33K context |$0.2500/M input |$0.5000/M output

Meta: Llama 3.2 3B Instruct (free)

Llama 3.2 3B is a 3-billion-parameter multilingual large language model, optimized for advanced natural language processing tasks like dialogue generation, reasoning, and summarization. Designed with the latest transformer architecture, it...

by meta-llama |Sep 2024 |131K context |Free input |Free output

Meta: Llama 3.2 3B Instruct

by meta-llama |Sep 2024 |131K context |$0.0509/M input |$0.3350/M output

Meta: Llama 3.2 1B Instruct

Llama 3.2 1B is a 1-billion-parameter language model focused on efficiently performing natural language tasks, such as summarization, dialogue, and multilingual text analysis. Its smaller size allows it to operate...

by meta-llama |Sep 2024 |131K context |$0.0270/M input |$0.2010/M output

Meta: Llama 3.2 11B Vision Instruct

Llama 3.2 11B Vision is a multimodal model with 11 billion parameters, designed to handle tasks combining visual and textual data. It excels in tasks such as image captioning and...

by meta-llama |Sep 2024 |131K context |$0.3450/M input |$0.3450/M output

Qwen2.5 72B Instruct

Qwen2.5 72B is the latest series of Qwen large language models. Qwen2.5 brings the following improvements upon Qwen2: - Significantly more knowledge and has greatly improved capabilities in coding and...

by qwen |Sep 2024 |131K context |$0.3600/M input |$0.4000/M output

Cohere: Command R+ (08-2024)

command-r-plus-08-2024 is an update of the [Command R+](/models/cohere/command-r-plus) with roughly 50% higher throughput and 25% lower latencies as compared to the previous Command R+ version, while keeping the hardware footprint...

by cohere |Aug 2024 |128K context |$2.50/M input |$10.00/M output

Cohere: Command R (08-2024)

command-r-08-2024 is an update of the [Command R](/models/cohere/command-r) with improved performance for multilingual retrieval-augmented generation (RAG) and tool use. More broadly, it is better at math, code and reasoning and...

by cohere |Aug 2024 |128K context |$0.1500/M input |$0.6000/M output

Sao10K: Llama 3.1 Euryale 70B v2.2

Euryale L3.1 70B v2.2 is a model focused on creative roleplay from [Sao10k](https://ko-fi.com/sao10k). It is the successor of [Euryale L3 70B v2.1](/models/sao10k/l3-euryale-70b).

by sao10k |Aug 2024 |131K context |$0.8500/M input |$0.8500/M output

Nous: Hermes 3 70B Instruct

Hermes 3 is a generalist language model with many improvements over [Hermes 2](/models/nousresearch/nous-hermes-2-mistral-7b-dpo), including advanced agentic capabilities, much better roleplaying, reasoning, multi-turn conversation, long context coherence, and improvements across the...

by nousresearch |Aug 2024 |131K context |$0.7000/M input |$0.7000/M output

Nous: Hermes 3 405B Instruct (free)

Hermes 3 is a generalist language model with many improvements over Hermes 2, including advanced agentic capabilities, much better roleplaying, reasoning, multi-turn conversation, long context coherence, and improvements across the...

by nousresearch |Aug 2024 |131K context |Free input |Free output

Nous: Hermes 3 405B Instruct

by nousresearch |Aug 2024 |131K context |$1.00/M input |$1.00/M output

Sao10K: Llama 3 8B Lunaris

Lunaris 8B is a versatile generalist and roleplaying model based on Llama 3. It's a strategic merge of multiple models, designed to balance creativity with improved logic and general knowledge....

by sao10k |Aug 2024 |8K context |$0.0400/M input |$0.0500/M output

OpenAI: GPT-4o (2024-08-06)

The 2024-08-06 version of GPT-4o offers improved performance in structured outputs, with the ability to supply a JSON schema in the response_format. Read more [here](https://openai.com/index/introducing-structured-outputs-in-the-api/). GPT-4o ("o" for "omni") is...

by openai |Aug 2024 |128K context |$2.50/M input |$10.00/M output

Meta: Llama 3.1 8B Instruct

Meta's latest class of model (Llama 3.1) launched with a variety of sizes & flavors. This 8B instruct-tuned version is fast and efficient. It has demonstrated strong performance compared to...

by meta-llama |Jul 2024 |131K context |$0.0200/M input |$0.0300/M output

Meta: Llama 3.1 70B Instruct

Meta's latest class of model (Llama 3.1) launched with a variety of sizes & flavors. This 70B instruct-tuned version is optimized for high-quality dialogue use cases. It has demonstrated strong...

by meta-llama |Jul 2024 |131K context |$0.4000/M input |$0.4000/M output

Mistral: Mistral Nemo

A 12B parameter model with a 128k token context length built by Mistral in collaboration with NVIDIA. The model is multilingual, supporting English, French, German, Spanish, Italian, Portuguese, Chinese, Japanese,...

by mistralai |Jul 2024 |131K context |$0.0200/M input |$0.0300/M output

OpenAI: GPT-4o-mini (2024-07-18)

GPT-4o mini is OpenAI's newest model after [GPT-4 Omni](/models/openai/gpt-4o), supporting both text and image inputs with text outputs. As their most advanced small model, it is many multiples more affordable...

by openai |Jul 2024 |128K context |$0.1500/M input |$0.6000/M output

OpenAI: GPT-4o-mini

by openai |Jul 2024 |128K context |$0.1500/M input |$0.6000/M output

Google: Gemma 2 27B

Gemma 2 27B by Google is an open model built from the same research and technology used to create the [Gemini models](/models?q=gemini). Gemma models are well-suited for a variety of...

by google |Jul 2024 |8K context |$0.6500/M input |$0.6500/M output

OpenAI: GPT-4o (2024-05-13)

GPT-4o ("o" for "omni") is OpenAI's latest AI model, supporting both text and image inputs with text outputs. It maintains the intelligence level of [GPT-4 Turbo](/models/openai/gpt-4-turbo) while being twice as...

by openai |May 2024 |128K context |$5.00/M input |$15.00/M output

OpenAI: GPT-4o

by openai |May 2024 |128K context |$2.50/M input |$10.00/M output

Meta: Llama 3 8B Instruct

Meta's latest class of model (Llama 3) launched with a variety of sizes & flavors. This 8B instruct-tuned version was optimized for high-quality dialogue use cases. It has demonstrated strong...

by meta-llama |Apr 2024 |8K context |$0.1400/M input |$0.1400/M output

Mistral: Mixtral 8x22B Instruct

Mistral's official instruct fine-tuned version of [Mixtral 8x22B](/models/mistralai/mixtral-8x22b). It uses 39B active parameters out of 141B, offering unparalleled cost efficiency for its size. Its strengths include: - strong math, coding,...

by mistralai |Apr 2024 |66K context |$2.00/M input |$6.00/M output

WizardLM-2 8x22B

WizardLM-2 8x22B is Microsoft AI's most advanced Wizard model. It demonstrates highly competitive performance compared to leading proprietary models, and it consistently outperforms all existing state-of-the-art open-source models. It is...

by microsoft |Apr 2024 |66K context |$0.6200/M input |$0.6200/M output

OpenAI: GPT-4 Turbo

The latest GPT-4 Turbo model with vision capabilities. Vision requests can now use JSON mode and function calling. Training data: up to December 2023.

by openai |Apr 2024 |128K context |$10.00/M input |$30.00/M output

Anthropic: Claude 3 Haiku

Claude 3 Haiku is Anthropic's fastest and most compact model for near-instant responsiveness. Quick and accurate targeted performance. See the launch announcement and benchmark results [here](https://www.anthropic.com/news/claude-3-haiku) #multimodal

by anthropic |Mar 2024 |200K context |$0.2500/M input |$1.25/M output

Mistral Large

This is Mistral AI's flagship model, Mistral Large 2 (version `mistral-large-2407`). It's a proprietary weights-available model and excels at reasoning, code, JSON, chat, and more. Read the launch announcement [here](https://mistral.ai/news/mistral-large-2407/)....

by mistralai |Feb 2024 |128K context |$2.00/M input |$6.00/M output

OpenAI: GPT-4 Turbo Preview

The preview GPT-4 model with improved instruction following, JSON mode, reproducible outputs, parallel function calling, and more. Training data: up to Dec 2023. **Note:** heavily rate limited by OpenAI while...

by openai |Jan 2024 |128K context |$10.00/M input |$30.00/M output

OpenAI: GPT-3.5 Turbo (older v0613)

GPT-3.5 Turbo is OpenAI's fastest model. It can understand and generate natural language or code, and is optimized for chat and traditional completion tasks. Training data up to Sep 2021.

by openai |Jan 2024 |4K context |$1.00/M input |$2.00/M output

Auto Router

Your prompt will be processed by a meta-model and routed to one of dozens of models (see below), optimizing for the best possible output. To see which model was used,...

by openrouter |Nov 2023 |2M context |Free input |Free output

OpenAI: GPT-3.5 Turbo Instruct

This model is a variant of GPT-3.5 Turbo tuned for instructional prompts and omitting chat-related optimizations. Training data: up to Sep 2021.

by openai |Sep 2023 |4K context |$1.50/M input |$2.00/M output

OpenAI: GPT-3.5 Turbo 16k

This model offers four times the context length of gpt-3.5-turbo, allowing it to support approximately 20 pages of text in a single request at a higher cost. Training data: up...

by openai |Aug 2023 |16K context |$3.00/M input |$4.00/M output

Mancer: Weaver (alpha)

An attempt to recreate Claude-style verbosity, but don't expect the same level of coherence or memory. Meant for use in roleplay/narrative situations.

by mancer |Aug 2023 |8K context |$0.7500/M input |$1.00/M output

ReMM SLERP 13B

A recreation trial of the original MythoMax-L2-B13 but with updated models. #merge

by undi95 |Jul 2023 |6K context |$0.4500/M input |$0.6500/M output

MythoMax 13B

One of the highest performing and most popular fine-tunes of Llama 2 13B, with rich descriptions and roleplay. #merge

by gryphe |Jul 2023 |4K context |$0.0600/M input |$0.0600/M output

OpenAI: GPT-3.5 Turbo

GPT-3.5 Turbo is OpenAI's fastest model. It can understand and generate natural language or code, and is optimized for chat and traditional completion tasks. Training data up to Sep 2021.

by openai |May 2023 |16K context |$0.5000/M input |$1.50/M output

OpenAI: GPT-4

OpenAI's flagship model, GPT-4 is a large-scale multimodal language model capable of solving difficult problems with greater accuracy than previous models due to its broader general knowledge and advanced reasoning...

by openai |May 2023 |8K context |$30.00/M input |$60.00/M output
