AI Models

340 models 무료 & Paid Cập nhật: 7 hours trước

Inflection 3 Productivity is optimized for following instructions. It is better for tasks requiring JSON output or precise adherence to provided guidelines. It has access to recent news. For emotional...

~에 의해 |Oct 2024 |8K context |$2.50/M input |$10.00/M output

Inflection 3 Pi powers Inflection's [Pi](https://pi.ai) chatbot, including backstory, emotional intelligence, productivity, and safety. It has access to recent news, and excels in scenarios like customer support and roleplay. Pi...

~에 의해 |Oct 2024 |8K context |$2.50/M input |$10.00/M output

Rocinante 12B is designed for engaging storytelling and rich prose. Early testers have reported: - Expanded vocabulary with unique and expressive word choices - Enhanced creativity for vivid narratives -...

~에 의해 |9 월 2024 |33K context |$0.2500/M input |$0.5000/M output

야마 3.2 3B is a 3-billion-parameter multilingual large language model, optimized for advanced natural language processing tasks like dialogue generation, reasoning, and summarization. Designed with the latest transformer architecture, it...

~에 의해 |9 월 2024 |131K context |Miễn phí input |Miễn phí output
131K tokens

야마 3.2 3B is a 3-billion-parameter multilingual large language model, optimized for advanced natural language processing tasks like dialogue generation, reasoning, and summarization. Designed with the latest transformer architecture, it...

~에 의해 |9 월 2024 |131K context |$0.0509/M input |$0.3350/M output
131K tokens

야마 3.2 1B is a 1-billion-parameter language model focused on efficiently performing natural language tasks, such as summarization, dialogue, and multilingual text analysis. Its smaller size allows it to operate...

~에 의해 |9 월 2024 |131K context |$0.0270/M input |$0.2010/M output
131K tokens

야마 3.2 11B Vision is a multimodal model with 11 billion parameters, designed to handle tasks combining visual and textual data. It excels in tasks such as image captioning and...

~에 의해 |9 월 2024 |131K context |$0.3450/M input |$0.3450/M output
131K tokens

Qwen2.5 72B is the latest series of Qwen large language models. Qwen2.5 brings the following improvements upon Qwen2: - Significantly more knowledge and has greatly improved capabilities in coding and...

~에 의해 |9 월 2024 |131K context |$0.3600/M input |$0.4000/M output
131K tokens

command-r-plus-08-2024 is an update of the [Command R+](/models/cohere/command-r-plus) with roughly 50% higher throughput and 25% lower latencies as compared to the previous Command R+ version, while keeping the hardware footprint...

~에 의해 |8월 2024 |128K context |$2.50/M input |$10.00/M output
128K tokens

command-r-08-2024 is an update of the [Command R](/models/cohere/command-r) with improved performance for multilingual retrieval-augmented generation (RAG) and tool use. More broadly, it is better at math, code and reasoning and...

~에 의해 |8월 2024 |128K context |$0.1500/M input |$0.6000/M output
128K tokens

Euryale L3.1 70B v2.2 is a model focused on creative roleplay from [Sao10k](https://ko-fi.com/sao10k). It is the successor of [Euryale L3 70B v2.1](/models/sao10k/l3-euryale-70b).

~에 의해 |8월 2024 |131K context |$0.8500/M input |$0.8500/M output
131K tokens

Hermes 3 is a generalist language model with many improvements over [Hermes 2](/models/nousresearch/nous-hermes-2-mistral-7b-dpo), including advanced agentic capabilities, much better roleplaying, reasoning, multi-turn conversation, long context coherence, and improvements across the...

~에 의해 |8월 2024 |131K context |$0.7000/M input |$0.7000/M output
131K tokens

Hermes 3 is a generalist language model with many improvements over Hermes 2, including advanced agentic capabilities, much better roleplaying, reasoning, multi-turn conversation, long context coherence, and improvements across the...

~에 의해 |8월 2024 |131K context |Miễn phí input |Miễn phí output
131K tokens

Hermes 3 is a generalist language model with many improvements over Hermes 2, including advanced agentic capabilities, much better roleplaying, reasoning, multi-turn conversation, long context coherence, and improvements across the...

~에 의해 |8월 2024 |131K context |$1.00/M input |$1.00/M output
131K tokens

Lunaris 8B is a versatile generalist and roleplaying model based on Llama 3. It's a strategic merge of multiple models, designed to balance creativity with improved logic and general knowledge....

~에 의해 |8월 2024 |8K context |$0.0400/M input |$0.0500/M output

The 2024-08-06 version of GPT-4o offers improved performance in structured outputs, with the ability to supply a JSON schema in the respone_format. Read more [here](https://openai.com/index/introducing-structured-outputs-in-the-api/). GPT-4o ("o" for "omni") is...

~에 의해 |8월 2024 |128K context |$2.50/M input |$10.00/M output
128K tokens

Meta's latest class of model (야마 3.1) launched with a variety of sizes & flavors. This 8B instruct-tuned version is fast and efficient. It has demonstrated strong performance compared to...

~에 의해 |Jul 2024 |131K context |$0.0200/M input |$0.0300/M output
131K tokens

Meta's latest class of model (야마 3.1) launched with a variety of sizes & flavors. This 70B instruct-tuned version is optimized for high quality dialogue usecases. It has demonstrated strong...

~에 의해 |Jul 2024 |131K context |$0.4000/M input |$0.4000/M output
131K tokens

A 12B parameter model with a 128k token context length built by Mistral in collaboration with NVIDIA. The model is multilingual, supporting English, French, German, Spanish, Italian, Portuguese, Chinese, Japanese,...

~에 의해 |Jul 2024 |131K context |$0.0200/M input |$0.0300/M output
131K tokens

GPT-4o mini is OpenAI's newest model after [GPT-4 Omni](/models/openai/gpt-4o), supporting both text and image inputs with text outputs. As their most advanced small model, it is many multiples more affordable...

~에 의해 |Jul 2024 |128K context |$0.1500/M input |$0.6000/M output
128K tokens

GPT-4o mini is OpenAI's newest model after [GPT-4 Omni](/models/openai/gpt-4o), supporting both text and image inputs with text outputs. As their most advanced small model, it is many multiples more affordable...

~에 의해 |Jul 2024 |128K context |$0.1500/M input |$0.6000/M output
128K tokens

Gemma 2 27B by Google is an open model built from the same research and technology used to create the [Gemini models](/models?q=gemini). Gemma models are well-suited for a variety of...

~에 의해 |Jul 2024 |8K context |$0.6500/M input |$0.6500/M output

GPT-4o ("o" for "omni") is OpenAI's latest AI model, supporting both text and image inputs with text outputs. It maintains the intelligence level of [GPT-4 Turbo](/models/openai/gpt-4-turbo) while being twice as...

~에 의해 |5월 2024 |128K context |$5.00/M input |$15.00/M output
128K tokens

GPT-4o ("o" for "omni") is OpenAI's latest AI model, supporting both text and image inputs with text outputs. It maintains the intelligence level of [GPT-4 Turbo](/models/openai/gpt-4-turbo) while being twice as...

~에 의해 |5월 2024 |128K context |$2.50/M input |$10.00/M output
128K tokens

Meta's latest class of model (야마 3) launched with a variety of sizes & flavors. This 8B instruct-tuned version was optimized for high quality dialogue usecases. It has demonstrated strong...

~에 의해 |4월 2024 |8K context |$0.1400/M input |$0.1400/M output

Mistral's official instruct fine-tuned version of [Mixtral 8x22B](/models/mistralai/mixtral-8x22b). It uses 39B active parameters out of 141B, offering unparalleled cost efficiency for its size. Its strengths include: - strong math, 코딩,...

~에 의해 |4월 2024 |66K context |$2.00/M input |$6.00/M output
66K tokens

WizardLM-2 8x22B is Microsoft AI's most advanced Wizard model. It demonstrates highly competitive performance compared to leading proprietary models, and it consistently outperforms all existing state-of-the-art opensource models. It is...

~에 의해 |4월 2024 |66K context |$0.6200/M input |$0.6200/M output
66K tokens

The latest GPT-4 Turbo model with vision capabilities. Vision requests can now use JSON mode and function calling. Training data: up to December 2023.

~에 의해 |4월 2024 |128K context |$10.00/M input |$30.00/M output
128K tokens

클로드 3 Haiku is Anthropic's fastest and most compact model for near-instant responsiveness. Quick and accurate targeted performance. See the launch announcement and benchmark results [here](https://www.anthropic.com/news/claude-3-haiku) #multimodal

~에 의해 |3월 2024 |200K context |$0.2500/M input |$1.25/M output
200K tokens

This is Mistral AI's flagship model, 미스트랄 라지 2 (version `mistral-large-2407`). It's a proprietary weights-available model and excels at reasoning, code, JSON, chat, and more. Read the launch announcement [here](https://mistral.ai/news/mistral-large-2407/)....

~에 의해 |2월 2024 |128K context |$2.00/M input |$6.00/M output
128K tokens

The preview GPT-4 model with improved instruction following, JSON mode, reproducible outputs, parallel function calling, and more. Training data: up to Dec 2023. **Note:** heavily rate limited by OpenAI while...

~에 의해 |Jan 2024 |128K context |$10.00/M input |$30.00/M output
128K tokens

GPT-3.5 Turbo is OpenAI's fastest model. It can understand and generate natural language or code, and is optimized for chat and traditional completion tasks. Training data up to Sep 2021.

~에 의해 |Jan 2024 |4K context |$1.00/M input |$2.00/M output

Your prompt will be processed by a meta-model and routed to one of dozens of models (see below), optimizing for the best possible output. To see which model was used,...

~에 의해 |11 월 2023 |2M context |Miễn phí input |Miễn phí output
2M tokens

This model is a variant of GPT-3.5 Turbo tuned for instructional prompts and omitting chat-related optimizations. Training data: up to Sep 2021.

~에 의해 |9 월 2023 |4K context |$1.50/M input |$2.00/M output

This model offers four times the context length of gpt-3.5-turbo, allowing it to support approximately 20 pages of text in a single request at a higher cost. Training data: up...

~에 의해 |8월 2023 |16K context |$3.00/M input |$4.00/M output

An attempt to recreate Claude-style verbosity, but don't expect the same level of coherence or memory. Meant for use in roleplay/narrative situations.

~에 의해 |8월 2023 |8K context |$0.7500/M input |$1.00/M output

A recreation trial of the original MythoMax-L2-B13 but with updated models. #merge

~에 의해 |Jul 2023 |6K context |$0.4500/M input |$0.6500/M output

One of the highest performing and most popular fine-tunes of Llama 2 13비, with rich descriptions and roleplay. #merge

~에 의해 |Jul 2023 |4K context |$0.0600/M input |$0.0600/M output

GPT-3.5 Turbo is OpenAI's fastest model. It can understand and generate natural language or code, and is optimized for chat and traditional completion tasks. Training data up to Sep 2021.

~에 의해 |5월 2023 |16K context |$0.5000/M input |$1.50/M output

OpenAI's flagship model, GPT-4 is a large-scale multimodal language model capable of solving difficult problems with greater accuracy than previous models due to its broader general knowledge and advanced reasoning...

~에 의해 |5월 2023 |8K context |$30.00/M input |$60.00/M output