AI Models

340 models Free & Paid Cập nhật: 4 giờ trước

Laguna XS 2.1 is the latest coding agent model in the 33B-A3B category from [Poolside](https://poolside.ai/) and a step forward from their Laguna XS.2 model (released in April 2026). It combines...

by poolside |Th7 2026 |262K context |Miễn phí input |Miễn phí output

262K tokens ⓘ

Poolside: Laguna XS 2.1

Laguna XS 2.1 is the latest coding agent model in the 33B-A3B category from [Poolside](https://poolside.ai/) and a step forward from their Laguna XS.2 model (released in April 2026). It combines...

by poolside |Th7 2026 |262K context |$0.0600/M input |$0.1200/M output

262K tokens ⓘ

Anthropic: Claude Sonnet 5

Sonnet 5 is Anthropic's most capable Sonnet-class model, with frontier performance across coding, agents, and professional work. It supports adaptive thinking with selectable reasoning effort levels (low, medium, high, max,...

by anthropic |Th6 2026 |1M context |$2.00/M input |$10.00/M output

1M tokens ⓘ

Google: Nano Banana 2 Lite (Gemini 3.1 Flash Lite Image)

Nano Banana 2 Lite (Gemini 3.1 Flash Lite Image) is Google's fastest, most cost-efficient Gemini image model, built for high-velocity developer pipelines and rapid-fire visual exploration. It delivers text-to-image generation...

by google |Th6 2026 |66K context |$0.2500/M input |$1.50/M output

66K tokens ⓘ

Sakana: Fugu Ultra

Fugu Ultra is the higher-performance model in Sakana AI's Fugu family. Rather than a single monolithic model, Fugu is a learned multi-agent orchestration system: a language model trained to route...

by sakana |Th6 2026 |1M context |$5.00/M input |$30.00/M output

1M tokens ⓘ

Google: Nano Banana 2 (Gemini 3.1 Flash Image)

Gemini 3.1 Flash Image, a.k.a. "Nano Banana 2," is Google’s latest state of the art image generation and editing model, delivering Pro-level visual quality at Flash speed. It combines advanced...

by google |Th6 2026 |131K context |$0.5000/M input |$3.00/M output

131K tokens ⓘ

Google: Nano Banana Pro (Gemini 3 Pro Image)

Nano Banana Pro is Google’s most advanced image-generation and editing model, built on Gemini 3 Pro. It extends the original Nano Banana with significantly improved multimodal reasoning, real-world grounding, and...

by google |Th6 2026 |66K context |$2.00/M input |$12.00/M output

66K tokens ⓘ

Cohere: North Mini Code (free)

North Mini Code is Cohere's first agentic coding model and the debut of its North family. A sparse mixture-of-experts model with 30B total parameters and 3B active, it is optimized...

by cohere |Th6 2026 |256K context |Miễn phí input |Miễn phí output

256K tokens ⓘ

Z.ai: GLM 5.2

GLM 5.2 is a large-scale reasoning model from Z.ai. It supports text input and output with a 1M-token context window, and is suited for long-horizon agent workflows, project-level software engineering,...

by z-ai |Th6 2026 |1M context |$0.9300/M input |$3.00/M output

1M tokens ⓘ

OpenRouter: Fusion

Fusion turns your prompt into a small multi-model deliberation. A panel of expert models (see below) analyzes your prompt in parallel with web search and web fetch enabled, then a...

by openrouter |Th6 2026 |1M context |Miễn phí input |Miễn phí output

1M tokens ⓘ

MoonshotAI: Kimi K2.7 Code

MoonshotAI: Kimi K2.7 Code is a coding-focused model in Moonshot AI's Kimi K2 family, built to complete end-to-end programming tasks reliably over long contexts. It uses a native multimodal mixture-of-experts...

by moonshotai |Th6 2026 |262K context |$0.7400/M input |$3.50/M output

262K tokens ⓘ

Anthropic: Claude Fable Latest

This model always redirects to the latest model in the Claude Fable family.

by ~anthropic |Th6 2026 |1M context |$10.00/M input |$50.00/M output

1M tokens ⓘ

Anthropic: Claude Fable 5

Claude Fable 5 is a Mythos-class model from Anthropic, built for autonomous knowledge work and coding. It supports text, image, and file inputs with text output, with reasoning support and...

by anthropic |Th6 2026 |1M context |$10.00/M input |$50.00/M output

1M tokens ⓘ

Nex AGI: Nex-N2-Pro

Nex-N2-Pro is an agentic mixture-of-experts model from Nex AGI, with 17B active parameters out of 397B total. Built on the Qwen3.5 architecture, it accepts text and image input and produces...

by nex-agi |Th6 2026 |262K context |$0.2500/M input |$1.00/M output

262K tokens ⓘ

NVIDIA: Nemotron 3.5 Content Safety (free)

NVIDIA Nemotron 3.5 Content Safety is a compact 4B-parameter multimodal guardrail model from NVIDIA, fine-tuned from Google Gemma-3-4B. It moderates both inputs to and responses from LLMs and VLMs, accepting...

by nvidia |Th6 2026 |128K context |Miễn phí input |Miễn phí output

128K tokens ⓘ

NVIDIA: Nemotron 3 Ultra (free)

NVIDIA Nemotron 3 Ultra is an open frontier-reasoning and orchestration model from NVIDIA, with 55B active parameters out of 550B total (MoE). Built on a hybrid Transformer-Mamba mixture-of-experts architecture, it...

by nvidia |Th6 2026 |1M context |Miễn phí input |Miễn phí output

1M tokens ⓘ

NVIDIA: Nemotron 3 Ultra

by nvidia |Th6 2026 |1M context |$0.5000/M input |$2.20/M output

1M tokens ⓘ

Qwen: Qwen3.7 Plus

Qwen3.7-Plus is a cost-effective model in Alibaba's Qwen3.7 series. It supports text and image input with text output, building on the series' text capabilities with a comprehensive upgrade to its...

by qwen |Th6 2026 |1M context |$0.3200/M input |$1.28/M output

1M tokens ⓘ

MiniMax: MiniMax M3

MiniMax-M3 is a multimodal foundation model from MiniMax. It supports text, image, and video inputs with text output, a 1M-token context window, and is suited for long-horizon agentic work, coding,...

by minimax |Th5 2026 |1M context |$0.3000/M input |$1.20/M output

1M tokens ⓘ

StepFun: Step 3.7 Flash

Step 3.7 Flash is StepFun's latest high-efficiency multimodal Mixture-of-Experts model. It pairs a 196B-parameter language backbone with a vision encoder for native image and video understanding, activating roughly 11B parameters...

by stepfun |Th5 2026 |256K context |$0.2000/M input |$1.15/M output

256K tokens ⓘ

Anthropic: Claude Opus 4.8 (Fast)

Fast-mode variant of [Opus 4.8](/anthropic/claude-opus-4.8) - identical capabilities with higher output speed at 2x pricing relative to regular Opus 4.8. Learn more in Anthropic's docs: https://platform.claude.com/docs/en/build-with-claude/fast-mode

by anthropic |Th5 2026 |1M context |$10.00/M input |$50.00/M output

1M tokens ⓘ

Anthropic: Claude Opus 4.8

Claude Opus 4.8 is Anthropic's most capable generally available model in the Opus family. It supports text, image, and file inputs with text output, with reasoning support and a 1M-token...

by anthropic |Th5 2026 |1M context |$5.00/M input |$25.00/M output

1M tokens ⓘ

Qwen: Qwen3.7 Max

Qwen3.7-Max is the flagship model in Alibaba's Qwen3.7 series. It supports text input and output and is designed for agent-centric workloads, with particular strengths in coding, office and productivity tasks,...

by qwen |Th5 2026 |1M context |$1.25/M input |$3.75/M output

1M tokens ⓘ

xAI: Grok Build 0.1

Grok Build 0.1 is xAI’s fast coding model trained specifically for agentic software engineering workflows. It supports text and image inputs with text output, and is optimized for interactive coding...

by x-ai |Th5 2026 |256K context |$1.00/M input |$2.00/M output

256K tokens ⓘ

Google: Gemini 3.5 Flash

Gemini 3.5 Flash is Google's high-efficiency multimodal model, bringing near-Pro level coding and reasoning at Flash-tier cost and speed. It is highly optimized for coding proficiency and parallel agentic execution...

by google |Th5 2026 |1M context |$1.50/M input |$9.00/M output

1M tokens ⓘ

Anthropic: Claude Opus 4.7 (Fast)

Fast-mode variant of [Opus 4.7](/anthropic/claude-opus-4.7) - identical capabilities with higher output speed at premium 6x pricing. Learn more in Anthropic's docs: https://platform.claude.com/docs/en/build-with-claude/fast-mode

by anthropic |Th5 2026 |1M context |$30.00/M input |$150.00/M output

1M tokens ⓘ

Perceptron: Perceptron Mk1

Perceptron Mk1 (Mark One) is Perceptron's highest-quality vision-language model for video and embodied reasoning.** It accepts image and video inputs paired with natural language queries, and produces detailed visual understanding...

by perceptron |Th5 2026 |33K context |$0.1500/M input |$1.50/M output

inclusionAI: Ring-2.6-1T

Ring-2.6-1T is a 1T-parameter-scale thinking model with 63B active parameters, built for real-world agent workflows that require both strong capability and operational efficiency. It is optimized for coding agents, tool...

by inclusionai |Th5 2026 |262K context |$0.0750/M input |$0.6250/M output

262K tokens ⓘ

Google: Gemini 3.1 Flash Lite

Gemini 3.1 Flash Lite is Google’s GA high-efficiency multimodal model optimized for low-latency, high-volume workloads. It supports text, image, video, audio, and PDF inputs, and is designed for lightweight agentic...

by google |Th5 2026 |1M context |$0.2500/M input |$1.50/M output

1M tokens ⓘ

OpenAI: GPT Chat Latest

GPT Chat Latest points to OpenAI's stable API alias `chat-latest` that always resolves to the latest Instant chat model used in ChatGPT. As OpenAI rolls out new Instant model updates...

by openai |Th5 2026 |400K context |$5.00/M input |$30.00/M output

400K tokens ⓘ

xAI: Grok 4.3

Grok 4.3 is a reasoning model from xAI. It accepts text and image inputs with text output, and is suited for agentic workflows, instruction-following tasks, and applications requiring high factual...

by x-ai |Th4 2026 |1M context |$1.25/M input |$2.50/M output

1M tokens ⓘ

IBM: Granite 4.1 8B

Granite 4.1 8B is a dense, decoder-only 8-billion-parameter language model from IBM, part of the Granite 4.1 family. It supports a 131K-token context window and is designed for enterprise tasks...

by ibm-granite |Th4 2026 |131K context |$0.0500/M input |$0.1000/M output

131K tokens ⓘ

Mistral: Mistral Medium 3.5

Mistral Medium 3.5 is a dense 128B instruction-following model from Mistral AI. It supports text and image inputs with text output, and is designed for agentic workflows, coding, and complex...

by mistralai |Th4 2026 |262K context |$1.50/M input |$7.50/M output

262K tokens ⓘ

NVIDIA: Nemotron 3 Nano Omni (free)

NVIDIA Nemotron™ 3 Nano Omni is a 30B-A3B open multimodal model designed to function as a perception and context sub-agent in enterprise agent systems. It accepts text, image, video, and...

by nvidia |Th4 2026 |256K context |Miễn phí input |Miễn phí output

256K tokens ⓘ

Poolside: Laguna XS.2 (free)

Laguna XS.2 is the second-generation model in the XS size class from [Poolside](https://poolside.ai/), their efficient coding agent series. It combines tool calling and reasoning capabilities with a compact footprint, offering...

by poolside |Th4 2026 |262K context |Miễn phí input |Miễn phí output

262K tokens ⓘ

Poolside: Laguna XS.2

by poolside |Th4 2026 |262K context |$0.1000/M input |$0.2000/M output

262K tokens ⓘ

Poolside: Laguna M.1 (free)

Laguna M.1 is the flagship coding agent model from [Poolside](https://poolside.ai/), optimized for complex software engineering tasks. Designed for agentic coding workflows, it supports tool calling and reasoning, with a 256K...

by poolside |Th4 2026 |262K context |Miễn phí input |Miễn phí output

262K tokens ⓘ

Poolside: Laguna M.1

by poolside |Th4 2026 |262K context |$0.2000/M input |$0.4000/M output

262K tokens ⓘ

Anthropic Claude Haiku Latest

This model always redirects to the latest model in the Anthropic Claude Haiku family.

by ~anthropic |Th4 2026 |200K context |$1.00/M input |$5.00/M output

200K tokens ⓘ

OpenAI GPT Mini Latest

This model always redirects to the latest model in the OpenAI GPT Mini family.

by ~openai |Th4 2026 |400K context |$0.7500/M input |$4.50/M output

400K tokens ⓘ

Google Gemini Pro Latest

This model always redirects to the latest model in the Google Gemini Pro family.

by ~google |Th4 2026 |1M context |$2.00/M input |$12.00/M output

1M tokens ⓘ

MoonshotAI Kimi Latest

This model always redirects to the latest model in the MoonshotAI Kimi family.

by ~moonshotai |Th4 2026 |262K context |$0.6600/M input |$3.41/M output

262K tokens ⓘ

Google Gemini Flash Latest

This model always redirects to the latest model in the Google Gemini Flash family.

by ~google |Th4 2026 |1M context |$1.50/M input |$9.00/M output

1M tokens ⓘ

Anthropic Claude Sonnet Latest

This model always redirects to the latest model in the Anthropic Claude Sonnet family.

by ~anthropic |Th4 2026 |1M context |$2.00/M input |$10.00/M output

1M tokens ⓘ

OpenAI GPT Latest

This model always redirects to the latest model in the OpenAI GPT family.

by ~openai |Th4 2026 |1.1M context |$5.00/M input |$30.00/M output

1.1M tokens ⓘ

Qwen: Qwen3.5 Plus 2026-04-20

Qwen3.5 Plus (April 2026) is a large-scale multimodal language model from Alibaba. It accepts text, image, and video input and produces text output, with a 1M token context window. This...

by qwen |Th4 2026 |1M context |$0.3000/M input |$1.80/M output

1M tokens ⓘ

Qwen: Qwen3.6 Flash

Qwen3.6 Flash is a fast, efficient language model from Alibaba's Qwen 3.6 series. It supports text, image, and video input with a 1M token context window. Tiered pricing kicks in...

by qwen |Th4 2026 |1M context |$0.1875/M input |$1.13/M output

1M tokens ⓘ

Qwen: Qwen3.6 35B A3B

Qwen3.6-35B-A3B is an open-weight multimodal model from Alibaba Cloud with 35 billion total parameters and 3 billion active parameters per token. It uses a hybrid sparse mixture-of-experts architecture combining Gated...

by qwen |Th4 2026 |262K context |$0.1400/M input |$1.00/M output

262K tokens ⓘ

Qwen: Qwen3.6 Max Preview

Qwen3.6-Max-Preview is a proprietary frontier model from Alibaba Cloud built on a sparse mixture-of-experts architecture with approximately 1 trillion total parameters. It is optimized for agentic coding, tool use, and...

by qwen |Th4 2026 |262K context |$1.04/M input |$6.24/M output

262K tokens ⓘ

Qwen: Qwen3.6 27B

Qwen3.6 27B is a dense 27-billion-parameter language model from the Qwen Team at Alibaba, released in April 2026. It features hybrid multimodal capabilities — accepting text, image, and video inputs...

by qwen |Th4 2026 |262K context |$0.2850/M input |$2.40/M output

262K tokens ⓘ

AI Models

Tài khoản

🔑 Lấy lại mật khẩu