š New Models Available: grok-4 now live!
ā Explore New Features
glm-4.5
, glm-4.5-air
, glm-4.5-x
, glm-4.5-airx
, glm-4.5-flash
glm-4.5
: Flagship model with 355B total parameters and 32B active parameters, designed for agentic applications, supporting hybrid reasoning modes and excelling in complex reasoning, tool calling, and web browsing.glm-4.5-air
: Cost-effective model with 106B total parameters and 12B active parameters, maintaining strong performance while significantly reducing costs, ideal for resource-sensitive applications.glm-4.5-x
: High-performance model optimized for ultra-fast inference and powerful reasoning capabilities, delivering millisecond-level response times for scenarios requiring speed and logic.glm-4.5-airx
: Lightweight yet powerful model combining Air's cost advantages with X's speed benefits, offering the perfect balance between performance and efficiency.glm-4.5-flash
: Efficient multi-purpose model with high generation speed, specifically optimized for coding and reasoning tasks, suitable for developers getting started and rapid prototyping.gemini-2.5-pro-all
, gemini-2.5-flash-all
, gemini-2.5-pro-deepsearch
, gemini-2.5-flash-deepsearch
, deepseek-r1t2-chimera
gemini-2.5-pro-all
: A multimodal version of the Gemini model,gemini-2.5-flash-all
: A multimodal version of the Gemini model,gemini-2.5-pro-deepsearch
: A deep search model with enhanced deep search and information retrieval capabilitiesgemini-2.5-flash-deepsearch
: A deep search model combining the rapid performance of the Flash model with advanced deep search capabilities for fast, in-depth information discovery.deepseek-r1t2-chimera
: A 671B parameter Mixture-of-Experts (MoE) text generation model merged from DeepSeek-AI's R1-0528, R1, and V3-0324, supporting a context of up to 60k tokens.qwen3-coder-plus
qwen3-coder-plus
: Focused on code generation, understanding, and optimization, excels in complex programming tasks.qwen3-coder-plus-2025-07-22
qwen3-coder-plus-2025-07-22
: Optimized version from 2025-07-22, stable and reliable, suitable for production.qwen3-coder-480b-a35b-instruct
qwen3-coder-480b-a35b-instruct
: Flagship model with 480 billion parameters, MoE architecture, capable of handling extremely complex programming.Suno v4.5+
Suno v4.5+
: v4.5+ has richer sounds, new creation methods, and a maximum length of 8 minutes. This website currently supports Suno 4.5+. Please change the request parameter mv
to chirp-bluejay
.CometAPI supports Midjourney uploading masked images for local modifications
kimi-k2-0711-preview
kimi-k2-0711-preview
: Kimi K2 is a large-scale mixed-expertise (MoE) language model developed by Moonshot AI.CometAPI now supports direct calls to the OpenAI API to process PDFs without uploading files by providing the URL of the PDF file.
š CometAPI supports Claude code!
grok-4
grok-4-0709
grok-4
ļ¼grok-4-0709
: Currently supports text modal, with visual, image generation and other features coming soon. Extremely powerful technical parameters and ecological capabilities: Context Window: Supports up to 256,000 tokens of contextualization, ahead of mainstream models.Suno now supports stem separation, creating Persona, generating MP4 MV videos, getting WAV format files, and Timing: lyrics & audio timeline
veo3
veo3-pro
veo3-fast
veo3-frames
veo3-fast-frames
veo3-pro-frames
veo3
,veo3-pro
,veo3-fast
: is the official Google's latest video generation model, the generated video with sound, the world's only video model with sound. veo3-frames,veo3-fast-frames,veo3-pro-frames Support first frame mode.mj_fast_video
kling_image_expand
black-forest-labs/flux-kontext-pro
black-forest-labs/flux-kontext-max
flux-kontext-pro
flux-kontext-max
black-forest-labs/flux-kontext-pro
, black-forest-labs/flux-kontext-max
:flux-kontext-pro
, flux-kontext-max
:gemini-2.5-flash-lite-preview-06-17
gemini-2.5-flash-lite-preview-06-17
: Large scale processingļ¼Lower cost.o3-pro
o3-pro-2025-06-10
o3-pro
,o3-pro-2025-06-10
: Supports web search, file analysis, visual input reasoning, Python programming, and personalized responses.curl --location --request POST 'https://api.cometapi.com/v1/responses' \
--header 'Authorization: Bearer sk-xxxxxx' \
--header 'User-Agent: Apifox/1.0.0 (https://apifox.com)' \
--header 'Content-Type: application/json' \
--header 'Accept: */*' \
--header 'Host: api.cometapi.com' \
--header 'Connection: keep-alive' \
--data-raw '{
"model": "o3-pro",
"input": [{"role": "user", "content": "Whatās the difference between inductive and deductive reasoning?"}]
}'
gemini-2.5-pro-preview-06-05
gemini-2.5-pro-preview-06-05
: With native multimodal processing capabilities and a very long context window of up to 1 million words (Token), it provides unprecedented power for processing complex, long sequence tasks.gemini-2.5-flash-preview-05-20
\ gemini-2.5-flash-preview-04-17
\ gemini-2.5-pro-preview-05-06
\ gemini-2.5-pro-preview-03-25
\ gemini-2.5-pro -preview-03-25
\ gemini-2.5-pro-exp-03-25
;* gemini-2.5-pro-exp-03-25
deepseek-r1-0528
deepseek-r1-0528
: Advanced reasoning capabilities, large parameter scale, powerful performance, suitable for complex tasks.claude-sonnet-4-20250514
claude-sonnet-4-20250514
: An important model in the Claude 4 series developed by Anthropic, significantly improving coding and reasoning capabilities compared to its predecessor Claude Sonnet 3.7.claude-sonnet-4-20250514-thinking
claude-sonnet-4-20250514-thinking
: An important model in the Claude 4 series developed by Anthropic, significantly improving coding and reasoning capabilities compared to its predecessor Claude Sonnet 3.7.claude-opus-4-20250514
claude-opus-4-20250514
: Opus 4 is Anthropic's most advanced model, acclaimed as the world's best coding model.claude-opus-4-20250514-thinking
claude-opus-4-20250514-thinking
: Opus 4 is Anthropic's most advanced model, acclaimed as the world's best coding model.Suno v4.5
Suno v4.5
: v4.5 has more expressive music and richer vocals, designed to enhance the user's expression and intuition in music creation. This site now supports Suno 4.5, change the request parameter mv to chirp-aukqwen3-235b-a22b
qwen3-235b-a22b
: This is the flagship model of the Qwen3 series, with 235 billion parameters, utilizing a Mixture of Experts (MoE) architecture.qwen3-30b-a3b
qwen3-30b-a3b
: With 30 billion parameters, it balances performance and resource requirements, suitable for enterprise-level applications.qwen3-8b
qwen3-8b
: A lightweight model with 800 million parameters, designed specifically for resource-constrained environments (such as mobile devices or low-configuration servers).gpt-image-1
{
"model": "gpt-image-1",
"prompt": "A cute baby sea otter",
"n": 1,
"size": "1024x1024"
}
gemini-2.5-flash-preview-04-17
gemini-2.5-flash-preview-04-17
, Gemini 2.5 Flash is an AI model developed by Google, designed to provide developers with fast and cost-effective solutions, especially suitable for applications requiring enhanced reasoning capabilities.o4-mini
o4-mini-2025-04-16
o4-mini
, o4-mini-2025-04-16
: A smaller, faster, and more economical model, research shows it performs well in mathematics, coding, and visual tasks, designed to be efficient and responsive, suitable for developers. Released on April 16, 2025.o3
o3-2025-04-16
o3
, o3-2025-04-16
: A reflective generative pre-trained transformer (GPT) model designed to handle problems requiring step-by-step logical reasoning.gpt-4.1
gpt-4.1
: Major advancements in coding and instruction following; GPT-4.1 has become the leading model for coding.gpt-4.1-mini
gpt-4.1-mini
: Represents a significant leap in small model performance, even outperforming GPT-4o on many benchmarks.gpt-4.1-nano
gpt-4.1-nano
: Features a larger context windowāsupporting up to 1 million context tokensgrok-3-deepersearch
grok-3-deepersearch
: Features high data timeliness, excellent interactive experience, and thorough search thinking process; comprehensive webpage aggregation.gemini-2.0-flash-exp-image-generation
grok-3-fast
grok-3-fast-latest
grok-3-fast
, grok-3-fast-latest
: grok-3 and grok-3-fast use exactly the same underlying model and provide the same response quality. However, grok-3-fast is served on faster infrastructure, delivering response times that are much quicker than the standard grok-3.grok-3-mini
grok-3-mini-latest
grok-3-mini
, grok-3-mini-latest
: A lightweight model that thinks before responding. Fast, intelligent, and ideal for logic-based tasks that don't require deep domain knowledge. The original thought traces are accessible.grok-3-mini-fast
grok-3-mini-fast-latest
grok-3-mini-fast
, grok-3-mini-fast-latest
: grok-3-mini and grok-3-mini-fast use exactly the same underlying model and provide the same response quality. However, grok-3-mini-fast is served on faster infrastructure, delivering response times that are much quicker than the standard grok-3-mini.llama-4-maverick
llama-4-scout
gpt-4o-all
gpt-4o-image
gemini-2.5-pro-exp-03-25
gemini-2.5-pro-preview-03-25
gpt-4.5-preview-2025-02-27
gpt-4.5-preview
gpt-4.5
claude-3-7-sonnet-thinking
claude-3-7-sonnet-20250219
cometapi-3-7-sonnet
cometapi-3-7-sonnet-thinking