GLM-5
Open-source LLM with 744B parameters, featuring enhanced coding and agentic capabilities—approaches Claude Opus 4.5 in coding benchmarks.
What is GLM-5?
GLM-5 is Zhipu AI's latest flagship open-source large language model with 744 billion parameters. Released February 11, 2026, it features enhanced coding capabilities and the ability to perform long-running agent tasks, approaching Anthropic's Claude Opus 4.5 in coding benchmark tests and surpassing Google's Gemini 3 Pro on some benchmarks. The model leverages DeepSeek Sparse Attention technology for improved performance and achieves a record-low hallucination rate. Available via Z.ai platform and WaveSpeed API, with pricing approximately $0.80-$1.00 per million input tokens and $2.56-$3.20 per million output tokens—roughly 6x cheaper than Claude Opus 4.6.
Best for: Cost-sensitive projects · Self-hosted deployments · Coding-heavy workloads

Developer Stack Fit
Quick read on where GLM-5 fits in a software team's AI stack. Validate final fit against your codebase, data policy, and deployment model.
- Stack layer
- LLM APIs
- Deployment model
- Open-source deployable
- Open-source status
- Yes or source-available
- API support
- API or integration-friendly
- MCP support
- No MCP signal found
- Security posture
- Review vendor privacy and data retention
- Best use case
- Cost-sensitive projects
Discovery graph
Featured in NeuralStackly paths
Product media
Interface proof

Key Features
- 01
744 billion parameters
Open-source LLM competitive with top models
- 02
Enhanced coding capabilities approaching Claude Opus 4.5
Strong coding performance
- 03
Long-running agent task support
Cost-effective API pricing (~6x cheaper than Claude)
- 04
DeepSeek Sparse Attention integration
A core development capability that teams use daily.
- 05
Record-low hallucination rate
A core development capability that teams use daily.
- 06
Available on OpenRouter
A core development capability that teams use daily.
- 07
Open-source deployment option
A core development capability that teams use daily.
Pros & Cons
What stands out
- Open-source with self-deployment option
- Competitive coding benchmarks
- Significantly cheaper API access than comparable models
- Supports long-running agent tasks
Watch outs
- Chinese AI company (data residency considerations)
- Limited Western market presence
- API requires account with Z.ai or WaveSpeed
Pricing Plans
GLM-5 Pricing
Choose the perfect plan for your needs. All plans include our core features with different usage limits and advanced capabilities.
Open Source
API Access
Need a Custom Solution?
Looking for enterprise features or custom pricing? Contact GLM-5 directly for tailored solutions.
Contact SalesMost teams land on the Open Source plan.
Alternatives
FAQ
What is GLM-5 and how does it work?
GLM-5 is a development tool that open-source llm with 744b parameters, featuring enhanced coding and agentic capabilities—approaches claude opus 4.5 in coding benchmarks.. It uses AI to help users improve productivity through analyzing input and generating relevant output.
Is GLM-5 free to use?
GLM-5 offers a completely free plan. You can get started without paying anything.
Does GLM-5 have a free trial?
Yes — 14-day free trial (credit card required)
What can GLM-5 do?
Explore by task
More development Tools
Cursor
AI-powered code editor with autonomous agents, multi-model support, and Automations for triggering agents via code changes, Slack, or timers.
TurboQuant
Revolutionary KV cache compression achieving 6x memory reduction and 8x speedup for LLM inference with zero accuracy loss.
Ollama
Local-first LLM runtime for running models on your hardware with local privacy, no per-token API costs, and offline-capable workflows.
Affiliate Disclosure: We may earn a commission when you purchase through links on our site. This doesn't affect our editorial independence or the price you pay.
GLM-5
Free