Contact Us Careers Register

IBM Release Its New Granite 4.0 Hybrid AI Models with Lower Memory and Hardware Costs

03 Oct, 2025 - by CMI | Category : Information and Communication Technology

IBM Release Its New Granite 4.0 Hybrid AI Models with Lower Memory and Hardware Costs

IBM released Granite 4.0, its new family of open large language models (LLMs), showcasing a hybrid Mamba/transformer architecture to reduce memory requirements and hardware costs in the machine learning industry.

IBM, Granite 4.0 models can run on significantly cheaper GPUs while maintaining performance, as stated by IBM.

The models are open‑source under the Apache 2.0 license and are the first open models to receive ISO 42001 certification, confirming their alignment with international standards for AI security, governance, and transparency. All Granite 4.0 checkpoints are cryptographically signed to verify provenance and authenticity.

Granite 4.0 is accessible via IBM watsonx.ai and distributed through platform partners such as Dell Technologies, Docker Hub, Hugging Face, Kaggle, LM Studio, NVIDIA NIM, Ollama, OPAQUE, and Replicate. Support for access via Amazon SageMaker JumpStart and Microsoft Azure AI Foundry is planned to follow.

The release includes Granite-4.0-H-Small (32B parameters, 9B active), Granite-4.0-H-Tiny (7B parameters, 1B active), and Granite-4.0-H-Micro (3B parameters). A conventional transformer variant, Granite-4.0-Micro, is also available for platforms that do not yet support hybrid architectures.

Benchmarking shows Granite 4.0 models outperform prior Granite releases. The Granite‑4.0‑H‑Small model outpaces virtually all open‑weight models on the Stanford IFEval benchmark, trailing only Llama 4 Maverick. On the Berkeley Function Calling Leaderboard v3, Granite 4.0 holds its ground against much larger models—delivering comparable performance at a lower cost.

Before launch, enterprise partners such as EY and Lockheed Martin tested Granite 4.0. IBM also teamed up with HackerOne to run a bug bounty program, offering up to $100,000 for vulnerabilities or jailbreak exploits.

Granite 4.0 was trained on a 22 trillion‑token, enterprise‑focused corpus. Instruction‑tuned models are available now, while reasoning‑optimized models are slated to release later this fall. IBM plans to roll out more variants by year-end—including Granite 4.0 Medium and Granite 4.0 Nano.

About Author

Shivani Latey

Shivani Latey

Shivani Latey is a talented content writer with over three years of experience specializing in crafting informative and engaging articles and blog posts. Known for her ability to simplify complex topics, she ensures that her content is clear, accessible, and resonates with a wide range of audiences. Her strengths lie in distilling intricate concepts into digesti... View more

LogoCredibility and Certifications

Trusted Insights, Certified Excellence! Coherent Market Insights is a certified data advisory and business consulting firm recognized by global institutes.

Reliability and Reputation

860519526

Reliability and Reputation
ISO 9001:2015

9001:2015

ISO 27001:2022

27001:2022

Reliability and Reputation
Reliability and Reputation
© 2025 Coherent Market Insights Pvt Ltd. All Rights Reserved.