IBM Updates Granite Models
Written by Kay Ewbank   
Monday, 28 October 2024

IBM has released new Granite models that it says provide state-of-the-art performance relative to model size. The Granite 3.0 collection includes a new, instruction-tuned, dense decoder-only LLM.

The Granite series is a collection of generative AI models for language and code. The models come in different sizes and are built on a decoder-only architecture. They are trained on data from business-relevant domains, including financial, legal, IT, coding, and academic sources. They are designed for tasks such as retrieval augmented generation (RAG), where a knowledge base is searched to generate tailored responses to customer inquiries, and for condensing content into short descriptions.
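To make the RAG idea concrete, here is a minimal sketch in Python of the retrieve-then-prompt step. It is a toy illustration rather than anything IBM ships: the knowledge base, the function names and the prompt wording are all hypothetical, simple word overlap stands in for the embedding search a real system would use, and no Granite model is actually called.

```python
import re
from collections import Counter

# Hypothetical knowledge base; a real deployment would index company documents.
KNOWLEDGE_BASE = [
    "Refunds are issued within 14 days of a return being received.",
    "Support is available by chat from 9am to 5pm on weekdays.",
    "Orders over 50 euros ship free within the EU.",
]


def tokenize(text):
    """Lowercase the text and split it into alphanumeric words."""
    return re.findall(r"[a-z0-9]+", text.lower())


def score(query, passage):
    """Count the words shared by query and passage (a stand-in for embedding similarity)."""
    q = Counter(tokenize(query))
    p = Counter(tokenize(passage))
    return sum(min(q[t], p[t]) for t in q)


def retrieve(query, passages, k=2):
    """Return the k passages with the highest overlap score."""
    ranked = sorted(passages, key=lambda p: score(query, p), reverse=True)
    return ranked[:k]


def build_prompt(query, passages, k=2):
    """Assemble a prompt that grounds the model's answer in the retrieved passages."""
    context = "\n".join(f"- {p}" for p in retrieve(query, passages, k))
    return (
        "Answer using only the context below.\n\n"
        f"Context:\n{context}\n\n"
        f"Question: {query}\nAnswer:"
    )


if __name__ == "__main__":
    print(build_prompt("How long do refunds take?", KNOWLEDGE_BASE, k=1))
```

The resulting prompt is what would be passed to an instruction-tuned model, which then writes the answer from the retrieved context rather than from memory alone.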

IBM says the new Granite 3.0 8B Instruct has been trained using a novel two-phase method on over 12 trillion tokens of carefully vetted data across 12 different natural languages and 116 different programming languages. IBM says that fine-tuning smaller, fit-for-purpose models like Granite provides enterprises with a way to get frontier model performance at a fraction of the cost. Key to this is the ability to tailor Granite models to an organization's needs through InstructLab.

InstructLab is an AI project developed by IBM and Red Hat that can be used by developers to enhance Large Language Models (LLMs) for specific business needs. It is open source and provides systematically generated synthetic data and phased-training protocols.

All Granite models are released under the permissive Apache 2.0 license, and IBM is providing a detailed disclosure of training data sets and methodologies in the Granite 3.0 technical paper. 

IBM has also placed emphasis on model safety in this release, saying that Granite 3.0 8B Instruct demonstrates industry-leading robustness on the AttaQ benchmark, which measures an LLM's vulnerability to adversarial prompts designed to provoke models into generating harmful, inappropriate or otherwise undesirable content.

The updated Granite collection, including recipes and how-to guides, is available on GitHub, and developers can also experiment with the new Granite 3.0 8B Instruct model on the IBM Granite Playground.

Granite 3.0 models are now available on IBM watsonx.ai and through platform partners such as Google Vertex AI (through Google Cloud's Vertex AI Model Garden integrations with Hugging Face), Hugging Face, NVIDIA (as NIM microservices), Ollama and Replicate.

More Information

Watsonx Website

IBM Granite Playground

Related Articles

IBM Launches The Granite Code LLM Series

IBM Releases Watsonx Granite Models

IBM Announces WatsonX AI Platform
