Namaste, vanakkam, sat sri akaal — these are simply three types of greeting in India, a rustic with 22 constitutionally acknowledged languages and over 1,500 extra recorded by the nation’s census. Round 10% of its residents converse English, the web’s commonest language.
As India, the world’s most populous nation, forges forward with fast digitalization efforts, its enterprises and native startups are growing multilingual AI fashions that allow extra Indians to work together with know-how of their major language. It’s a case examine in sovereign AI — the event of home AI infrastructure that’s constructed on native datasets and displays a area’s particular dialects, cultures and practices.
These initiatives are constructing language fashions for Indic languages and English that may energy customer support AI brokers for companies, quickly translate content material to broaden entry to data, and allow companies to extra simply attain a various inhabitants of over 1.4 billion people.
To help initiatives like these, NVIDIA has launched a small language mannequin for Hindi, India’s most prevalent language with over half a billion audio system. Now out there as an NVIDIA NIM microservice, the mannequin, dubbed Nemotron-4-Mini-Hindi-4B, might be simply deployed on any NVIDIA GPU-accelerated system for optimized efficiency.
Tech Mahindra, an Indian IT companies and consulting firm, is the primary to make use of the Nemotron Hindi NIM microservice to develop an AI mannequin referred to as Indus 2.0, which is concentrated on Hindi and dozens of its dialects. Indus 2.0 harnesses Tech Mahindra’s high-quality fine-tuning information to additional increase mannequin accuracy, unlocking alternatives for shoppers in banking, training, healthcare and different industries to ship localized companies.
Tech Mahindra will showcase Indus 2.0 on the NVIDIA AI Summit, happening Oct. 23-25 in Mumbai. The corporate additionally makes use of NVIDIA NeMo to develop its sovereign massive language mannequin (LLM) platform, TeNo.
NVIDIA NIM Makes AI Adoption for Hindi as Simple as Ek, Do, Teen
The Nemotron Hindi mannequin has 4 billion parameters and is derived from Nemotron-4 15B, a 15-billion parameter multilingual language mannequin developed by NVIDIA. The mannequin was pruned, distilled and skilled with a mixture of real-world Hindi information, artificial Hindi information and an equal quantity of English information utilizing NVIDIA NeMo, an end-to-end, cloud-native framework and suite of microservices for growing generative AI.
The dataset was created with NVIDIA NeMo Curator, which improves generative AI mannequin accuracy by processing high-quality multimodal information at scale for coaching and customization. NeMo Curator makes use of NVIDIA RAPIDS libraries to speed up information processing pipelines on multi-node GPU techniques, decreasing processing time and whole value of possession. It additionally offers pre-built pipelines and constructing blocks for artificial information technology, information filtering, classification and deduplication to course of high-quality information.
After fine-tuning with NeMo, the ultimate mannequin leads on a number of accuracy benchmarks for AI fashions with as much as 8 billion parameters. Packaged as a NIM microservice, it may be simply harnessed to help use circumstances throughout industries reminiscent of training, retail and healthcare.
It’s out there as a part of the NVIDIA AI Enterprise software program platform, which supplies companies entry to further sources, together with technical help and enterprise-grade safety, to streamline AI growth for manufacturing environments.
Bevy of Companies Serves Multilingual Inhabitants
Innovators, main enterprises and world techniques integrators throughout India are constructing custom-made language fashions utilizing NVIDIA NeMo.
Corporations within the NVIDIA Inception program for cutting-edge startups are utilizing NeMo to develop AI fashions for a number of Indic languages.
Sarvam AI affords enterprise clients speech-to-text, text-to-speech, translation and information parsing fashions. The corporate developed Sarvam 1, India’s first homegrown, multilingual LLM, which was skilled from scratch on home AI infrastructure powered by NVIDIA H100 Tensor Core GPUs.
Sarvam 1 — developed utilizing NVIDIA AI Enterprise software program together with NeMo Curator and NeMo Framework — helps English and 10 main Indian languages, together with Bengali, Marathi, Tamil and Telugu.
Sarvam AI additionally makes use of NVIDIA NIM microservices, NVIDIA Riva for conversational AI, NVIDIA TensorRT-LLM software program and NVIDIA Triton Inference Server to optimize and deploy conversational AI brokers with sub-second latency.
One other Inception startup, Gnani.ai, constructed a multilingual speech-to-speech LLM that powers AI customer support assistants that deal with round 10 million real-time voice interactions each day for over 150 banking, insurance coverage and monetary companies corporations throughout India and the U.S. The mannequin helps 14 languages and was skilled on over 14 million hours of conversational speech information utilizing NVIDIA Hopper GPUs and NeMo Framework.
Gnani.ai makes use of TensorRT-LLM, Triton Inference Server and Riva NIM microservices to optimize its AI for digital customer support assistants and speech analytics.
Giant enterprises constructing LLMs with NeMo embody:
- Flipkart, a significant Indian ecommerce firm majority-owned by Walmart, is integrating NeMo Guardrails, an open-source toolkit that permits builders so as to add programmable guardrails to LLMs, to improve the protection of its conversational AI techniques.
- Krutrim, a part of the Ola Group of companies that features considered one of India’s high ride-booking platforms, is growing a multilingual Indic basis mannequin utilizing Mistral NeMo 12B, a state-of-the-art LLM developed by Mistral AI and NVIDIA.
- Zoho Company, a worldwide know-how firm based mostly in Chennai, will use NVIDIA TensorRT-LLM and NVIDIA Triton Inference Server to optimize and ship language fashions for its over 700,000 clients. The corporate will use NeMo operating on NVIDIA Hopper GPUs to pretrain slender, small, medium and enormous fashions from scratch for over 100 enterprise functions.
India’s high world techniques integrators are additionally providing NVIDIA NeMo-accelerated options to their clients.
- Infosys will work on particular instruments and options utilizing the NVIDIA AI stack. The corporate’s heart of excellence can be growing AI-powered small language fashions that will likely be provided to clients as a service.
- Tata Consultancy Companies has developed AI options based mostly on NVIDIA NIM Agent Blueprints for the telecommunications, retail, manufacturing, automotive and monetary companies industries. TCS’ choices embody NeMo-powered, domain-specific language fashions that may be custom-made to deal with buyer queries and reply company-specific questions for workers for all enterprise features reminiscent of IT, HR or subject operations.
- Wipro is utilizing NVIDIA AI Enterprise software program together with NIM Agent Blueprints and NeMo to assist companies simply develop customized conversational AI options reminiscent of digital people to help customer support interactions.
Wipro and TCS additionally use NeMo Curator’s artificial information technology pipelines to generate information in languages apart from English to customise LLMs for his or her shoppers.
To study extra about NVIDIA’s collaboration with companies and builders in India, watch the replay of firm founder and CEO Jensen Huang’s hearth chat on the NVIDIA AI Summit.

