CryptoABC.net

NVIDIA Launches Granary Dataset to Enhance Multilingual Speech AI

August 15, 2025
in Blockchain


Jessie A Ellis
Aug 15, 2025 09:01

NVIDIA introduces the Granary dataset and models designed to improve speech recognition and translation across 25 European languages, addressing data scarcity in AI language models.





NVIDIA has unveiled a new open dataset and models aimed at advancing multilingual speech AI, addressing the limited language support in existing AI language models. The Granary dataset, alongside the NVIDIA Canary and Parakeet models, seeks to enhance speech recognition and translation capabilities for 25 European languages, including underrepresented ones such as Croatian, Estonian, and Maltese, according to NVIDIA’s blog.

Granary Dataset: A New Resource for AI Developers

The Granary dataset is a collection of multilingual speech data comprising approximately one million hours of audio: nearly 650,000 hours for speech recognition and over 350,000 hours for speech translation. It is available on Hugging Face, where developers can use it to build multilingual chatbots, customer service voice agents, and real-time translation services.
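The hour counts quoted above are rounded, so a quick check of how they combine can be useful. The sketch below uses only the approximate figures from this article; the variable names are illustrative and the shares are no more precise than the inputs.

```python
# Approximate figures quoted in the article; both are rounded.
asr_hours = 650_000   # "nearly 650,000 hours" for speech recognition
ast_hours = 350_000   # "over 350,000 hours" for speech translation

total_hours = asr_hours + ast_hours
asr_share = asr_hours / total_hours
ast_share = ast_hours / total_hours

print(f"total: ~{total_hours:,} hours")
print(f"recognition share: {asr_share:.0%}")
print(f"translation share: {ast_share:.0%}")
```

On these rounded inputs, roughly two thirds of the audio is for recognition and one third for translation.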

Developed in collaboration with Carnegie Mellon University and Fondazione Bruno Kessler, the Granary dataset was built with NVIDIA’s NeMo Speech Data Processor toolkit, which transforms unlabeled audio into structured, high-quality data. This processing pipeline makes public speech data usable for training without extensive human annotation. The dataset covers nearly all of the European Union’s official languages, plus Russian and Ukrainian.
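To make the idea of turning machine-transcribed audio into structured training data more concrete, here is a minimal sketch. It is not NVIDIA's pipeline and does not use the NeMo Speech Data Processor API. The `Segment` class, the `confidence` score, and both thresholds are hypothetical. The output layout (one JSON object per line with `audio_filepath`, `duration`, and `text`) follows the manifest convention used by NeMo ASR tooling.

```python
import json
from dataclasses import dataclass


@dataclass
class Segment:
    """One machine-transcribed audio segment (hypothetical structure)."""
    audio_filepath: str
    duration: float      # seconds
    text: str            # automatically generated transcript
    confidence: float    # hypothetical quality score in [0, 1]


def keep(segment: Segment, min_conf: float = 0.8, max_duration: float = 40.0) -> bool:
    """Illustrative filter: drop empty, overlong, or low-confidence segments."""
    return (
        bool(segment.text.strip())
        and 0.0 < segment.duration <= max_duration
        and segment.confidence >= min_conf
    )


def to_manifest_lines(segments: list[Segment]) -> list[str]:
    """Serialize surviving segments as JSON lines, one utterance per line."""
    return [
        json.dumps(
            {"audio_filepath": s.audio_filepath, "duration": s.duration, "text": s.text},
            ensure_ascii=False,
        )
        for s in segments
        if keep(s)
    ]


segments = [
    Segment("clip_001.wav", 12.4, "Dobar dan, kako ste?", 0.93),
    Segment("clip_002.wav", 3.1, "", 0.99),                 # empty transcript
    Segment("clip_003.wav", 8.7, "Tere hommikust.", 0.41),  # low confidence
]
manifest = to_manifest_lines(segments)
print("\n".join(manifest))
```

Only the first clip survives the filter here. A real pipeline would likely apply many more checks, such as language identification and text normalization.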

Introducing NVIDIA Canary and Parakeet Models

The NVIDIA Canary-1b-v2 and Parakeet-tdt-0.6b-v3 models, trained on the Granary dataset, offer powerful tools for transcription and translation. Canary-1b-v2, a billion-parameter model, supports high-quality transcription of European languages and translation between English and 24 other languages. Meanwhile, Parakeet-tdt-0.6b-v3, with 600 million parameters, is optimized for real-time or large-volume transcription tasks.

Both models are designed to provide accurate punctuation, capitalization, and word-level timestamps in their outputs. Canary-1b-v2 is particularly notable for its efficiency, offering transcription and translation quality comparable to models three times its size, while running inference up to ten times faster.
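Word-level timestamps are what make outputs like subtitles possible. The sketch below groups word timings into SRT subtitle cues. The input shape (a list of dictionaries with `word`, `start`, and `end` in seconds) is an assumption made for illustration, not a guaranteed output format of either model, and the `words_to_srt` helper is hypothetical.

```python
def srt_time(seconds: float) -> str:
    """Format seconds as an SRT timestamp, HH:MM:SS,mmm."""
    millis = round(seconds * 1000)
    hours, millis = divmod(millis, 3_600_000)
    minutes, millis = divmod(millis, 60_000)
    secs, millis = divmod(millis, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{millis:03d}"


def words_to_srt(words: list[dict], max_words: int = 7) -> str:
    """Group word timestamps into numbered subtitle cues of up to max_words."""
    cues = []
    for i in range(0, len(words), max_words):
        chunk = words[i:i + max_words]
        start, end = chunk[0]["start"], chunk[-1]["end"]
        text = " ".join(w["word"] for w in chunk)
        cues.append(f"{len(cues) + 1}\n{srt_time(start)} --> {srt_time(end)}\n{text}\n")
    return "\n".join(cues)


# Hypothetical word-level output; times are in seconds.
words = [
    {"word": "Granary", "start": 0.00, "end": 0.48},
    {"word": "covers", "start": 0.52, "end": 0.90},
    {"word": "Maltese.", "start": 0.95, "end": 1.60},
]
print(words_to_srt(words, max_words=2))
```

Grouping by a fixed word count keeps the example short. Splitting cues at punctuation or at long pauses between words would usually read better.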

Advancing Speech AI Innovation

By sharing the methodology behind Granary and its associated models, NVIDIA enables speech AI developers to adapt similar data processing workflows to other automatic speech recognition (ASR) or automatic speech translation (AST) models. The models and dataset are publicly available under a permissive license, encouraging widespread use and adaptation.

The Granary dataset and NVIDIA’s new models represent a significant step forward in addressing the challenges of data scarcity in speech AI, particularly for languages that have been historically underrepresented in AI language models. This initiative not only broadens the scope of multilingual speech recognition and translation but also enhances the inclusivity and effectiveness of AI technologies globally.

The Granary dataset and models are available for exploration on Hugging Face, and further details can be accessed on NVIDIA’s blog.


