CryptoABC.net

NVIDIA Launches Granary Dataset to Enhance Multilingual Speech AI

August 15, 2025
in Blockchain
Jessie A Ellis
Aug 15, 2025 09:01

NVIDIA introduces the Granary dataset and models designed to improve speech recognition and translation across 25 European languages, addressing data scarcity in AI language models.





NVIDIA has unveiled a new open dataset and models aimed at advancing multilingual speech AI, addressing the limited language support in existing AI language models. The Granary dataset, alongside the NVIDIA Canary and Parakeet models, seeks to enhance speech recognition and translation capabilities for 25 European languages, including underrepresented ones such as Croatian, Estonian, and Maltese, according to NVIDIA’s blog.

Granary Dataset: A New Resource for AI Developers

The Granary dataset is a comprehensive collection of multilingual speech datasets, encompassing approximately a million hours of audio. This includes nearly 650,000 hours dedicated to speech recognition and over 350,000 hours for speech translation. The dataset is accessible on Hugging Face, providing a valuable resource for developers to scale AI applications globally, facilitating the creation of multilingual chatbots, customer service voice agents, and real-time translation services.

Developed in collaboration with Carnegie Mellon University and Fondazione Bruno Kessler, the Granary dataset uses NVIDIA’s NeMo Speech Data Processor toolkit to transform unlabeled audio into structured, high-quality data. This processing pipeline enhances public speech data without requiring extensive human annotation, which makes the dataset a practical resource for AI training in nearly all of the European Union’s 24 official languages, plus Russian and Ukrainian.

Introducing NVIDIA Canary and Parakeet Models

The NVIDIA Canary-1b-v2 and Parakeet-tdt-0.6b-v3 models, trained on the Granary dataset, offer powerful tools for transcription and translation. Canary-1b-v2, a billion-parameter model, supports high-quality transcription of European languages and translation between English and 24 other languages. Meanwhile, Parakeet-tdt-0.6b-v3, with 600 million parameters, is optimized for real-time or large-volume transcription tasks.
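
For a rough sense of scale, the parameter counts above can be turned into a weight-memory estimate. The sketch below assumes round figures of 1 billion and 600 million parameters and 16-bit weights. The precision, the helper name `weight_memory_gb`, and the decision to ignore activations and runtime overhead are assumptions made for illustration, not figures published by NVIDIA.

```python
# Back-of-the-envelope weight memory for the two models.
# Assumption: weights are stored in 16-bit precision (2 bytes per parameter).
BYTES_PER_PARAM_FP16 = 2


def weight_memory_gb(num_params: int, bytes_per_param: int = BYTES_PER_PARAM_FP16) -> float:
    """Return approximate weight storage in gigabytes (10**9 bytes)."""
    return num_params * bytes_per_param / 1e9


# Round parameter counts taken from the model descriptions in the article.
models = {
    "canary-1b-v2": 1_000_000_000,
    "parakeet-tdt-0.6b-v3": 600_000_000,
}

for name, params in models.items():
    print(f"{name}: ~{weight_memory_gb(params):.1f} GB of weights at 16-bit precision")
```

Under these assumptions the weights alone come to about 2.0 GB and 1.2 GB respectively. Actual memory use during inference would be higher.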

Both models are designed to provide accurate punctuation, capitalization, and word-level timestamps in their outputs. Canary-1b-v2 is particularly notable for its efficiency, offering transcription and translation quality comparable to models three times its size, while running inference up to ten times faster.
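
Word-level timestamps are useful for tasks such as subtitle generation. The following is a minimal sketch, not the models' actual output format or API: the `Word` record, the sample words, and both helper functions are hypothetical, and a real pipeline would first map the model's output into a structure like this one.

```python
from dataclasses import dataclass


@dataclass
class Word:
    """Hypothetical record for one transcribed word with its timing."""
    text: str
    start: float  # seconds from the start of the audio
    end: float    # seconds from the start of the audio


def to_srt_time(seconds: float) -> str:
    """Format seconds as an SRT timestamp, HH:MM:SS,mmm."""
    ms = round(seconds * 1000)
    hours, ms = divmod(ms, 3_600_000)
    minutes, ms = divmod(ms, 60_000)
    secs, ms = divmod(ms, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{ms:03d}"


def words_to_srt(words: list[Word], max_words: int = 7) -> str:
    """Group words into subtitle cues of at most max_words words each."""
    cues = []
    for i in range(0, len(words), max_words):
        chunk = words[i:i + max_words]
        text = " ".join(w.text for w in chunk)
        timing = f"{to_srt_time(chunk[0].start)} --> {to_srt_time(chunk[-1].end)}"
        cues.append(f"{len(cues) + 1}\n{timing}\n{text}\n")
    return "\n".join(cues)


# Made-up sample data standing in for a punctuated, capitalized transcript.
sample = [Word("Dobar", 0.0, 0.4), Word("dan.", 0.45, 0.9)]
print(words_to_srt(sample))
```

Each cue takes its start time from the first word in the group and its end time from the last, so the subtitle timing follows the word timestamps directly.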

Advancing Speech AI Innovation

By sharing the methodology behind Granary and its associated models, NVIDIA is enabling the global speech AI developer community to adapt similar data processing workflows to other automatic speech recognition (ASR) or automatic speech translation (AST) models. The models and dataset are publicly available under a permissive license, encouraging widespread use and adaptation.

The Granary dataset and NVIDIA’s new models represent a significant step forward in addressing the challenges of data scarcity in speech AI, particularly for languages that have been historically underrepresented in AI language models. This initiative not only broadens the scope of multilingual speech recognition and translation but also enhances the inclusivity and effectiveness of AI technologies globally.

The Granary dataset and models are available for exploration on Hugging Face, and further details can be accessed on NVIDIA’s blog.


