• Live Crypto Prices
  • Crypto News
    • Worldwide
      • Bitcoin
      • Ethereum
      • Altcoin
      • Blockchain
      • Regulation
    • Australian Crypto News
  • Education
    • Cryptocurrency For Beginners
    • Where to Buy Cryptocurrency
    • Where to Store Cryptos
    • Cryptocurrency Tax in Australia 2021
No Result
View All Result
CryptoABC.net
No Result
View All Result

IBM Unveils Breakthroughs in PyTorch for Faster AI Model Training

September 18, 2024
in Blockchain
Reading Time: 2min read
0 0
A A
0
Crypto Innovations and IBM’s Role in the Evolving Payments Landscape
0
SHARES
20
VIEWS
ShareShareShareShareShare


Jessie A Ellis
Sep 18, 2024 12:38

IBM Research reveals advancements in PyTorch, including a high-throughput data loader and enhanced training throughput, aiming to revolutionize AI model training.





IBM Research has announced significant advancements in the PyTorch framework, aiming to enhance the efficiency of AI model training. These improvements were presented at the PyTorch Conference, highlighting a new data loader capable of handling massive data and significant enhancements to large language model (LLM) training throughput.

Enhancements to PyTorch’s Data Loader

The new high-throughput data loader allows PyTorch users to distribute LLM training workloads seamlessly across multiple machines. This innovation enables developers to save checkpoints more efficiently, reducing duplicated work. According to IBM Research, this tool was developed out of necessity by Davis Wertheimer and his colleagues, who needed a solution to manage and stream vast quantities of data across multiple devices efficiently.

Initially, the team faced challenges with existing data loaders, which caused bottlenecks in training processes. By iterating and refining their approach, they created a PyTorch-native data loader that supports dynamic and adaptable operations. This tool ensures that previously seen data isn’t revisited, even if the resource allocation changes mid-job.

In stress tests, the data loader managed to stream 2 trillion tokens over a month of continuous operation without any failures. It demonstrated the capability to load over 90,000 tokens per second per worker, translating to half a trillion tokens per day on 64 GPUs.

Maximizing Training Throughput

Another significant focus for IBM Research is optimizing GPU usage to prevent bottlenecks in AI model training. The team has employed fully sharded data parallel (FSDP) techniques to distribute large training datasets evenly across multiple machines, enhancing the efficiency and speed of model training and tuning. Using FSDP in conjunction with torch.compile has led to substantial gains in throughput.

IBM Research scientist Linsong Chu highlighted that their team was among the first to train a model using torch.compile and FSDP, achieving a training rate of 4,550 tokens per second per GPU on A100 GPUs. This breakthrough was demonstrated with the Granite 7B model, recently released on Red Hat Enterprise Linux AI (RHEL AI).

Further optimizations are being explored, including the integration of FP8 (8-point floating bit) datatype supported by Nvidia H100 GPUs, which has shown up to 50% gains in throughput. IBM Research scientist Raghu Ganti emphasized the significant impact of these improvements on infrastructure cost reduction.

Future Prospects

IBM Research continues to explore new frontiers, including the use of FP8 for model training and tuning on IBM’s artificial intelligence unit (AIU). The team is also focusing on Triton, Nvidia’s open-source software for AI deployment and execution, which aims to further optimize training by compiling Python code into the specific hardware programming language.

These advancements collectively aim to move faster cloud-based model training from experimental stages into broader community applications, potentially transforming the landscape of AI model training.

Image source: Shutterstock


Credit: Source link

ShareTweetSendPinShare
Previous Post

Shiba Inu Eyes Explosive 430% Climb As Momentum Builds —Analyst

Next Post

Can Bulls Stage A Comeback?

Next Post
Can Bulls Stage A Comeback?

Can Bulls Stage A Comeback?

You might also like

US Should Act On Bitcoin, Not Just Praise It: Trump Ex-Advisor

US Should Act On Bitcoin, Not Just Praise It: Trump Ex-Advisor

March 5, 2026
Bitcoin Big-Money On The Move: Exchange Whale Ratio Spikes To 0.6

Bitcoin Big-Money On The Move: Exchange Whale Ratio Spikes To 0.6

March 7, 2026
US Prosecutors Push for October Retrial of Tornado Cash Developer Roman Storm

US Prosecutors Push for October Retrial of Tornado Cash Developer Roman Storm

March 11, 2026
Coinbase Faces Backlash as Base Devs Point to “Corporate Double Speak”

Binance, CZ Cleared in US Civil Suit Over Alleged Terror Financing

March 7, 2026
Banks Push Back After Fed Grants Kraken Financial Access to Payment Rails

Banks Push Back After Fed Grants Kraken Financial Access to Payment Rails

March 5, 2026
Bitcoin Prints A 2022-Like Iran War Chart, But It’s Not

Bitcoin Prints A 2022-Like Iran War Chart, But It’s Not

March 6, 2026
CryptoABC.net

This is an Australian online news/education portal that aims to provide the latest crypto news, real-time updates, education and reviews within Australia and around the world. Feel free to get in touch with us!

What's New Here!

What To Expect For The Bitcoin Price After The Weekend Breakdown Below $70,000

What To Expect For The Bitcoin Price After The Weekend Breakdown Below $70,000

March 11, 2026
Circle CEO Allaire Supports Binance Stablecoin Decision

Circle Deploys USDC and CCTP on Morph Layer-2 Network

March 11, 2026

Subscribe Now

  • Contact Us
  • Privacy Policy
  • Terms of Use
  • DMCA

© 2021 cryptoabc.net - All rights reserved!

No Result
View All Result
  • Live Crypto Prices
  • Crypto News
    • Worldwide
      • Bitcoin
      • Ethereum
      • Altcoin
      • Blockchain
      • Regulation
    • Australian Crypto News
  • Education
    • Cryptocurrency For Beginners
    • Where to Buy Cryptocurrency
    • Where to Store Cryptos
    • Cryptocurrency Tax in Australia 2021

© 2021 cryptoabc.net - All rights reserved!

Welcome Back!

Login to your account below

Forgotten Password?

Create New Account!

Fill the forms below to register

All fields are required. Log In

Retrieve your password

Please enter your username or email address to reset your password.

Log In
Please enter CoinGecko Free Api Key to get this plugin works.