• Live Crypto Prices
  • Crypto News
    • Worldwide
      • Bitcoin
      • Ethereum
      • Altcoin
      • Blockchain
      • Regulation
    • Australian Crypto News
  • Education
    • Cryptocurrency For Beginners
    • Where to Buy Cryptocurrency
    • Where to Store Cryptos
    • Cryptocurrency Tax in Australia 2021
No Result
View All Result
CryptoABC.net
No Result
View All Result

StripedHyena-7B: The Next Generation AI Architecture for Enhanced Performance and Efficiency

January 3, 2024
in Blockchain
Reading Time: 2min read
0 0
A A
0
StripedHyena-7B: The Next Generation AI Architecture for Enhanced Performance and Efficiency
0
SHARES
7
VIEWS
ShareShareShareShareShare

Recent advancements in AI have been significantly influenced by the Transformer architecture, a key component in large models across various fields like language, vision, audio, and biology. However, the complexity of the Transformer’s attention mechanism limits its application in processing long sequences. Even sophisticated models like GPT-4 struggle with this limitation​​.

Breakthrough with StripedHyena

To address these challenges, Together Research recently open-sourced StripedHyena, a language model boasting a novel architecture optimized for long contexts. StripedHyena can handle up to 128k tokens and has demonstrated improvements over the Transformer architecture in both training and inference performance​​. It’s the first model to match the performance of the best open-source Transformer models for both short and long contexts​​.

Hybrid Architecture of StripedHyena

StripedHyena incorporates a hybrid architecture, combining multi-head, grouped-query attention with gated convolutions within Hyena blocks. This design differs from the traditional decoder-only Transformer models. It decodes with constant memory in Hyena blocks through the representation of convolutions as state-space models or truncated filters. This architecture results in lower latency, faster decoding, and higher throughput compared to Transformers​​​​.

Training and Efficiency Gains

StripedHyena outperforms traditional Transformers in end-to-end training for sequences of 32k, 64k, and 128k tokens, with speed improvements of 30%, 50%, and over 100%, respectively​​. In terms of memory efficiency, it reduces memory usage by more than 50% during autoregressive generation compared to Transformers​​.

Comparative Performance with Attention Mechanism

StripedHyena achieves a significant reduction in the quality gap with large-scale attention, offering similar perplexity and downstream performance with less computational cost, and without the need for mixed attention​​.

Applications Beyond Language Processing

StripedHyena’s versatility extends to image recognition. Researchers have tested its applicability in replacing attention in visual Transformers (ViT), showing comparable accuracy in image classification tasks on the ImageNet-1k dataset​​.

StripedHyena represents a significant step forward in AI architecture, offering a more efficient alternative to the Transformer model, especially in handling long sequences. Its hybrid structure and enhanced performance in training and inference make it a promising tool for a wide range of applications in language and vision processing.

Image source: Shutterstock

Credit: Source link

ShareTweetSendPinShare
Previous Post

Billionaire Shark Tank Star Mark Cuban Blasts SEC, Says the Regulator Fails To Protect Investors

Next Post

Registered Funds Want Exposure To BTC

Next Post
Registered Funds Want Exposure To BTC

Registered Funds Want Exposure To BTC

You might also like

Senate’s 60-Vote Gap Looms Over CLARITY Act Before August Recess

Senate’s 60-Vote Gap Looms Over CLARITY Act Before August Recess

June 23, 2026
BOJ Raises Rates To 1% As Crypto Traders Watch Yen Carry Risk

SBI And Startale Put Yen Stablecoins Back In The Institutional Spotlight

June 24, 2026
XRP Forms Channel Support That Puts Market In Difficult Spot, But Bulls Still Have A Chance

Ripple And SBI Launch RLUSD Stablecoin In Japan After Regulatory Approval

June 25, 2026
Bitcoin Addresses Holding Between 100 and 10,000 BTC Hit a 7-Week High

AI Adoption Among General Counsel Hits 87% in 2026

June 23, 2026
VanEck flags $50B miner funding gap as Polymarket pegs BTC >$54K at 99.95%

Tech-stock slump rattles crypto as Polymarket puts 99% on BTC above $54K

June 26, 2026
Senate Democrats Demand Probe Into Trump Family Crypto Venture’s UAE Links

Senate Democrats Demand Probe Into Trump Family Crypto Venture’s UAE Links

June 24, 2026
CryptoABC.net

This is an Australian online news/education portal that aims to provide the latest crypto news, real-time updates, education and reviews within Australia and around the world. Feel free to get in touch with us!

What's New Here!

Ripple Pilots Private Version of XRP Ledger for CBDC Issuance

XRP Price Prediction: $1.00 Make-or-Break — Tactical Bounce or a Flush Into the Low $0.90s Within 72 Hours

June 30, 2026
Build It Here or Buy It Later

Build It Here or Buy It Later

June 30, 2026

Subscribe Now

  • Contact Us
  • Privacy Policy
  • Terms of Use
  • DMCA

© 2021 cryptoabc.net - All rights reserved!

No Result
View All Result
  • Live Crypto Prices
  • Crypto News
    • Worldwide
      • Bitcoin
      • Ethereum
      • Altcoin
      • Blockchain
      • Regulation
    • Australian Crypto News
  • Education
    • Cryptocurrency For Beginners
    • Where to Buy Cryptocurrency
    • Where to Store Cryptos
    • Cryptocurrency Tax in Australia 2021

© 2021 cryptoabc.net - All rights reserved!

Welcome Back!

Login to your account below

Forgotten Password?

Create New Account!

Fill the forms below to register

All fields are required. Log In

Retrieve your password

Please enter your username or email address to reset your password.

Log In
Please enter CoinGecko Free Api Key to get this plugin works.