• Live Crypto Prices
  • Crypto News
    • Worldwide
      • Bitcoin
      • Ethereum
      • Altcoin
      • Blockchain
      • Regulation
    • Australian Crypto News
  • Education
    • Cryptocurrency For Beginners
    • Where to Buy Cryptocurrency
    • Where to Store Cryptos
    • Cryptocurrency Tax in Australia 2021
No Result
View All Result
CryptoABC.net
No Result
View All Result

Together AI Expands DeepSeek-R1 Deployment with Enhanced Serverless APIs and Reasoning Clusters

February 13, 2025
in Blockchain
Reading Time: 2min read
0 0
A A
0
Bitcoin Holdings in Public Company Treasuries Exceed 200,000 BTC
0
SHARES
8
VIEWS
ShareShareShareShareShare


Felix Pinkston
Feb 13, 2025 11:11

Together AI enhances DeepSeek-R1 deployment with new serverless APIs and reasoning clusters, offering high-speed and scalable solutions for large-scale reasoning model applications.





Together AI has announced significant advancements in the deployment of its DeepSeek-R1 reasoning model, introducing enhanced serverless APIs and dedicated reasoning clusters. This move is aimed at supporting the increasing demand from companies integrating sophisticated reasoning models into their production applications.

Enhanced Serverless APIs

The new Together Serverless API for DeepSeek-R1 is reportedly twice as fast as any other API currently available in the market, enabling low-latency, production-grade inference with seamless scalability. This API is designed to offer companies fast, responsive user experiences and efficient multi-step workflows, crucial for modern applications relying on reasoning models.

Key features of the serverless API include instant scalability without infrastructure management, flexible pay-as-you-go pricing, and enhanced security with hosting in Together AI’s data centers. The OpenAI-compatible APIs further facilitate easy integration into existing applications, offering high rate limits of up to 9000 requests per minute on the scale tier.

Introduction of Together Reasoning Clusters

To complement the serverless solution, Together AI has launched Together Reasoning Clusters, which provide dedicated GPU infrastructure optimized for high-throughput, low-latency inference. These clusters are particularly suited for handling variable, token-heavy reasoning workloads, achieving decoding speeds of up to 110 tokens per second.

The clusters leverage the proprietary Together Inference Engine, which is reported to be 2.5 times faster than open-source engines like SGLang. This efficiency allows for the same throughput with significantly fewer GPUs, reducing infrastructure costs while maintaining high performance.

Scalability and Cost Efficiency

Together AI offers a range of cluster sizes to match different workload demands, with contract-based pricing models ensuring predictable costs. This setup is particularly beneficial for enterprises with high-volume workloads, providing a cost-effective alternative to token-based pricing.

Additionally, the dedicated infrastructure ensures secure, isolated environments within North American data centers, meeting privacy and compliance requirements. With enterprise support and service level agreements guaranteeing 99.9% uptime, Together AI ensures reliable performance for mission-critical applications.

For more information, visit Together AI.

Image source: Shutterstock


Credit: Source link

ShareTweetSendPinShare
Previous Post

BNB Flips Solana’s Market Cap – Breakout To $700 Coming?

Next Post

BounceBit Mainnet v1.6.1 Enhances Performance and Security

Next Post
Bitcoin Holdings in Public Company Treasuries Exceed 200,000 BTC

BounceBit Mainnet v1.6.1 Enhances Performance and Security

You might also like

Solana ETFs Build ‘Serious Investor Base,’ Outpacing Bitcoin in Key Metrics

Solana ETFs Build ‘Serious Investor Base,’ Outpacing Bitcoin in Key Metrics

March 9, 2026
Stablecoin Market Breaks Records — USDC Controls 70% Of $1.8 Trillion Volume

Stablecoin Market Breaks Records — USDC Controls 70% Of $1.8 Trillion Volume

March 7, 2026
Standard Chartered Identifies Two Major Catalysts

Ripple Launches $750 Million Share Buyback, Boosting Valuation To $50 Billion

March 11, 2026
Dimensional Becomes Second Firm to Win SEC ETF-Mutual Fund Hybrid Approval

Crypto News Today: $2.6 Billion Options Expiry With Volatility Expected

March 6, 2026
Bitcoin’s Stock Correlation Doesn’t Undermine Its Diversification Role

Bitcoin’s Stock Correlation Doesn’t Undermine Its Diversification Role

March 9, 2026
Ethereum Foundation Positions Blockchain as Trust Layer for the Age of AI

Ethereum Foundation Positions Blockchain as Trust Layer for the Age of AI

March 6, 2026
CryptoABC.net

This is an Australian online news/education portal that aims to provide the latest crypto news, real-time updates, education and reviews within Australia and around the world. Feel free to get in touch with us!

What's New Here!

TVL Spikes 23% In Less Than Two Weeks

TVL Spikes 23% In Less Than Two Weeks

March 12, 2026
Ethereum Price Prediction: Wall Street Is Choosing Ethereum — Is ETH Becoming the Backbone of Finance?

Ethereum Price Prediction: Wall Street Is Choosing Ethereum — Is ETH Becoming the Backbone of Finance?

March 12, 2026

Subscribe Now

  • Contact Us
  • Privacy Policy
  • Terms of Use
  • DMCA

© 2021 cryptoabc.net - All rights reserved!

No Result
View All Result
  • Live Crypto Prices
  • Crypto News
    • Worldwide
      • Bitcoin
      • Ethereum
      • Altcoin
      • Blockchain
      • Regulation
    • Australian Crypto News
  • Education
    • Cryptocurrency For Beginners
    • Where to Buy Cryptocurrency
    • Where to Store Cryptos
    • Cryptocurrency Tax in Australia 2021

© 2021 cryptoabc.net - All rights reserved!

Welcome Back!

Login to your account below

Forgotten Password?

Create New Account!

Fill the forms below to register

All fields are required. Log In

Retrieve your password

Please enter your username or email address to reset your password.

Log In
Please enter CoinGecko Free Api Key to get this plugin works.