• Live Crypto Prices
  • Crypto News
    • Worldwide
      • Bitcoin
      • Ethereum
      • Altcoin
      • Blockchain
      • Regulation
    • Australian Crypto News
  • Education
    • Cryptocurrency For Beginners
    • Where to Buy Cryptocurrency
    • Where to Store Cryptos
    • Cryptocurrency Tax in Australia 2021
No Result
View All Result
CryptoABC.net
No Result
View All Result

LangSmith Enhances LLM Evaluations with Pytest and Vitest Integrations

January 25, 2025
in Blockchain
Reading Time: 2min read
0 0
A A
0
Understanding the Role and Capabilities of AI Agents
0
SHARES
8
VIEWS
ShareShareShareShareShare


Caroline Bishop
Jan 25, 2025 04:44

LangSmith introduces Pytest and Vitest integrations to enhance LLM application evaluations, offering improved testing frameworks for developers.





LangSmith has unveiled new integrations with Pytest and Vitest, aiming to streamline the evaluation process of Large Language Model (LLM) applications. These integrations, now in beta with version 0.3.0 of the LangSmith Python and TypeScript SDKs, provide developers with enhanced testing capabilities, according to LangChain’s blog.

Enhanced Testing Frameworks for LLM Evaluations

LLM evaluations (evals) are crucial for maintaining the reliability and quality of applications. By integrating with Pytest and Vitest, developers familiar with these frameworks can now leverage LangSmith’s advanced features, such as observability and sharing capabilities, without compromising on the developer experience they are accustomed to.

The integrations allow developers to debug tests more effectively, log detailed metrics beyond simple pass/fail results, and share results effortlessly across teams. The non-deterministic nature of LLMs adds complexity to debugging, which LangSmith addresses by saving inputs, outputs, and stack traces from test cases.

Utilizing Built-in Evaluation Functions

LangSmith provides built-in evaluation functions, such as expect.edit_distance(), which compute the string distance between test outputs and reference outputs. This feature is particularly useful for developers who need to ensure their applications consistently deploy the best version. Detailed insights into these functions can be found in LangSmith’s API reference.

Getting Started with Pytest and Vitest

To integrate with Pytest, developers need to add the @pytest.mark.langsmith decorator to their test cases. This setup logs all test case results, application traces, and feedback traces to LangSmith, providing a comprehensive view of the application’s performance.

Similarly, Vitest users can wrap their test cases in an ls.describe() block to achieve the same level of integration and logging. Both frameworks offer real-time feedback and can be seamlessly integrated into continuous integration (CI) pipelines, helping developers catch regressions early.

Advantages Over Traditional Evaluation Methods

Traditional evaluation methods often require predefined datasets and evaluation functions, which can be limiting. LangSmith’s new integrations offer flexibility by allowing developers to define specific test cases and evaluation logic, tailored to their application’s needs. This approach is particularly beneficial for applications that require testing across multiple tools or models with varying evaluation criteria.

The real-time feedback provided by these testing frameworks facilitates rapid iteration and local development, making it easier for developers to refine their applications quickly. Additionally, the integration with CI pipelines ensures that any potential regressions are identified and addressed early in the development process.

For more information on how to utilize these integrations, developers can refer to LangSmith’s comprehensive tutorials and how-to guides available on their documentation site.

Image source: Shutterstock


Credit: Source link

ShareTweetSendPinShare
Previous Post

NVIDIA Unveils OpenUSD Workflows to Propel Physical AI in Robotics and Autonomous Vehicles

Next Post

Intesa Sanpaolo Enters Bitcoin Market with Strategic Investment

Next Post
Bitcoin Holdings in Public Company Treasuries Exceed 200,000 BTC

Intesa Sanpaolo Enters Bitcoin Market with Strategic Investment

You might also like

Anthropic Launches Claude 3.5 Sonnet Android App with Advanced AI Features

Anthropic AI Discovers 22 Firefox Vulnerabilities in Two Weeks

March 6, 2026
Bitcoin Bounce Fails As Short-Term Holders Rush To Take Profit

Bitcoin Bounce Fails As Short-Term Holders Rush To Take Profit

March 7, 2026
Bitcoin Hovers Around $70K as Weak Demand and Defensive Positioning Signal Fragile Market, Says Glassnode

Bitcoin Hovers Around $70K as Weak Demand and Defensive Positioning Signal Fragile Market, Says Glassnode

March 6, 2026
OpenAI: Paf Leverages 85 Custom GPTs to Boost Developer Productivity

OpenAI Launches €500K Grant and SME Training Program in EU Push

March 5, 2026
Bitcoin Capitulation Or Buy Zone? What On-Chain Data Shows

Bitcoin Pattern Memory Predicts The Bottom, And It’s Below $40,000

March 4, 2026
SUI At Decision Point: RSI Trendline Could Trigger A Drop Or Bounce

SUI At Decision Point: RSI Trendline Could Trigger A Drop Or Bounce

March 9, 2026
CryptoABC.net

This is an Australian online news/education portal that aims to provide the latest crypto news, real-time updates, education and reviews within Australia and around the world. Feel free to get in touch with us!

What's New Here!

Polymarket Teams Up With Palantir to Monitor Sports Prediction Markets

Polymarket Teams Up With Palantir to Monitor Sports Prediction Markets

March 11, 2026
Solana (SOL) Rejected Near $90, Downtrend Threat Reappears

Solana (SOL) Rejected Near $90, Downtrend Threat Reappears

March 11, 2026

Subscribe Now

  • Contact Us
  • Privacy Policy
  • Terms of Use
  • DMCA

© 2021 cryptoabc.net - All rights reserved!

No Result
View All Result
  • Live Crypto Prices
  • Crypto News
    • Worldwide
      • Bitcoin
      • Ethereum
      • Altcoin
      • Blockchain
      • Regulation
    • Australian Crypto News
  • Education
    • Cryptocurrency For Beginners
    • Where to Buy Cryptocurrency
    • Where to Store Cryptos
    • Cryptocurrency Tax in Australia 2021

© 2021 cryptoabc.net - All rights reserved!

Welcome Back!

Login to your account below

Forgotten Password?

Create New Account!

Fill the forms below to register

All fields are required. Log In

Retrieve your password

Please enter your username or email address to reset your password.

Log In
Please enter CoinGecko Free Api Key to get this plugin works.