add tests to trajectory eval chain #8909

shibuiwilliam · 2023-08-08T11:04:35Z

What

add tests to trajectory eval chain

eyurtsev · 2023-08-08T13:47:12Z

libs/langchain/langchain/evaluation/agents/trajectory_eval_chain.py

-        )  # Scan for first digit
-
-        if not 1 <= int(score_str) <= 5:
+        # Use regex to extract the score


nit: It would be useful to document the intended behavior to help verify the implementation.

Is the reason for the change to turn a missing score into None rather than 0?

If so, could we keep the previous logic and replace it like so:

score_str = next((char for char in score_str if char.isdigit()), None)  # Scan for first digit

Thanks!

shibuiwilliam · 2023-08-09

This change is intended to handle cases where the LLM produces an invalid score such as 1.1 or 20.
next((char for char in score_str if char.isdigit()), None) returns "1" for 1.1 and "2" for 20, so an invalid score would be silently accepted as a valid one.
My change treats these cases as errors.

Added an explanation in the code comment.
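
To illustrate the difference described in this thread, here is a minimal Python sketch. It is not the code merged in this PR: the function names first_digit_score and strict_score and the regex pattern are hypothetical, chosen only to contrast scanning for the first digit with matching the whole number before range-checking it.

```python
import re
from typing import Optional

# Hypothetical pattern: capture a whole numeric token, including any decimal part,
# so that "1.1" and "20" are seen as "1.1" and "20" rather than "1" and "2".
_NUMBER_RE = re.compile(r"\d+(?:\.\d+)?")


def first_digit_score(text: str) -> Optional[str]:
    """First-digit scan as quoted in the review; returns None if no digit exists."""
    return next((char for char in text if char.isdigit()), None)


def strict_score(text: str) -> int:
    """Illustrative stricter parse: accept only a whole number between 1 and 5."""
    match = _NUMBER_RE.search(text)
    if match is None:
        raise ValueError(f"no score found in {text!r}")
    token = match.group()
    # Reject decimals such as "1.1" and out-of-range values such as "20" or "0".
    if not token.isdigit() or not 1 <= int(token) <= 5:
        raise ValueError(f"invalid score {token!r}; expected an integer from 1 to 5")
    return int(token)
```

With this sketch, first_digit_score("1.1") yields "1" and first_digit_score("20") yields "2", while strict_score raises ValueError for both inputs and still accepts an input such as "Score: 3".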

eyurtsev · 2023-08-08T13:47:54Z

Thanks for the change and the tests. Do you mind including somewhere an explanation of what behavior this PR modifies?

add tests to trajectory eval chain

69342d7

dosubot bot added the labels "Ɑ: agent" (Related to agents module) and "🤖:improvement" (Medium size change to existing code to handle new use-cases) Aug 8, 2023

shibuiwilliam marked this pull request as ready for review August 8, 2023 11:16

eyurtsev reviewed Aug 8, 2023

shibuiwilliam added 3 commits August 9, 2023 17:11

add tests to trajectory eval chain

443b735

add tests to trajectory eval chain

b01a50f

add tests to trajectory eval chain

1d388c7

shibuiwilliam requested a review from eyurtsev August 9, 2023 09:14

eyurtsev approved these changes Aug 9, 2023

eyurtsev merged commit 3adb1e1 into langchain-ai:master Aug 9, 2023
