feat: added ResponseRelevanceEvaluator #112

poshinchen · 2026-01-30T16:02:17Z

Description

Add ResponseRelevanceEvaluator to assess the relevance of the LLM response to the question, in other words, how focused the LLM response is on the given question.
Implement 5-level scoring system (Not At All, Not Generally, Neutral/Mixed, Generally Yes, Completely Yes)

Related Issues

#100

Documentation PR

WIP

Type of Change

New feature ("ResponseRelevance Evaulator")

Notes

Need to run evaluations via agentcore online evaluator and compare the scores.

Testing

How have you tested the change? Verify that the changes do not break functionality or introduce warnings in consuming repositories: agents-docs, agents-tools, agents-cli

I ran hatch run prepare

Checklist

I have read the CONTRIBUTING document
I have added any necessary tests that prove my fix is effective or my feature works
I have updated the documentation accordingly
I have added an appropriate example to the documentation to outline the feature, or no new docs are needed
My changes generate no new warnings
Any dependent changes have been merged and published

By submitting this pull request, I confirm that you can use, modify, copy, and redistribute this contribution, under the terms of your choice.

poshinchen · 2026-01-30T21:53:42Z

src/strands_evals/evaluators/response_relevance_evaluator.py

+        )
+        return [result]
+
+    def _get_last_turn(self, evaluation_case: EvaluationData[InputT, OutputT]) -> TraceLevelInput:


There are some duplicated extraction logic comparing with other evaluators. The follow-up will be to extract them in a better place. Currently Evaluator base class is not the best option because it contains non trace-based evaluators implementation. This is not a blocker though.

feat: added ResponseRelevanceEvaluator

f616f3d

poshinchen deployed to auto-approve January 30, 2026 16:02 — with GitHub Actions Active

poshinchen changed the title ~~(WIP) feat: added ResponseRelevanceEvaluator~~ feat: added ResponseRelevanceEvaluator Jan 30, 2026

poshinchen commented Jan 30, 2026

View reviewed changes

poshinchen requested review from Unshure and afarntrog January 30, 2026 21:54

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: added ResponseRelevanceEvaluator #112

feat: added ResponseRelevanceEvaluator #112

Uh oh!

poshinchen commented Jan 30, 2026

Uh oh!

poshinchen Jan 30, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

feat: added ResponseRelevanceEvaluator #112

Are you sure you want to change the base?

feat: added ResponseRelevanceEvaluator #112

Uh oh!

Conversation

poshinchen commented Jan 30, 2026

Description

Related Issues

Documentation PR

Type of Change

Notes

Testing

Checklist

Uh oh!

poshinchen Jan 30, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant