feat: add optional BM25 retrieval confidence metadata by SeCuReDmE-main-dev · Pull Request #1 · SeCuReDmE-main-dev/haystack_case_study

SeCuReDmE-main-dev · 2026-05-18T04:27:12Z

Summary

add an opt-in include_confidence parameter to InMemoryBM25Retriever
attach BM25 confidence metadata via Document.meta only when include_confidence=True and scale_score=True
cover sync, async, and serialization behavior with targeted tests

Compatibility

preserves existing Document.score semantics
preserves default retriever behavior when include_confidence=False
does not change the Document dataclass or global retriever contracts

Not included

no cross-retriever confidence normalization
no new shared retriever helpers
no runtime/router/agent consumption changes

Validation

.venv\\Scripts\\python.exe -m pytest test/components/retrievers/test_in_memory_bm25_retriever.py
.venv\\Scripts\\ruff.exe check haystack/components/retrievers/in_memory/bm25_retriever.py test/components/retrievers/test_in_memory_bm25_retriever.py

qodo-code-review · 2026-05-18T04:27:16Z

Qodo reviews are paused for this user.

Troubleshooting steps vary by plan Learn more →

On a Teams plan?
Reviews resume once this user has a paid seat and their Git account is linked in Qodo.
Link Git account →

Using GitHub Enterprise Server, GitLab Self-Managed, or Bitbucket Data Center?
These require an Enterprise plan - Contact us
Contact us →

gemini-code-assist

Code Review

This pull request introduces an include_confidence parameter to the InMemoryBM25Retriever component, allowing users to include retrieval confidence metadata when scores are scaled. The implementation covers the constructor, serialization, and both synchronous and asynchronous execution paths, supported by new unit tests. Feedback from the reviewer focused on improving documentation consistency by explicitly detailing the metadata keys in the docstrings of the run and run_async methods.

gemini-code-assist · 2026-05-18T04:29:14Z

+        :param include_confidence:
+            When `True`, adds optional retrieval confidence metadata to returned documents when `scale_score` is also
+            `True`. When `False`, no retrieval confidence metadata is added.


The docstring for include_confidence in the run method should be consistent with the one in __init__ by explicitly mentioning the metadata keys. This improves clarity for users interacting with the component's run method.

Suggested change

:param include_confidence:

When `True`, adds optional retrieval confidence metadata to returned documents when `scale_score` is also

`True`. When `False`, no retrieval confidence metadata is added.

:param include_confidence:

When `True`, adds retrieval confidence metadata to returned documents when `scale_score` is also

`True`. The metadata is exposed via `Document.meta["retrieval_confidence"]` and

`Document.meta["retrieval_confidence_source"]`. When `False`, no retrieval confidence metadata is added.

gemini-code-assist · 2026-05-18T04:29:14Z

+        :param include_confidence:
+            When `True`, adds optional retrieval confidence metadata to returned documents when `scale_score` is also
+            `True`. When `False`, no retrieval confidence metadata is added.


The docstring for include_confidence in the run_async method should be consistent with the one in __init__ by explicitly mentioning the metadata keys. This improves clarity for users interacting with the component's run_async method.

Suggested change

:param include_confidence:

When `True`, adds optional retrieval confidence metadata to returned documents when `scale_score` is also

`True`. When `False`, no retrieval confidence metadata is added.

:param include_confidence:

When `True`, adds retrieval confidence metadata to returned documents when `scale_score` is also

`True`. The metadata is exposed via `Document.meta["retrieval_confidence"]` and

`Document.meta["retrieval_confidence_source"]`. When `False`, no retrieval confidence metadata is added.

feat: add optional BM25 retrieval confidence metadata

64efaa7

SeCuReDmE-main-dev mentioned this pull request May 18, 2026

feat: expose evaluator row status for LLM evaluators deepset-ai/haystack#11333

Open

gemini-code-assist Bot reviewed May 18, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: add optional BM25 retrieval confidence metadata#1

feat: add optional BM25 retrieval confidence metadata#1
SeCuReDmE-main-dev wants to merge 1 commit into
feature/haystack-evaluator-uncertainty-phase1from
feature/haystack-retrieval-confidence-phase2

SeCuReDmE-main-dev commented May 18, 2026

Uh oh!

qodo-code-review Bot commented May 18, 2026

Uh oh!

gemini-code-assist Bot left a comment

Uh oh!

gemini-code-assist Bot May 18, 2026

Uh oh!

gemini-code-assist Bot May 18, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

SeCuReDmE-main-dev commented May 18, 2026

Summary

Compatibility

Not included

Validation

Uh oh!

qodo-code-review Bot commented May 18, 2026

Qodo reviews are paused for this user.

Uh oh!

gemini-code-assist Bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

gemini-code-assist Bot May 18, 2026

Choose a reason for hiding this comment

Uh oh!

gemini-code-assist Bot May 18, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant