Skip to content

[bot] Merge master/2a72b14c into rel/dev#1681

Merged
yenkins-admin merged 4 commits into
rel/devfrom
snapshot-master-2a72b14c-to-rel/dev
Jul 3, 2026
Merged

[bot] Merge master/2a72b14c into rel/dev#1681
yenkins-admin merged 4 commits into
rel/devfrom
snapshot-master-2a72b14c-to-rel/dev

Conversation

@yenkins-admin

Copy link
Copy Markdown
Contributor

🚀 Automated PR to perform merge from master into rel/dev with changes up to 2a72b14 (created by https://github.com/gooddata/gooddata-python-sdk/actions/runs/28648738929).

Tomkess and others added 4 commits June 27, 2026 22:33
_args_match checked emit_widget (renamed to user_requested_search in the
tool schema) and limit (optional, server-side default). Both mismatched on
every real model call, so tool_correctness was always False regardless of
whether the model used the right keywords and object types.

Fix: evaluate only keywords (case-insensitive) and object_types — the two
fields that actually determine whether the search was semantically correct.
parsed_arguments() returns raw model-emitted JSON, so a bad tool call
like {"keywords":[1]} or mixed-type object_types would raise on
.lower()/sorted() and abort the whole eval run instead of scoring
tool_correctness=False. Add _normalize_str_list to drop non-string
entries defensively; valid comparisons (incl. case-insensitive keyword
match) are unchanged.

Addresses CodeRabbit review on PR #1675.

Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>
Address review nits on search_tool _args_match:
- reword _normalize_str_list comment: dropping non-string entries prevents
  a crash, it does not force a mismatch (surviving strings still compare)
- note that object_types is compared case-sensitively on purpose (controlled
  ObjectType StrEnum values emitted verbatim)

No behavior change. keywords/object_types are declared list[str] in the
search_objects schema, so string-collapse false-negatives cannot occur.

JIRA: TRIVIAL
risk: nonprod
fix(eval): fix search_tool correctness always scoring 0%
@yenkins-admin yenkins-admin merged commit f0c2161 into rel/dev Jul 3, 2026
1 check passed
@yenkins-admin yenkins-admin deleted the snapshot-master-2a72b14c-to-rel/dev branch July 3, 2026 08:34
@codecov

codecov Bot commented Jul 3, 2026

Copy link
Copy Markdown

Codecov Report

❌ Patch coverage is 88.88889% with 1 line in your changes missing coverage. Please review.
✅ Project coverage is 77.80%. Comparing base (39d7676) to head (2a72b14).
⚠️ Report is 530 commits behind head on rel/dev.

Files with missing lines Patch % Lines
...l/src/gooddata_eval/core/evaluators/search_tool.py 88.88% 1 Missing ⚠️
Additional details and impacted files
@@           Coverage Diff            @@
##           rel/dev    #1681   +/-   ##
========================================
  Coverage    77.79%   77.80%           
========================================
  Files          271      271           
  Lines        18599    18602    +3     
========================================
+ Hits         14470    14474    +4     
+ Misses        4129     4128    -1     

☔ View full report in Codecov by Harness.
📢 Have feedback on the report? Share it here.

🚀 New features to boost your workflow:
  • ❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants