fix(evaluation): return last assistant message on max_iterations fallback by Ricardo-M-L · Pull Request #173 · agent-infra/sandbox

Ricardo-M-L · 2026-04-16T17:26:36Z

Summary

Fixes #172.

When the evaluation agent loop hits max_iterations, both AzureOpenAIAgentLoop and OpenAIAgentLoop returned messages[-1].get(\"content\", \"\"). Since each tool call appends a {\"role\": \"tool\", ...} entry to messages, the final element is typically a tool response — not the assistant's reasoning — so the evaluation harness treats raw tool output as the agent's answer.

This PR walks messages in reverse to find the last assistant message with non-empty content, falling back to \"\" only if none exists.

Changes

evaluation/agent_loop.py — apply the fallback walk in both agent loops (identical 4-line change in each)
evaluation/tests/test_max_iterations_fallback.py — 3 unit tests per loop covering:
- last message is a tool response → returns earlier assistant content
- only tool responses present → returns \"\"
- last message is assistant → returned directly

Notes

This is Bug 2 from the closed PR fix: resolve 6 verified bugs across SDK, evaluation, and examples #167 — resubmitting individually per the policy stated in that PR's closing comment. Bug 1 was already resubmitted as fix(sdk): use & separator when baseUrl already contains query params #171.
No production behaviour change outside the max_iterations fallback path.

🤖 Generated with Claude Code

…back When the agent loop hits `max_iterations` without producing a final answer, both `AzureOpenAIAgentLoop` and `OpenAIAgentLoop` returned `messages[-1].get("content", "")`. But the loop appends a `{"role": "tool", ...}` message after each tool call, so the last message is often a tool response — not the assistant's last reasoning/answer. Walk the list in reverse to find the last assistant message with non-empty content instead, falling back to "" only if none exists. Also add unit tests covering: - last message is a tool response -> returns earlier assistant content - only tool responses present -> returns "" - last message is assistant -> returned directly Broken out from the closed PR agent-infra#167 per the "one fix per PR" policy. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix(evaluation): return last assistant message on max_iterations fallback#173

fix(evaluation): return last assistant message on max_iterations fallback#173
Ricardo-M-L wants to merge 1 commit into
agent-infra:mainfrom
Ricardo-M-L:fix/agent-loop-max-iterations-fallback

Ricardo-M-L commented Apr 16, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

Ricardo-M-L commented Apr 16, 2026

Summary

Changes

Notes

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant