BattleTag
444d58726a
refactor: native tool calling + generic forced-retry + terminal exit
- llm_client: switch tool_call_loop from text-based <tool_call> regex
to OpenAI-native tools=[...] / structured tool_calls field; accumulate
delta.reasoning_content for DeepSeek thinking-mode echo-back; fold
preserves system msg and aligns boundary to never orphan role:tool
- base_agent: generic forced-retry via mandatory_record_tools class attr
(filesystem -> add_phenomenon, timeline -> add_temporal_edge,
hypothesis -> add_hypothesis, report -> save_report); count via
executor wrapper
- terminal_tools class attr + loop short-circuit: when a terminal tool
is called, loop exits with its raw return as final_text. ReportAgent
declares save_report as terminal - replaces the <answer>-tag stop
signal that native tool calling broke
- _execute_*: return (raw, formatted) - terminal exit uses untruncated
raw, conversation history uses 3000-char-capped formatted
- evidence_graph + orchestrator: LLM-derived InvestigationArea support
(hypothesis-driven coverage check, replaces hardcoded _AREA_KEYWORDS /
_AREA_TOOLS); manual yaml block kept as optional seed
- strip <answer> references from agent prompts (no longer load-bearing)
Verified on CFReDS image across 4 smoke runs: 0 JSON parse failures
(was 3); 22 temporal edges from Phase 4 (was 0); ReportAgent exits via
save_report (was max_iterations regression). 78/78 unit tests pass.
Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
2026-05-13 13:51:19 +08:00
..
2026-05-09 17:36:26 +08:00
2026-05-09 17:36:26 +08:00
2026-05-09 17:36:26 +08:00
2026-05-13 13:51:19 +08:00
2026-05-09 17:36:26 +08:00
2026-05-09 17:36:26 +08:00
2026-05-13 13:51:19 +08:00
2026-05-13 13:51:19 +08:00