MASForensic

Author	SHA1	Message	Date
BattleTag	444d58726a	refactor: native tool calling + generic forced-retry + terminal exit - llm_client: switch tool_call_loop from text-based <tool_call> regex to OpenAI-native tools=[...] / structured tool_calls field; accumulate delta.reasoning_content for DeepSeek thinking-mode echo-back; fold preserves system msg and aligns boundary to never orphan role:tool - base_agent: generic forced-retry via mandatory_record_tools class attr (filesystem -> add_phenomenon, timeline -> add_temporal_edge, hypothesis -> add_hypothesis, report -> save_report); count via executor wrapper - terminal_tools class attr + loop short-circuit: when a terminal tool is called, loop exits with its raw return as final_text. ReportAgent declares save_report as terminal - replaces the <answer>-tag stop signal that native tool calling broke - _execute_*: return (raw, formatted) - terminal exit uses untruncated raw, conversation history uses 3000-char-capped formatted - evidence_graph + orchestrator: LLM-derived InvestigationArea support (hypothesis-driven coverage check, replaces hardcoded _AREA_KEYWORDS / _AREA_TOOLS); manual yaml block kept as optional seed - strip <answer> references from agent prompts (no longer load-bearing) Verified on CFReDS image across 4 smoke runs: 0 JSON parse failures (was 3); 22 temporal edges from Phase 4 (was 0); ReportAgent exits via save_report (was max_iterations regression). 78/78 unit tests pass. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-13 13:51:19 +08:00
BattleTag	0a2b344c84	fix: share _safe_json_loads with tool-call parser, not just orchestrator Move _safe_json_loads from orchestrator.py to llm_client.py and have _extract_tool_calls use it when parsing <tool_call> JSON blocks from model output. orchestrator now imports it from llm_client. Background: in the first full DeepSeek run (runs/2026-05-12T17-25-38), ~10 'Failed to parse tool call JSON' warnings appeared, all from regex patterns where the LLM wrote \. or \* inside JSON string values: Failed to parse tool call JSON: {..., "pattern": "Outlook Express\|...\|\.dbx"} Failed to parse tool call JSON: {..., "pattern": "ethereal.\.pcap"} Failed to parse tool call JSON: {..., "pattern": "lookatlan.\.txt\|..."} These are exactly the kind of stray-backslash errors stage-1 sanitize already handles for orchestrator JSON calls — but tool-call extraction was using bare json.loads. Result: each failed tool call silently dropped on the floor, the LLM never got a result, and at least one network agent burned 14m26s spinning before hitting max_iterations=40. Now the sanitize/log-on-failure path is shared. Verified against the three failure cases from yesterday's log: all three now parse cleanly. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-12 20:29:21 +08:00
BattleTag	0a966d8476	feat: switch LLM client to OpenAI SDK for DeepSeek compatibility The previous LLMClient used raw httpx + Claude Messages API (/v1/messages, x-api-key, Anthropic SSE event types). Incompatible with DeepSeek. Rewrite LLMClient.__init__/chat/close to use openai.AsyncOpenAI: - /v1/chat/completions endpoint, OpenAI message format - Bearer auth, native SDK error types - Stream chunks via async for + chunk.choices[0].delta.content Tool calling protocol (ReAct text-based tags) and all surrounding helpers (_apply_progressive_decay, _fold_old_messages, _partition_tool_calls, tool_call_loop, etc.) are unchanged (endpoint-agnostic by design). New optional config params surfaced to config.yaml.agent: - reasoning_effort: "high" | "medium" | "low" (DeepSeek/o1-style depth) - thinking_enabled: bool (DeepSeek extra_body.thinking switch) main.py and regenerate_report.py pass these through to LLMClient. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-12 17:13:54 +08:00
BattleTag	097d2ce472	Initial commit Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-09 17:36:26 +08:00
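
Commit 0a2b344c84 describes tool-call JSON failing to parse because the model wrote regex escapes such as `\.` inside JSON string values. The repository's `_safe_json_loads` is not shown in this log, so the following is only a minimal sketch of one way such a fallback could work; the name `safe_json_loads_sketch` and the repair strategy (doubling any backslash that does not start a valid JSON escape) are assumptions, not the project's actual code.

```python
import json
import re

# Characters that may legally follow a backslash inside a JSON string.
_VALID_ESCAPES = '"\\/bfnrtu'

# Matches a backslash plus the character after it, so pairs are consumed
# left to right and an already-escaped "\\" is never split apart.
_BACKSLASH_PAIR = re.compile(r"\\(.)")


def _escape_stray(match):
    """Keep valid JSON escapes; double the backslash of anything else."""
    if match.group(1) in _VALID_ESCAPES:
        return match.group(0)
    return "\\\\" + match.group(1)


def safe_json_loads_sketch(text):
    """Hypothetical helper: parse JSON, retrying once after repairing
    stray backslashes. Returns None if both attempts fail."""
    try:
        return json.loads(text)
    except json.JSONDecodeError:
        pass  # fall through to the repair attempt
    repaired = _BACKSLASH_PAIR.sub(_escape_stray, text)
    try:
        return json.loads(repaired)
    except json.JSONDecodeError:
        return None


# One of the failing payload shapes quoted in the commit message.
parsed = safe_json_loads_sketch(r'{"pattern": "ethereal.\.pcap"}')
print(parsed)
```

The repair only runs after a strict parse has failed, so well-formed input is returned unchanged. A sequence such as `\b` still parses as a JSON backspace rather than a regex word boundary, which this sketch does not try to detect.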

4 Commits
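
Commit 444d58726a describes a loop that exits as soon as a terminal tool is called, returning that tool's untruncated raw output while the conversation history keeps a 3000-character-capped copy of each result. The sketch below illustrates that control flow under simplifying assumptions: it is synchronous, the model is replaced by a scripted callable, and the names `tool_call_loop_sketch`, `execute_tool`, and `next_tool_call` are hypothetical rather than taken from the repository.

```python
MAX_FORMATTED_CHARS = 3000  # history cap mentioned in the commit message


def execute_tool(tools, name, args):
    """Run a tool and return (raw, formatted): raw is the full output,
    formatted is the capped copy that goes into conversation history."""
    raw = str(tools[name](**args))
    return raw, raw[:MAX_FORMATTED_CHARS]


def tool_call_loop_sketch(next_tool_call, tools, terminal_tools, max_iterations=40):
    """Hypothetical loop: ask for the next tool call until a terminal tool
    runs or the iteration budget is spent. Returns (final_text, history)."""
    history = []
    for _ in range(max_iterations):
        call = next_tool_call(history)
        if call is None:  # the stand-in model produced no tool call
            return None, history
        name, args = call
        raw, formatted = execute_tool(tools, name, args)
        if name in terminal_tools:
            # Short-circuit: the terminal tool's raw return is the final text.
            return raw, history
        history.append({"role": "tool", "name": name, "content": formatted})
    return None, history  # budget exhausted without a terminal tool


# Scripted stand-in for the model: one ordinary call, then the terminal one.
tools = {
    "read_file": lambda path: "x" * 5000,
    "save_report": lambda body: body,
}
script = iter([
    ("read_file", {"path": "a.txt"}),
    ("save_report", {"body": "r" * 4000}),
])
final_text, history = tool_call_loop_sketch(
    lambda _history: next(script, None), tools, {"save_report"}
)
print(len(final_text), len(history[0]["content"]))
```

Returning the raw value on the terminal path means a long report is not cut off by the history cap, which is the distinction the commit draws between `raw` and `formatted`.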