ds4eval - Agent Harness Comparison Report

Generated at 2026-04-26T13:15:26.498Z · Raw data: report.json

opencode

1/1 success · 195.04s avg · - tokens · 1.0 turns

Model × Harness Success Rate

Model · opencode
DeepSeek V4 Pro (deepseek-v4-pro) · 1/1

Latest Evaluation Results

FX monitor and alert · DeepSeek V4 Pro · success

opencode · opencode · 195.04s · - tokens

output/forex-alert.txt

FOREX ALERT: CNY/JPY cross rate moved by +13.88% (from 20.45 to 23.2878), exceeding the 2% threshold.

output/forex-result.json

{
  "observed_rates": {
    "date": "2026-04-25",
    "source": "https://open.er-api.com/v6/latest/USD",
    "usd_to_cny": 6.847089,
    "usd_to_jpy": 159.453848
  },
  "calculation": {
    "cny_to_jpy_cross_rate": 23.2878,
    "formula": "USD/JPY / USD/CNY"
  },
  "comparison": {
    "last_week_reference": 20.45,
    "change_pct": 13.88,
    "change_abs": 2.8378
  },
  "decision": {
    "threshold_pct": 2,
    "exceeds_threshold": true,
    "alert_written": true,
    "result_json_written": true
  }
}
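The figures in forex-result.json can be rechecked by hand. The sketch below is not the agent's actual code, which this report does not show; it only redoes the arithmetic implied by the "formula", "comparison", and "decision" fields. The function names `cross_rate` and `evaluate`, and the rounding to 4 and 2 decimal places, are assumptions inferred from the precision of the reported values.

```python
def cross_rate(usd_to_cny: float, usd_to_jpy: float) -> float:
    """JPY per 1 CNY, derived from two USD-based quotes (USD/JPY / USD/CNY)."""
    return usd_to_jpy / usd_to_cny


def evaluate(usd_to_cny: float, usd_to_jpy: float,
             reference: float, threshold_pct: float) -> dict:
    """Compare the current cross rate with a reference rate.

    Rounding (4 decimals for rates, 2 for percent) is an assumption
    chosen to match the precision shown in the report.
    """
    rate = round(cross_rate(usd_to_cny, usd_to_jpy), 4)
    change_abs = round(rate - reference, 4)
    change_pct = round(change_abs / reference * 100, 2)
    return {
        "cny_to_jpy_cross_rate": rate,
        "change_abs": change_abs,
        "change_pct": change_pct,
        # The alert fires on moves in either direction.
        "exceeds_threshold": abs(change_pct) > threshold_pct,
    }


def alert_message(result: dict, reference: float, threshold_pct: float) -> str:
    """Format an alert line in the same shape as forex-alert.txt."""
    return (
        f"FOREX ALERT: CNY/JPY cross rate moved by "
        f"{result['change_pct']:+.2f}% (from {reference} to "
        f"{result['cny_to_jpy_cross_rate']}), exceeding the "
        f"{threshold_pct}% threshold."
    )


if __name__ == "__main__":
    # Inputs copied from "observed_rates" and "comparison" in the report.
    res = evaluate(6.847089, 159.453848, reference=20.45, threshold_pct=2)
    print(res)
    if res["exceeds_threshold"]:
        print(alert_message(res, 20.45, 2))
```

Dividing USD/JPY by USD/CNY cancels the dollar leg, leaving yen per yuan. With the observed rates this gives 23.2878, a 13.88% rise over the 20.45 reference, which is consistent with the alert being written.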
