[experiments] Daily Experiment Report — 2026-06-22 #40763
Closed
This discussion has been marked as outdated by daily-experiment-report. A newer discussion is available at Discussion #40993.
🧪 Daily Experiment Report — 2026-06-22
37 experiments across 34 workflows. 9 ready (all variants ≥ min_samples), 28 still collecting. ⚠️ 1 balance alert.
⚡ Quick Stats
🟢 Ready for Analysis (9 experiments)
caveman · smoke-copilot · no/yes
subagent_model · smoke-copilot · small/large
sub_agent_strategy · smoke-gemini · single_agent/sub_agents
sub_agent_strategy · smoke-antigravity · single_agent/sub_agents
caveman · smoke-copilot-aoai-apikey · yes/no
subagent_model · smoke-copilot-aoai-apikey · small/large
prompt_style · daily-community-attribution · concise/verbose
caveman · smoke-copilot-aoai-entra · no/yes
subagent_model · smoke-copilot-aoai-entra · small/large
📊 Charts for top 5 experiments
[Charts: caveman·smoke-copilot, subagent_model·smoke-copilot, sub_agent_strategy·smoke-gemini, sub_agent_strategy·smoke-antigravity, caveman·smoke-copilot-aoai-apikey]
🟡 Still Collecting — Top 10 by runs
tone_variant · aw-failure-investigator · narrative: ████░░░░ 25/50
prompt_style · daily-astrostylelite-markdown-spellcheck · concise: ██████░░ 24/30
output_format · daily-issues-report · collapsible: ██████░░ 22/30
prompt_style · daily-news · concise: █████░░░ 18/30
reasoning_depth · daily-security-red-team · single_pass: █████░░░ 19/30
output_format · daily-compiler-quality · detailed: ██████░░ 15/20
output_format · daily-code-metrics · executive_summary: ███████░ 17/20
semgrep_output_format · daily-semgrep-scan · bullet_list: ███░░░░░ 10/30
output_format · deep-report · full_briefing: ██████░░ 11/15
prompt_compression · agent-performance-analyzer · verbose: ███████░ 12/14
+18 more: see all in git state branches.
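The progress bars and the "ready" rule (all variants ≥ min_samples) can be sketched as below. This is a minimal illustration, not the report generator's actual code: the function names `progress_bar` and `is_ready`, the bar width of 8, and the round-half-up fill rule are assumptions inferred from the bars shown in this report.

```python
def progress_bar(runs: int, min_samples: int, width: int = 8) -> str:
    """Render runs/min_samples as a fixed-width bar (assumed width: 8 cells)."""
    # Cap at 100% so experiments past min_samples show a full bar.
    fraction = min(runs / min_samples, 1.0)
    # Round half up to the nearest whole cell (assumed rounding rule).
    filled = int(fraction * width + 0.5)
    return "█" * filled + "░" * (width - filled) + f" {runs}/{min_samples}"


def is_ready(variant_runs: dict[str, int], min_samples: int) -> bool:
    """An experiment is ready once every variant has at least min_samples runs."""
    return all(n >= min_samples for n in variant_runs.values())


# Counts taken from the report above.
print(progress_bar(25, 50))                        # tone_variant, narrative
print(progress_bar(97, 20))                        # caveman, variant "no"
print(is_ready({"no": 97, "yes": 96}, 20))         # caveman·smoke-copilot
print(is_ready({"narrative": 25}, 50))             # still collecting
```

Under these assumptions the sketch reproduces every bar listed in the report, including the capped bars for experiments that are already past min_samples.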
🔍 Detailed Analysis — READY Experiments
caveman·smoke-copilot
Balance: ✅ balanced (chi2=0.01, p=0.9002) · min_samples=20 · total runs=193
no ← ctrl: n=97 (50.3%) ████████ 97/20
yes: n=96 (49.7%) ████████ 96/20
Rec: READY_FOR_ANALYSIS

subagent_model·smoke-copilot
Balance: ✅ balanced (chi2=0.01, p=0.8978) · min_samples=20 · total runs=173
small ← ctrl: n=86 (49.7%) ████████ 86/20
large: n=87 (50.3%) ████████ 87/20
Rec: READY_FOR_ANALYSIS

sub_agent_strategy·smoke-gemini
Balance: ⚠️ IMBALANCED (chi2=7.71, p=0.0055) · min_samples=30 · total runs=150
single_agent ← ctrl: n=58 (38.7%) ████████ 58/30
sub_agents: n=92 (61.3%) ████████ 92/30
Rec: EXTEND (fix balance first)

sub_agent_strategy·smoke-antigravity
Balance: ✅ balanced (chi2=0.21, p=0.6539) · min_samples=30 · total runs=121
single_agent ← ctrl: n=58 (47.9%) ████████ 58/30
sub_agents: n=63 (52.1%) ████████ 63/30
Rec: READY_FOR_ANALYSIS

caveman·smoke-copilot-aoai-apikey
Balance: ✅ balanced (chi2=0.01, p=0.8713) · min_samples=20 · total runs=69
yes ← ctrl: n=35 (50.7%) ████████ 35/20
no: n=34 (49.3%) ████████ 34/20
Rec: READY_FOR_ANALYSIS
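
The balance figures above look like a chi-square goodness-of-fit test of the two variant counts against an even split. The following is a rough sketch of that check using only the Python standard library. The function name `balance_check`, the 50/50 expected allocation, and the `alpha=0.05` threshold are assumptions; the report does not state how its p-values are computed, so small differences from the reported values are possible.

```python
import math


def balance_check(n_a: int, n_b: int, alpha: float = 0.05) -> tuple[float, float, bool]:
    """Chi-square goodness-of-fit test of two variant counts against a 50/50 split.

    Returns (chi2, p_value, balanced). Only valid for exactly two variants
    (one degree of freedom). alpha is an assumed significance threshold.
    """
    expected = (n_a + n_b) / 2
    chi2 = ((n_a - expected) ** 2 + (n_b - expected) ** 2) / expected
    # With one degree of freedom, the chi-square survival function
    # reduces to erfc(sqrt(chi2 / 2)).
    p_value = math.erfc(math.sqrt(chi2 / 2))
    return chi2, p_value, p_value >= alpha


# Counts from sub_agent_strategy·smoke-gemini (flagged IMBALANCED above).
chi2, p, balanced = balance_check(58, 92)
print(f"chi2={chi2:.2f}, p={p:.4f}, balanced={balanced}")

# Counts from sub_agent_strategy·smoke-antigravity (reported as balanced).
chi2, p, balanced = balance_check(58, 63)
print(f"chi2={chi2:.2f}, p={p:.4f}, balanced={balanced}")
```

For the smoke-gemini counts this gives chi2 ≈ 7.71, matching the reported statistic, and flags the experiment as imbalanced. For the smoke-antigravity counts it gives chi2 ≈ 0.21 and a balanced verdict.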
Warning
Firewall blocked 2 domains
The following domains were blocked by the firewall during workflow execution:
proxy.golang.org
releaseassets.githubusercontent.com
See Network Configuration for more information.