[experiment] generate a single final report instead of separate sections #114

vbarda · 2025-06-06T20:02:22Z

No description provided.

* add evals
* update input formatting
* add prompt caching
* gitignore
* allow returning source string
* add groundedness eval
* add option to summarize search results
* add option to split & rerank webpage chunks
* rename justification -> reasoning
* caching for summarization
* caching for summarization
* separate keys
* retry
* add generate report helper
* update evaluator & add date to system prompt
* add multi-agent helper
* add multi-agent helper
* propagate retrieved source from multi-agent
* fixes
* split evals files & add overall quality eval
* add MCP support (#112)
* bump requirements
* Add Question tool, update prompt
* nits
* Add file for testing
* Update
* rename & make question tool optional
* improve prompts
* Update README
* Improvements
* Minor updates
* Add evaluation script, change config default to sonnet-3-5
* Update tests, config, Anthropic version
* Updates

---------

Co-authored-by: vbarda <[email protected]>
Co-authored-by: Vadym Barda <[email protected]>
Co-authored-by: Nick Huang <[email protected]>

vbarda and others added 11 commits June 6, 2025 15:33

[experiment] generate a single final report instead of separate sections

80886e1

Add evals, MCP, and improvements (#111)

db01c7e

--------- Co-authored-by: Lance Martin <[email protected]>

Fix

00363b1

Merge remote-tracking branch 'origin' into vb/generate-final-report-end-to-end

75910d3

Fixes

df50fd6

Updates

44505f9

Update

0082943

Merge branch 'vb/evals-and-improvements' into vb/generate-final-report-end-to-end

6629737

Fix research agent bug

b0f9975

Get one-shot generation working

b102b3a
