logo

Command Palette

Search for a command to run...

Agentic Deep Researcher

benchmarks
other-benchmarks

A Deep Researcher system that iterates and updates its research path as it finds new information across web pages and PDFs, including reports and academic papers.

It scored 47.29% in the overall category, above the OpenAI o3 researcher in the Deep Research Bench. (I couldn't run all the benchmark queries as I ran out of API credits.)

Check out this example report written by the agent:

Architecture

The entire system consists of two main components: the Researcher Agent and the Report Writer Agent.

The Researcher Agent carries out the research process in iterations. It first breaks down the query using first principles and creates an initial plan for the first iteration. It then finds 15–20 web sources and PDFs to gather relevant information and stores this in a knowledge base. This information is then used to make the plan for the next iteration. This process continues for 8–10 iterations until the topic has been covered with sufficient breadth and depth.

The Writer Agent uses the knowledge base to write a report. First, a planner agent creates a structure for the report, including its sections and subsections. Then, individual section writer agents write the sections, and the final report is generated. I noticed that dividing the report into smaller sections and writing them separately produces a better report than writing it all at once, although this does increase the cost somewhat due to repetitive input context.

Comparison's from other agents

Research is an iterative process. When you start searching, you find something new and then you research more about the new stuff. This iterative workflow has allowed the system write better reports.

Still a lot of prompt and cost optimization to do.