HCDSA

本项目实现了一个高并发的深度搜索智能体（High-Concurrency Deep Search Agent），能够支持高并发请求下的并行搜索，实现高效推理。项目为清华大学深圳国际研究生院《大数据系统基础（B）》课程大项目。

项目框架如下所示：

本项目实现了一个基于 AgentScope 框架的高并发深度搜索智能体（HCDSA），旨在解决传统大模型串行搜索效率低下的痛点。系统能够将复杂的调研问题自动拆解为可并行的子任务 DAG 图，利用多线程并发执行网络搜索；同时结合 RAG（检索增强生成） 技术与本地缓存策略，有效减少重复的外部 API 调用并降低成本。此外，项目集成了 API 连接池机制以突破服务商频率限制，最终能够自动合成逻辑严密、数据详实的深度研究报告。

1 项目背景

大模型搜索和检索增强（RAG）是一项非常重要的工具，能够帮助大模型不需要训练和微调就能实现规范化可信的输出。当前的大语言模型搜索基本都是采用串行策略（如GPT、Gemini等），因为串行策略，大模型能够基于上一个回答动态地修改下一次的搜索prompt细节。然而，很多搜索是可以并行的，尤其是小领域微调模型，比如：

调研2022、2023、2024，中、美、英、法国的经济增长趋势和通货膨胀率的关系
子任务-->
1. 2022 中国的 经济增长趋势 
2. 2022 中国的 通货膨胀率
……

这里，至少能拆分出$3\times4\times2=24$个可以并行的子任务。因此，我们可以先绘制概念DAG图，对于同层的问题展开并行。

此外，在小领域专用大语言模型，用户经常会提问相似的问题，比如：

用户A：调研2022、2023、2024，中、美、英、法国的经济增长趋势和通货膨胀率的关系
用于B：比较2022、2023、2024，中、美、英的经济GDP增长率
用户C：2024年，查询通货膨胀率是如何影响到经济增长率的

这些需要深度搜索的问题，模型都需要先填写一样的知识空白，在互联网中查询末年末国具体的数据，因此多查询请求是可以使用缓存策略并行的

2 环境安装

在自己的环境目录下安装 AgentScope，注意不是该项目目录，下面四种方式选择其一即可。

# Make sure the running path is path/to/your/envs/, instead of this project
# From source
git clone -b main https://github.com/agentscope-ai/agentscope.git
cd agentscope
pip install -e .

# Using uv (recommended for faster installs)
git clone -b main https://github.com/agentscope-ai/agentscope.git
cd agentscope
uv pip install -e .

# From PyPi
pip install agentscope

# Or with uv
uv pip install agentscope

为了启动MCP服务需要安装npm。

apt-get install npm
# 确保以下指令能够正常运行
npx -y tavily-mcp@latest   # Tavily MCP server running on stdio

安装该项目及其环境依赖。

git clone https://github.com/Cevaaa/HCDSA.git
cd HCDSA 
# Make sure the running path is HCDSA/,（暂时无需这一步）
pip install -e .

3 运行示例

A runnable example of a Deep Research Agent using Agentscope and an MCP Tavily client.

Quickstart:

Prepare environment variables (see scripts) or export via shell.
Run demo:

# Make sure the running path is HCDSA/
bash scripts/example.sh

Requirements:

Python 3.10+
Environment variables:
- DASHSCOPE_API_KEY
- TAVILY_API_KEY
- AGENT_OPERATION_DIR

注意要先准备千问和tavily的API。

运行后会在指定的AGENT_OPERATION_DIR目录下生成过程文件和输出报告。

4 功能介绍

4.1 单次调研逻辑图建立和子问题并行

运行脚本

bash scripts/example.sh

运行结果

会输出子问题DAG：

==================== Semantic Parallel Plan (Experimental) ====================
- Subtask #1: Retrieve 2023 GDP growth rate for China from official or reputable economic data sources.
  depends_on: []
- Subtask #2: Retrieve 2023 GDP growth rate for the USA from official or reputable economic data sources.
  depends_on: []
- Subtask #3: Retrieve 2023 GDP growth rate for Germany from official or reputable economic data sources.
……
- Subtask #7: Research and summarize the historical GDP and inflation data for China from 2013 to 2023 using official or reputable sources.
  depends_on: [1, 4]
- Subtask #8: Research and summarize the historical GDP and inflation data for the USA from 2013 to 2023 using official or reputable sources.
  depends_on: [2, 5]
……

会展示分层并行和串行的时间差：

[Semantic Execution] Layer 1 - 6 subtasks (parallel)
  - #1: Retrieve 2023 GDP growth rate for China from official or reputable economic data sources.
  - #2: Retrieve 2023 GDP growth rate for the USA from official or reputable economic data sources.
  - #3: Retrieve 2023 GDP growth rate for Germany from official or reputable economic data sources.
  - #4: Retrieve 2023 inflation rate for China from official or reputable economic data sources.
  - #5: Retrieve 2023 inflation rate for the USA from official or reputable economic data sources.
  - #6: Retrieve 2023 inflation rate for Germany from official or reputable economic data sources.
[Semantic Execution] Layer 1 parallel time : 2.49 s
[Semantic Execution] Layer 1 serial estimate: 11.09 s (speedup ~ 4.45x)

4.2 多问题缓存策略

运行脚本

bash scripts/para_test.sh

运行结果

现在可以实现，247个子问题命中49个，命中率20%

4.3 RAG方法降低反复查询Tavily的次数

运行脚本

bash scripts/rag_test.sh

或者也可以直接执行下面四条代码中的一个对其进行评估，例如可以执行以下代码：

python scripts/benchmark_rag.py

运行结果

会返回不用rag时的搜索速度：


[Test 1] Running WITHOUT RAG (all queries hit Tavily)...

  Total time      : 31.54 s
  Tavily searches : 15

以及使用rag情况下带来的速度增益：

[Test 2] Running WITH RAG (check RAG first, then Tavily)...
  [SEARCH]   2023 GDP growth rate China...
  [RAG HIT]  2023 GDP growth rate United States...
  [RAG HIT]  2023 GDP growth rate Germany...
  [SEARCH]   2023 inflation rate China...
  [SEARCH]   2023 inflation rate United States...
  [SEARCH]   2023 inflation rate Germany...
  [RAG HIT]  China GDP growth 2023...
  [RAG HIT]  US GDP growth rate 2023...
  [SEARCH]   Germany economic growth 2023...
  [RAG HIT]  China inflation 2023...
  [SEARCH]   United States inflation rate 2023...
  [SEARCH]   German inflation 2023...
  [RAG HIT]  What is China's GDP growth in 2023...
  [SEARCH]   2023 US economic growth rate...
  [RAG HIT]  inflation rate in Germany 2023...

  Total time      : 13.39 s
  Tavily searches : 8
  RAG hits        : 7
  RAG misses      : 8
  Hit rate        : 46.7%

假如想长期持久化保存rag搜索结果，可以使用--persist-pah参数永久保存rag_cache

# 使用持久化
python scripts/benchmark_rag.py --persist-path ./test/rag_cache.json

假如想手动设置相似度阈值，可以调整--threshold参数进行调整

# 调整相似度阈值
python scripts/benchmark_rag.py --threshold 0.6

最后会返回命中率和平均时间的统计：

[Test 3] RAG Retrieval Performance...
  Total queries        : 15
  Hits                 : 9
  Misses               : 6
  Avg similarity score : 0.774
  Avg retrieval time   : 3.088 ms

4.4 高并发优化：添加APIpool解决模型和MCP服务商请求频率瓶颈问题

目前在multi_query_benchmark_w_APIpool.py中添加了APIpool，解决了模型和MCP服务商请求频率瓶颈问题。可以通过para_test_w_APIpool.sh脚本进行测试。由于目前的query最多设置为10个并发查询，远远没有达到服务商的QPS上限。因此当前默认LLM APIpool和MCP服务商APIpool的大小分别2和2，仅用于功能实现。

Acknowledge

感谢小组成员的付出。
特别感谢老师和助教的帮助和指导。
感谢所有为本项目做出贡献的开发者们：

Name		Name	Last commit message	Last commit date
Latest commit History 22 Commits
deep_research_agent		deep_research_agent
inputs		inputs
scripts		scripts
tests		tests
.env.example		.env.example
.gitignore		.gitignore
README.md		README.md
generated_questions.csv		generated_questions.csv
pipeline_HCDSA.png		pipeline_HCDSA.png
pyproject.toml		pyproject.toml
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

HCDSA

1 项目背景

2 环境安装

3 运行示例

4 功能介绍

4.1 单次调研逻辑图建立和子问题并行

4.2 多问题缓存策略

4.3 RAG方法降低反复查询Tavily的次数

4.4 高并发优化：添加APIpool解决模型和MCP服务商请求频率瓶颈问题

Acknowledge

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

HCDSA

1 项目背景

2 环境安装

3 运行示例

4 功能介绍

4.1 单次调研逻辑图建立和子问题并行

4.2 多问题缓存策略

4.3 RAG方法降低反复查询Tavily的次数

4.4 高并发优化：添加APIpool解决模型和MCP服务商请求频率瓶颈问题

Acknowledge

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages