Visualization Analysis

Multi-dimensional statistics based on 68 papers

Year Distribution

Papers  Year
1       2005
1       2013
1       2019
1       2020
2       2021
2       2023
17      2024
37      2025
6       2026
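The year counts above add up to the stated corpus size of 68, and most of the papers are recent. A minimal sketch of that arithmetic follows; the names `YEAR_COUNTS` and `share_since` are illustrative and not part of the original dataset.

```python
# Per-year paper counts copied from the Year Distribution figures above.
YEAR_COUNTS = {
    2005: 1, 2013: 1, 2019: 1, 2020: 1, 2021: 2,
    2023: 2, 2024: 17, 2025: 37, 2026: 6,
}


def share_since(counts, first_year):
    """Return (recent, total, percent) for papers published in first_year or later.

    Hypothetical helper for illustration only.
    """
    total = sum(counts.values())
    recent = sum(n for year, n in counts.items() if year >= first_year)
    return recent, total, round(100 * recent / total, 1)


if __name__ == "__main__":
    recent, total, pct = share_since(YEAR_COUNTS, 2024)
    print(f"{recent} of {total} papers ({pct}%) were published in 2024 or later")
```

Under these figures, 60 of the 68 papers date from 2024 onward.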

Paper Type

Papers  Type
45      system
8       benchmark
7       empirical-study
6       survey
1       defense
1       position-paper

Research Scope

Papers  Scope
39      penetration-testing
13      ctf
5       general-cybersecurity
5       vulnerability-assessment
4       defense
2       red-teaming

Agent Framework

Papers  Framework
31      single-agent
21      multi-agent
5       none
3       hierarchical
3       human-in-the-loop
1       Surveys multiple: single-agent, multi-agent, hierarchical, human-in-the-loop
1       single-agent with two variants: fully-autonomous and human-assisted (semi-autonomous)
1       N/A
1       N/A - position paper discussing the evolution from workflow agents to trained agents

Planning Strategy

Papers  Strategy
21      ReAct
15      none
5       RL-based
4       chain-of-thought
3       hierarchical-planning
2       tree-of-thought
2       classical-planning
1       Task Coordination Graph (TCG) - a structured acyclic digraph with iterative planning and task sessions, combined with Check and Reflection mechanism for dynamic re-planning upon task failure. Uses LLM-driven plan updates (Algorithm 2) and success-preserving merge (Algorithm 3) to maintain execution continuity.
1       finite-state-machine
1       Task-graph-based planning with a Penetration Task Graph (PTG) modeled as a directed acyclic graph, combined with a Check and Reflection mechanism for iterative plan refinement based on task execution feedback
1       prompt-chaining
1       iterative-prompting
1       Surveys multiple: ReAct, hierarchical planning, Chain-of-Thought, sampling-based strategy exploration
1       POMDP-based
1       game-theoretic-markov-game
1       ReAct (with decomposed summary-thought-action procedures)
1       utility-based-decision-theory
1       ReWOO
1       reactive-event-driven
1       ReAct-style (JSON responses with reasoning field and shell_command field; the agent reflects on output and decides next action each iteration)
1       game-theoretic-equilibrium
1       state-machine-guided RAG with reflexion-inspired self-reflection

Automation Level

Papers  Level
50      fully-autonomous
6       human-in-the-loop
6       semi-autonomous
2       copilot
1       Surveys full spectrum: fully-autonomous, semi-autonomous, human-in-the-loop, copilot (vibe-hacking)
1       fully-autonomous and semi-autonomous (two separate agent architectures evaluated)
1       N/A

Attack Phase Coverage

Penetration-testing phases covered by each paper (68 papers in total)

Papers  Phase
59      reconnaissance
51      scanning
58      enumeration
64      exploitation
25      post exploitation
31      privilege escalation
17      lateral movement
14      reporting
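Because a single paper can cover several phases, these counts do not sum to 68; each one is better read as a share of the corpus. A minimal sketch of that conversion is below. It assumes the counts listed above and the stated total of 68 papers, and the names `PHASE_COUNTS` and `coverage_share` are illustrative, not taken from the source.

```python
# Phase counts copied from the list above; 68 is the stated corpus size.
TOTAL_PAPERS = 68

PHASE_COUNTS = {
    "reconnaissance": 59,
    "scanning": 51,
    "enumeration": 58,
    "exploitation": 64,
    "post exploitation": 25,
    "privilege escalation": 31,
    "lateral movement": 17,
    "reporting": 14,
}


def coverage_share(counts, total):
    """Return each phase's share of the corpus as a percentage (one decimal).

    Hypothetical helper for illustration only.
    """
    return {phase: round(100 * n / total, 1) for phase, n in counts.items()}


if __name__ == "__main__":
    shares = coverage_share(PHASE_COUNTS, TOTAL_PAPERS)
    # Print phases from most to least covered.
    for phase, pct in sorted(shares.items(), key=lambda kv: kv[1], reverse=True):
        print(f"{phase:<22}{PHASE_COUNTS[phase]:>3} / {TOTAL_PAPERS}  {pct:5.1f}%")
```

Read this way, exploitation appears in roughly 94% of the papers, while lateral movement and reporting each appear in a quarter or fewer.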