Multi-Agent PDF Analysis System

A Streamlit and LangGraph based PDF analysis application powered by DeepSeek. The system combines Retrieval-Augmented Generation (RAG), dynamic agent planning, conversation memory, multi-agent collaboration, and lightweight reflection/retry to answer questions over one or more uploaded PDFs.

This project is designed as a practical AI engineering portfolio project. It focuses on clear agent responsibilities, observable workflow traces, graceful error handling, and maintainable LLM client abstraction.

Features

Upload and analyze one or more PDF files
Ask questions grounded in retrieved PDF content
Summarize documents
Extract keywords and key concepts
Compare multiple PDFs
Retrieve supporting sources and page references
Add stable citation IDs such as [paper.pdf, page 3] for retrieved PDF evidence
Maintain recent conversation and summarized long-term chat context
Dynamically select specialist agents with an LLM planner
Coordinate multi-agent outputs with a Collaboration Agent
Use lightweight reflection/retry when important information is missing
Synthesize final answers with a Reviewer Agent
Display workflow traces in the Streamlit UI
Centralize DeepSeek client configuration and error handling

Architecture

User Question
    |
    v
Streamlit UI
    |
    v
Session State
    |
    +-- uploaded PDFs
    +-- chat history
    |
    v
LangGraph Workflow
    |
    v
Planner Agent
    |
    v
Specialist Agent(s)
    |
    +-- Retriever -> FAISS PDF chunks
    |
    +-- Memory Manager -> conversation/session context
    |
    v
LLM Prompt Context
    |
    v
Workflow Controller
    |
    +-- single-agent output -------> Reviewer Agent ---> Final Answer
    |
    +-- multi-agent output --------> Collaboration Agent
                                      |
                                      v
                                  Revision Controller
                                      |
                                      +-- retry needed ---> Specialist Agent(s)
                                      |
                                      +-- no retry -------> Reviewer Agent
                                                              |
                                                              v
                                                         Final Answer

The Memory Manager is not a standalone LangGraph node. It is one of the context sources used when specialist agents call the LLM. Retrieved PDF chunks and conversation/session memory are combined into the final prompt context.

Agent Design

Planner Agent

The Planner Agent analyzes the user request and returns an ordered list of specialist steps.

Example:

User: Compare these papers and provide sources.
Planner: ["compare", "source"]

Supported specialist steps:

qa
source
summary
keyword
compare

Specialist Agents

Each specialist agent owns one focused capability:

QA Agent: answers user questions using retrieved PDF context
Source Agent: retrieves supporting sources, filenames, and page numbers
Summary Agent: generates document summaries
Keyword Agent: extracts important concepts and terminology
Compare Agent: compares multiple PDFs across topics, methods, findings, and conclusions

Collaboration Agent

The Collaboration Agent runs only when multiple specialist outputs are available. It coordinates agent outputs before final review.

It returns structured collaboration notes:

{
    "answer": "...",
    "issues": [...],
    "missing_information": [...],
    "recommendations": [...],
    "needs_revision": false,
    "next_steps": []
}

The collaboration layer helps the system:

merge overlapping findings
detect contradictions
identify missing evidence
decide whether another specialist step is worth running
avoid unnecessary retry for simple single-agent requests

Revision Controller

The Revision Controller reads the Collaboration Agent decision and controls retry behavior.

Current retry policy:

MAX_REVISION_COUNT = 1

This keeps the workflow adaptive without allowing infinite loops.

Reviewer Agent

The Reviewer Agent synthesizes specialist outputs and collaboration notes into one final answer for the user. It is responsible for final answer quality and natural presentation, not planning or retry decisions.

Workflow Examples

Single-Agent Question

User asks a direct question
    |
Planner -> ["qa"]
    |
QA Agent
    |
Reviewer Agent
    |
Final Answer

The system skips collaboration and retry for simple requests to reduce latency and token cost.

Multi-Agent Question With Collaboration

User asks for comparison and sources
    |
Planner -> ["compare", "source"]
    |
Compare Agent
    |
Source Agent
    |
Collaboration Agent
    |
Revision Controller
    |
Reviewer Agent
    |
Final Answer

Multi-Agent Question With Lightweight Retry

Planner -> ["compare", "source"]
    |
Compare Agent + Source Agent
    |
Collaboration Agent detects missing summary
    |
Revision Controller adds "summary"
    |
Summary Agent
    |
Collaboration Agent
    |
Reviewer Agent
    |
Final Answer

RAG Pipeline

Uploaded PDFs
    |
PyPDFLoader
    |
Text splitting
    |
Embeddings
    |
FAISS vector store
    |
Retriever tools
    |
Specialist agents

The system retrieves relevant PDF chunks before calling the LLM, so answers are grounded in uploaded document content.

Memory

The memory manager provides context for follow-up questions:

Recent conversation memory
LLM-generated summary of older conversation history
Current PDF session memory based on all uploaded PDF filenames
Cached conversation summaries

If memory summarization fails, the system falls back gracefully instead of stopping the workflow.

Retrieved PDF chunks are handled separately as Relevant PDF Context, so session memory describes which documents are available while retrieval context describes which chunks were selected for the current question.

UI Workflow Trace

The Streamlit UI shows a workflow trace for each assistant answer, including:

planned specialist steps
whether collaboration was skipped or used
Collaboration Agent status
detected issues
missing information
recommendations
revision attempts
requested retry steps

This makes the agent behavior observable during demos and debugging.

Error Handling

The project includes graceful fallback paths:

Planner failure falls back to ["qa"]
Collaboration failure skips retry and preserves available agent outputs
Memory summary failure skips summary memory
Reviewer failure returns raw specialist outputs
Specialist agent failures are recorded without stopping the entire workflow
Streamlit catches graph-level failures and returns a readable error message

Source Citations

Retrieved PDF chunks are formatted with stable citation IDs before they are sent to the LLM:

Citation: [example.pdf, page 3]
PDF: example.pdf
Page: 3
Content:
...

The assistant can then cite evidence using IDs such as [example.pdf, page 3], and the Streamlit UI displays each source with its filename, page number, and excerpt.

Temporary File Handling

Uploaded PDFs are written to temporary files only long enough for PyPDFLoader to read them. After loading, the app removes the temporary PDF path and also performs cleanup when the user clears all PDFs and chat history.

Technology Stack

Area	Technology
Language	Python
UI	Streamlit
Agent workflow	LangGraph
LLM provider	DeepSeek via OpenAI-compatible API
PDF loading	LangChain PyPDFLoader
Vector database	FAISS
Retrieval	RAG
Embeddings	HuggingFace sentence-transformers
Environment config	python-dotenv

Project Structure

PDF-Agent/
|-- agent/
|   |-- graph_agent.py
|   |-- state.py
|   |-- llm_planner.py
|   |-- collaboration_agent.py
|   |-- reviewer_agent.py
|   |-- qa_agent.py
|   |-- source_agent.py
|   |-- summary_agent.py
|   |-- keyword_agent.py
|   |-- compare_agent.py
|
|-- tools/
|   |-- pdf_qa_tool.py
|   |-- source_tool.py
|   |-- pdf_summary_tool.py
|   |-- keyword_tool.py
|   |-- compare_tool.py
|
|-- utils/
|   |-- llm_client.py
|   |-- deepseek_client.py
|   |-- final_synthesizer.py
|   |-- memory.py
|   |-- text_splitter.py
|   |-- vectorstore.py
|
|-- main.py
|-- requirements.txt
|-- README.md

Installation

Install dependencies:

pip install -r requirements.txt

Create a .env file:

DEEPSEEK_API_KEY=your_deepseek_api_key
DEEPSEEK_MODEL=deepseek-v4-flash

Run the application:

streamlit run main.py

Portfolio Highlights

This project demonstrates:

LangGraph stateful workflow design
Dynamic LLM-based agent planning
RAG over uploaded PDF documents
Multi-agent specialization
Collaboration-based output coordination
Lightweight reflection/retry with bounded revision count
Centralized LLM client abstraction
Graceful error handling and fallback design
Observable workflow tracing in the UI

Future Improvements

Add automated evaluation examples
Add unit tests for planner parsing and collaboration retry decisions
Add optional parallel execution for independent specialist agents
Add structured logging for agent workflow events
Improve source citation formatting
Add multimodal PDF support for figures and tables

License

This project is intended for educational and portfolio use.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Multi-Agent PDF Analysis System

Features

Architecture

Agent Design

Planner Agent

Specialist Agents

Collaboration Agent

Revision Controller

Reviewer Agent

Workflow Examples

Single-Agent Question

Multi-Agent Question With Collaboration

Multi-Agent Question With Lightweight Retry

RAG Pipeline

Memory

UI Workflow Trace

Error Handling

Source Citations

Temporary File Handling

Technology Stack

Project Structure

Installation

Portfolio Highlights

Future Improvements

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 16 Commits
agent		agent
tools		tools
utils		utils
.gitignore		.gitignore
README.md		README.md
main.py		main.py
requirements.txt		requirements.txt

Folders and files

Latest commit

History

Repository files navigation

Multi-Agent PDF Analysis System

Features

Architecture

Agent Design

Planner Agent

Specialist Agents

Collaboration Agent

Revision Controller

Reviewer Agent

Workflow Examples

Single-Agent Question

Multi-Agent Question With Collaboration

Multi-Agent Question With Lightweight Retry

RAG Pipeline

Memory

UI Workflow Trace

Error Handling

Source Citations

Temporary File Handling

Technology Stack

Project Structure

Installation

Portfolio Highlights

Future Improvements

License

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages