PDF Reading and Analysis

Verified · Safe

Extracts and analyzes text and visual content from PDF files. Supports summarizing, extracting specific information (tables, code, references), and comparing multiple PDFs. Includes a Python utility for batch operations. Best used when the user provides a PDF file path or requests PDF content analysis.

by Skills Guide Bot
Productivity · Beginner
406/2/2026
Claude Code
#pdf #text-extraction #document-summarization #file-reading

Our review

This skill enables reading, extracting, and analyzing PDF documents directly within Claude Code.

Strengths

  • Native PDF reading without external tools
  • Extraction of both text and visual content
  • Ability to summarize and compare multiple PDFs

Limitations

  • No OCR for scanned PDFs
  • Issues with copy-protected PDFs
  • Large PDFs (100+ pages) may require pagination

When to use it

Use this skill when you need to extract or analyze text content from a PDF file.

When not to use it

Avoid this skill if the PDF is a scanned image or encrypted, as extraction will be limited.

Security analysis

Safe
Quality score: 90/100

The skill instructs the agent to read PDF files using a native Read tool and optionally run a Python utility with uv. There are no destructive commands, no exfiltration, and no obfuscation. The utility is a local tool for PDF operations, posing no inherent security risk.

No concerns found

Examples

Summarize a PDF:
  Summarize the key points from /path/to/report.pdf

Extract tables:
  Extract all tables from /path/to/data.pdf and format them as markdown

Compare two PDFs:
  Compare /path/to/v1.pdf and /path/to/v2.pdf and list the differences

name: pdf
description: This skill should be used when the user asks to "read pdf", "view pdf", "extract text from pdf", "summarize pdf", or shares a PDF file path.

/pdf

Read, extract, and analyze PDF documents.

Instructions

When the user provides a PDF path or asks about PDF content:

1. Reading PDFs

Use the Read tool directly on the PDF file. Claude Code supports reading PDF files natively:

Read: /path/to/document.pdf

This extracts both text and visual content for analysis.

2. Common Tasks

Summarize: Provide a concise summary of the document, covering:

  • Key points and main arguments
  • Document structure overview
  • Important figures, tables, or data

Extract specific info: Find and extract:

  • Tables (convert to markdown format)
  • Code snippets
  • Citations/references
  • Contact information
  • Dates and deadlines
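The table conversion mentioned above can be sketched in a few lines of Python. This is a minimal illustration, not part of the skill: the function name `rows_to_markdown` and the sample rows are hypothetical, and it assumes the table has already been extracted as a list of rows with the header first.

```python
def rows_to_markdown(rows):
    """Render a list of rows (first row is the header) as a markdown table."""
    if not rows:
        return ""
    width = max(len(row) for row in rows)

    def render(row):
        # Escape pipes so a cell cannot break the table, and pad short rows.
        cells = [str(cell).strip().replace("|", "\\|") for cell in row]
        cells += [""] * (width - len(cells))
        return "| " + " | ".join(cells) + " |"

    header, *body = rows
    separator = "| " + " | ".join(["---"] * width) + " |"
    return "\n".join([render(header), separator, *(render(r) for r in body)])


# Hypothetical rows, as they might come out of a PDF table.
print(rows_to_markdown([["Quarter", "Revenue"], ["Q1", "120"], ["Q2", "135"]]))
```

Padding short rows keeps the output valid even when extraction drops a trailing cell, which is a common failure with PDF tables.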

Compare PDFs: When given multiple PDFs:

  • Identify differences
  • Highlight common themes
  • Cross-reference information
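For the "identify differences" step, one possible approach is a line-level diff of the text extracted from each file. The sketch below uses Python's standard `difflib`; the helper name `diff_texts` and the sample strings are assumptions for illustration, and it presumes the text of both PDFs has already been extracted.

```python
import difflib


def diff_texts(old_text, new_text, old_name="v1.pdf", new_name="v2.pdf"):
    """Return unified-diff lines between two extracted texts."""
    return list(
        difflib.unified_diff(
            old_text.splitlines(),
            new_text.splitlines(),
            fromfile=old_name,
            tofile=new_name,
            lineterm="",  # inputs have no trailing newlines, so add none
        )
    )


# Hypothetical extracted text from two versions of a document.
old = "Budget: 10k\nDeadline: March 1"
new = "Budget: 12k\nDeadline: March 1"
print("\n".join(diff_texts(old, new)))
```

A line diff only catches textual changes; layout or image differences between versions would still need a visual read of both files.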

3. Python Utility

For batch operations or programmatic access:

uv run tools/pdf_util.py extract /path/to/file.pdf
uv run tools/pdf_util.py info /path/to/file.pdf
uv run tools/pdf_util.py search /path/to/file.pdf "search term"
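For batch use, the commands above could be wrapped in a small Python driver. This is a sketch under stated assumptions: only the three subcommands and the `tools/pdf_util.py` path come from the skill, while `build_command`, `run_batch`, and the idea that the utility prints its results to stdout are assumptions, since the skill does not document the utility's output.

```python
import subprocess
from pathlib import Path

UTIL = "tools/pdf_util.py"  # path as given in the commands above


def build_command(subcommand, pdf_path, *extra):
    """Build the argument list for one pdf_util.py invocation."""
    return ["uv", "run", UTIL, subcommand, str(pdf_path), *extra]


def run_batch(folder, subcommand="info"):
    """Run one subcommand on every PDF in a folder.

    Returns {filename: stdout}. Assumes the utility writes its result to
    stdout and exits non-zero on failure (not confirmed by the skill).
    """
    results = {}
    for pdf in sorted(Path(folder).glob("*.pdf")):
        proc = subprocess.run(
            build_command(subcommand, pdf), capture_output=True, text=True
        )
        if proc.returncode == 0:
            results[pdf.name] = proc.stdout
        else:
            results[pdf.name] = f"error: {proc.stderr.strip()}"
    return results
```

Passing the arguments as a list rather than a shell string means paths and search terms with spaces need no extra quoting.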

4. Output Format

When presenting PDF content:

  • Use markdown headers for document sections
  • Convert tables to markdown tables
  • Note page numbers for reference: (p. 5)
  • Flag any extraction issues (scanned images, encrypted content)
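The formatting rules above can be illustrated with a short Python sketch. The function `format_pages` and its input shape (one string per page) are hypothetical, and treating an empty page as a likely scanned or protected page is a heuristic, not something the skill specifies.

```python
def format_pages(pages, title="Document"):
    """Format per-page text as markdown with page references.

    `pages` is a list of strings, one per page; index 0 is page 1.
    """
    lines = [f"# {title}", ""]
    issues = []
    for number, text in enumerate(pages, start=1):
        text = text.strip()
        if not text:
            # Heuristic: no text often means a scanned or protected page.
            issues.append(number)
            continue
        lines.append(f"{text} (p. {number})")
        lines.append("")
    if issues:
        listed = ", ".join(str(n) for n in issues)
        lines.append(
            f"Extraction issues: no text found on p. {listed} "
            "(possibly scanned or protected)."
        )
    return "\n".join(lines).rstrip()


# Hypothetical page texts; the second page yielded no text.
print(format_pages(["Intro text", "", "Results"], title="Report"))
```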

5. Limitations

  • Scanned PDFs may have limited text extraction (OCR not included)
  • Some PDFs have copy-protection that prevents text extraction
  • Very large PDFs (100+ pages) may need pagination
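Pagination for very large files can be planned by splitting the document into page ranges and reading one range at a time. The sketch below is illustrative only: `page_ranges` is a hypothetical helper, and the chunk size of 20 pages is an assumed value, not a limit stated by the skill.

```python
def page_ranges(total_pages, chunk_size=20):
    """Split pages 1..total_pages into inclusive (start, end) ranges."""
    if total_pages < 0 or chunk_size < 1:
        raise ValueError("total_pages must be >= 0 and chunk_size >= 1")
    return [
        (start, min(start + chunk_size - 1, total_pages))
        for start in range(1, total_pages + 1, chunk_size)
    ]


# A hypothetical 45-page document read in chunks of 20 pages.
for start, end in page_ranges(45):
    print(f"pages {start}-{end}")
```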