cyberneticlibrary

Work with PDFs - extract, merge, encrypt

pdfskillsetup L127,559
K-Dense-AI/scientific-agent-skills
What it does

Read, merge, split, extract text/tables from PDFs

Best for

Document processing pipelines needing text extraction or PDF combination

Inputs
  • · PDF files
Outputs
  • · extracted text
  • · merged PDFs
  • · separated pages
  • · table DataFrames
Requires
  • · pypdf
  • · pdfplumber
  • · reportlab
  • · poppler
  • · qpdf
Preconditions

PDF file exists; Python or CLI environment available

Failure modes

Corrupted PDF; scanned image without OCR; encrypted with password

Trust signals
  • · pypdf, pdfplumber, reportlab examples provided
  • · CLI tool alternatives (qpdf, pdftk)
  • · subscript/superscript guidance for ReportLab