Skip to main content
Back to registry

extracting-pdf-text

letta-ai/skills

This skill provides tools and guidance for extracting text from PDFs in formats suitable for language model consumption.

Installs178
Install command
npx skills add https://github.com/letta-ai/skills --skill extracting-pdf-text
Security audits
Gen Agent Trust HubPASS
SocketPASS
SnykWARN
About this skill
This skill provides tools and guidance for extracting text from PDFs in formats suitable for language model consumption. Best for: Text-heavy PDFs, speed-critical workflows, basic structure preservation. The script outputs markdown with preserved headings and paragraphs. For LLM-optimized output, it uses pymupdf4llm which formats text for RAG systems. Best for: PDFs with tables, financial documents, structured data. Tables are converted to markdown format. Note: pdfplumber works best on machine-generated PDFs, not scanned documents. Best for: Scanned PDFs when API access is unavailable.

Source description provided by the upstream skill listing. Community reviews and install context appear in the sections below.

Community Reviews

Latest reviews

Sign in to review

No community reviews yet. Be the first to review.

Browse this skill in context
FAQ
What does extracting-pdf-text do?

This skill provides tools and guidance for extracting text from PDFs in formats suitable for language model consumption.

Is extracting-pdf-text good?

extracting-pdf-text does not have approved reviews yet, so SkillJury cannot publish a community verdict.

What agent does extracting-pdf-text work with?

extracting-pdf-text currently lists compatibility with codex, gemini-cli, opencode, cursor, kimi-cli, github-copilot.

What are alternatives to extracting-pdf-text?

Skills in the same category include telegram-bot-builder, flutter-app-size, sharp-edges, iterative-retrieval.

How do I install extracting-pdf-text?

npx skills add https://github.com/letta-ai/skills --skill extracting-pdf-text

Related skills

Alternatives in Software Engineering