doc-to-text

Extract text content from various file formats. Supports PDF, Excel, Word, Images (OCR), Email, and ZIP Archives. Use for summarizing or analyzing binary files.

famaoai-creator 1 2 Updated 1w ago

GitHub

Install

npx skillscat add famaoai-creator/gemini-skills/doc-to-text

Install via the SkillsCat registry.

SKILL.md

Document to Text Converter

Overview

This skill acts as a universal converter to extract plain text and structured data from various binary and complex file formats. It enables Gemini to "read" files that are otherwise inaccessible.

Capabilities

1. Document Extraction

PDF (.pdf): Extracts plain text.
Excel (.xlsx): Converts sheets to CSV and performs OCR on embedded images.
Word (.docx): Extracts text and performs OCR on embedded images.
PowerPoint (.pptx): Extracts slide text and performs OCR on embedded images.

2. Image OCR

Images (.png, .jpg, .jpeg, .webp): Uses Tesseract.js to perform OCR (Optical Character Recognition) and extract text from images. Supports English and Japanese.

3. Data & Archives

Email (.eml): Parses headers (From, To, Subject) and body text.
ZIP Archive (.zip): Lists files and extracts content of text-based files within the archive without extracting to disk.

Usage

To read a file, execute the extract.cjs script with the file path.

node scripts/extract.cjs <path/to/file>

Example:
User: "What does the error screenshot say?"
Action: node scripts/extract.cjs error.png

Dependencies

This skill requires Node.js packages.
Run npm install in the skill directory before using.

Knowledge Protocol

This skill adheres to the knowledge/orchestration/knowledge-protocol.md. It automatically integrates Public, Confidential (Company/Client), and Personal knowledge tiers, prioritizing the most specific secrets while ensuring no leaks to public outputs.

doc-to-text

Install

Document to Text Converter

Overview

Capabilities

1. Document Extraction

2. Image OCR

3. Data & Archives

Usage

Dependencies

Knowledge Protocol

Install

Recommended Skills