study-assistant

Study assistant workflow for lecture-slide exam prep using the `pdfocr` CLI. Use when a task involves reading PDF slides, transcribing slide text, cleaning OCR output, and generating exam-focused deliverables such as study notes, lecture-style explanations, ELI5 explanations, flashcards, Mermaid mind maps, quizzes, essay questions, or one-step PDF-to-notes output.

planetis-m 5 Updated 4mo ago

GitHub

Install

npx skillscat add planetis-m/study-assistant

Install via the SkillsCat registry.

SKILL.md

Study Assistant

Follow this workflow exactly to convert lecture material into exam-ready outputs.

Select Mode

Map the user request to one mode:

transcribe: Convert PDF slides into markdown while keeping educational text verbatim.
analyze: Generate structured study notes from provided text.
lecture: Turn provided content into cohesive professor-style teaching narrative.
eli5: Explain provided material in plain English while keeping technical depth.
flashcard: Generate two-column markdown flashcards.
mindmap: Generate Mermaid mindmap only.
quiz: Generate mixed quiz and answer key.
essay: Generate 3-4 essay prompts and sample answers.
study-notes: End-to-end pipeline (OCR from PDF, then generate notes in one pass).

Session OCR Cache

For PDF-based requests, avoid repeated OCR in the same session by using a local cache.

Cache directory: .study-assistant-cache under current workspace.
Cache key inputs:
- Absolute PDF path
- Page selector (all-pages or explicit range string)
- Source file mtime and size
Cache files:
- <key>.raw.jsonl: original pdfocr output, one JSON object per page
- <key>.meta: key inputs for traceability

Workflow:

Before OCR, follow references/ocr-cache.md to check cache.
If cache hit, reuse cached JSONL and skip pdfocr.
If cache miss, run pdfocr and write raw JSONL cache for future mode requests.
Re-run OCR only when PDF changed or page selection changed.

Process PDF Input

If the source is a PDF, always run pdfocr through shell execution.

Before first OCR call:

Check availability with command -v pdfocr.
If pdfocr is missing, attempt install by following references/pdfocr-install.md.
Install only to user-home absolute paths ($HOME/.local/...), never ./.local in workspace.
Retry command -v pdfocr after installation.
If still missing, stop and report the failed install attempt plus the exact command/output.
Verify credentials:
- Prefer DEEPINFRA_API_KEY environment variable.
- If env key is missing, resolve the real binary path first, then check config.json in that real binary directory.
- Do not check config.json next to a symlink wrapper path such as $HOME/.local/bin/pdfocr.
- If neither is configured, stop and ask user for DeepInfra API key setup.
Ask user permission before every networked OCR execution:
- Request unrestricted network/escalated execution first.
- Do not run a sandboxed pdfocr attempt as a probe when network access is required.
Never read PDFs with direct file readers or ad-hoc parsers.
Use full document extraction:
- pdfocr INPUT.pdf --all-pages
If page ranges are provided, pass them to pdfocr:
- pdfocr INPUT.pdf --pages:"8-20,22-27"
Parse stdout as JSONL:
- Treat each line as one JSON object.
- Keep "text" only for records with "status":"ok".
- Report pages with "status":"error" but continue with successful pages.
- Read cached .raw.jsonl directly; do not generate extra parsed cache files.

Clean OCR Text

Before generation, remove only clear metadata:

Instructor details
Headers and footers
Page numbers
Timestamps
Course codes

Preserve educational content (concepts, definitions, examples). If text is severely fragmented, skip that fragment instead of guessing.

Generate Output

Read and apply mode-specific rules from references/commands.md. Use the section matching the selected mode.

Global rules across all modes:

Base all factual content only on user-provided material and extracted OCR text.
Do not add outside facts, theories, examples, or claims.
Use markdown output.
Use LaTeX with $...$ (inline) and $$...$$ (display) for math.
Do not include conversational intros or conclusions.

For study-notes, do OCR and notes generation in one workflow and do not recursively call other mode names.

study-assistant

Install

Study Assistant

Select Mode

Session OCR Cache

Process PDF Input

Clean OCR Text

Generate Output

Categories

Install

Recommended Skills