Scraping

Web scraping and data extraction

Showing 121-144 of 697 skills
NeverSight

extract-transcripts

by NeverSight

Extract readable transcripts from Claude Code and Codex CLI session JSONL files

Auth 156 4mo ago
raphaelmansuy

playwright-ux-ui-capture

by raphaelmansuy

Capture EdgeQuake WebUI routes with Playwright and write artifacts immediately (screenshots + per-page request JSON + capture index). Use when adding/updating Playwright E2E capture specs or when asked to automate UI screenshot collection.

Processing 2K 3mo ago
boshu2

forge

by boshu2

'Mine transcripts for knowledge - decisions, learnings, failures, patterns. Triggers: "forge insights", "mine transcripts", "extract knowledge".'

Auth 377 3mo ago
krafton-ai

web-navigation-strategies

by krafton-ai

Strategic navigation patterns and selector guides for thorough web exploration using Playwright MCP. Provides decision trees, navigation strategies, and site-specific selectors for reading multiple pages systematically. Use when planning how to navigate websites, determining reading depth, or finding the right selectors for Playwright MCP commands.

Scraping 887 3mo ago
Gentleman-Programming

typescript

by Gentleman-Programming

TypeScript strict patterns and best practices. Trigger: When writing TypeScript code - types, interfaces, generics.

Code Gen 1.8K 4mo ago
Gentleman-Programming

typescript

by Gentleman-Programming

TypeScript strict patterns and best practices. Trigger: When writing TypeScript code - types, interfaces, generics.

Code Gen 1.8K 4mo ago
adobe

scrape-webpage

by adobe

Scrape webpage content, extract metadata, download images, and prepare for import/migration to AEM Edge Delivery Services. Returns analysis JSON with paths, metadata, cleaned HTML, and local images.

Docker 113 3mo ago
danielmiessler

ExtractWisdom

by danielmiessler

Dynamic wisdom extraction that adapts sections to content. USE WHEN extract wisdom, analyze video, analyze podcast, extract insights, what's interesting, extract from YouTube, what did I miss, key takeaways. Replaces static extract_wisdom with content-adaptive extraction.

Finance 14.6K 3mo ago
phodal

pdf

by phodal

Extract and analyze information from PDF documents

Code Review 4.5K 4mo ago
tradingstrategy-ai

extract-vault-protocol-logo

by tradingstrategy-ai

Extract a logo for vault protocol metadata

Docker 818 4mo ago
darrenhinde

context-manager

by darrenhinde

Context management skill providing discovery, fetching, harvesting, extraction, compression, organization, cleanup, and guided workflows for project context

CLI Tools 4.2K 3mo ago
boshu2

extract

by boshu2

'Extract decisions and learnings from Claude session transcripts. Triggers: "extract learnings", "process pending", SessionStart hook.'

Auth 376 3mo ago
Automattic

test

by Automattic

Testing patterns for PHPUnit and Playwright E2E tests. Use when writing tests, debugging test failures, setting up test coverage, or implementing test patterns for ActivityPub features.

Scraping 570 3mo ago
OneWave-AI

meeting-intelligence-system

by OneWave-AI

Analyze meeting transcripts to extract decisions, action items, blockers, sentiment, and generate follow-up emails. Use when user provides meeting notes, transcripts, or recordings and needs structured summaries or action tracking.

Code Gen 169 7mo ago
krafton-ai

pdf

by krafton-ai

Comprehensive PDF manipulation toolkit for extracting text and tables, creating new PDFs, merging/splitting documents, and handling forms. When Claude needs to fill in a PDF form or programmatically process, generate, or analyze PDF documents at scale.

CLI Tools 887 3mo ago
krafton-ai

email-action-extractor

by krafton-ai

Extract actionable tasks assigned to the user from email text. Filters out informational emails (announcements, newsletters, ads, automated reports) and only processes emails with clear action requests. Handles group emails by identifying user-specific assignments.

Code Review 887 3mo ago
firecrawl

firecrawl

by firecrawl

Official Firecrawl CLI skill for web scraping, search, crawling, and browser automation. Returns clean LLM-optimized markdown. USE FOR: - Web search and research - Scraping pages, docs, and articles - Site mapping and bulk content extraction - Browser automation for interactive pages Must be pre-installed and authenticated. See rules/install.md for setup, rules/security.md for output handling.

Processing 432 3mo ago
zephyrwang6

pdf

by zephyrwang6

Comprehensive PDF manipulation toolkit for extracting text and tables, creating new PDFs, merging/splitting documents, and handling forms. When Claude needs to fill in a PDF form or programmatically process, generate, or analyze PDF documents at scale.

CLI Tools 310 4mo ago
jimmc414

histolab

by jimmc414

Digital pathology image processing toolkit for whole slide images (WSI). Use this skill when working with histopathology slides, processing H&E or IHC stained tissue images, extracting tiles from gigapixel pathology images, detecting tissue regions, segmenting tissue masks, or preparing datasets for computational pathology deep learning pipelines. Applies to WSI formats (SVS, TIFF, NDPI), tile-based analysis, and histological image preprocessing workflows.

Analytics 529 6mo ago
forcedotcom

playwright-e2e

by forcedotcom

writing, running, and debugging Playwright tests. working with their output from github actions

CI/CD 1K 3mo ago
cyberkaida

ctf-rev

by cyberkaida

Solve CTF reverse engineering challenges using systematic analysis to find flags, keys, or passwords. Use for crackmes, binary bombs, key validators, obfuscated code, algorithm recovery, or any challenge requiring program comprehension to extract hidden information.

Processing 742 6mo ago
lackeyjb

playwright-skill

by lackeyjb

Complete browser automation with Playwright. Auto-detects dev servers, writes clean test scripts to /tmp. Test pages, fill forms, take screenshots, check responsive design, validate UX, test login flows, check links, automate any browser task. Use when user wants to test websites, automate browser interactions, validate web functionality, or perform any browser-based testing.

API Dev 2.7K 5mo ago
michalparkola

tapestry

by michalparkola

Unified content extraction and action planning. Use when user says "tapestry <URL>", "weave <URL>", "help me plan <URL>", "extract and plan <URL>", "make this actionable <URL>", or similar phrases indicating they want to extract content and create an action plan. Automatically detects content type (YouTube video, article, PDF) and processes accordingly.

Automation 425 7mo ago
alexei-led

testing-e2e

by alexei-led

E2E testing with Playwright MCP for browser automation, test generation, and UI testing. Use when discussing E2E tests, Playwright, browser testing, UI automation, visual testing, or accessibility testing. Supports TypeScript tests and Go/HTMX web applications.

Accessibility 30 3mo ago