Scraping

Web scraping and data extraction

Showing 169-192 of 697 skills
lingzhi227

slide-generation

by lingzhi227

Convert a completed paper into presentation slides (Beamer LaTeX) or poster. Extract key figures, tables, equations, and create a narrative flow for oral presentation. Identified gap in existing tools — designed from best practices.

Academic 76 3mo ago
colonelpanic8

playwright-cli

by colonelpanic8

Automate browser interactions from the shell using Playwright via the playwright-cli command (open/goto/snapshot/click/type/screenshot, tabs/storage/network). Use when you need deterministic browser automation for web testing, form filling, screenshots/PDFs, or data extraction.

Auth 215 3mo ago
michalparkola

learn-this

by michalparkola

Unified content extraction and action planning. Use when user says "learn-this <URL>", "learn this <URL>", "weave <URL>", "help me plan <URL>", "extract and plan <URL>", "make this actionable <URL>", or similar phrases indicating they want to extract content and create an action plan. Automatically detects content type (YouTube video, article, PDF) and processes accordingly.

Automation 424 3mo ago
wcygan

playwright-cli

by wcygan

Automates browser interactions for web testing, form filling, screenshots, and data extraction. Use when the user needs to navigate websites, interact with web pages, fill forms, take screenshots, test web applications, or extract information from web pages. Keywords: browser, automation, playwright, web testing, screenshot, form, click, navigate, scrape

CLI Tools 192 3mo ago
wcygan

design-research

by wcygan

Research website design, styling, and frontend implementation using Playwright CLI to capture screenshots and analyze structure. Use when a software engineer wants to understand and implement a similar design. Keywords: design research, website design, frontend analysis, UI study, design system, screenshot, Playwright

Design 192 3mo ago
wcygan

refactor-scan

by wcygan

Scan code for refactoring opportunities to improve modularity, testability, extensibility, reusability, understandability, and cognitive complexity. Use when reviewing code quality, identifying technical debt, planning a refactor, or improving code structure. Keywords: refactor, refactoring, code quality, modularity, testability, extensibility, reusability, cognitive complexity, code smell, clean code, SOLID, DRY, coupling, cohesion

Code Review 192 3mo ago
modelscope

pdf

by modelscope

Comprehensive PDF manipulation toolkit for extracting text and tables, creating new PDFs, merging/splitting documents, and handling forms. When Claude needs to fill in a PDF form or programmatically process, generate, or analyze PDF documents at scale.

CLI Tools 4.3K 4mo ago
WenJunDuan

webapp-testing

by WenJunDuan

Toolkit for interacting with and testing local web applications using Playwright. Supports verifying frontend functionality, debugging UI behavior, capturing browser screenshots, and viewing browser logs.

Automation 175 4mo ago
trailofbits

openai-develop-web-game

by trailofbits

'Use when the agent is building or iterating on a web game (HTML/JS) and needs a reliable

Code Review 425 3mo ago
oaustegard

extracting-keywords

by oaustegard

Extract keywords from documents using YAKE algorithm with support for 34 languages (Arabic to Chinese). Use when users request keyword extraction, key terms, topic identification, content summarization, or document analysis. Includes domain-specific stopwords for AI/ML and life sciences. Optional deeper extraction mode (n=2+n=3 combined) for comprehensive coverage.

Processing 125 5mo ago
oaustegard

reviewing-ai-papers

by oaustegard

Analyze AI/ML technical content (papers, articles, blog posts) and extract actionable insights filtered through enterprise AI engineering lens. Use when user provides URL/document for AI/ML content analysis, asks to "review this paper", or mentions technical content in domains like RAG, embeddings, fine-tuning, prompt engineering, LLM deployment.

Embeddings 125 5mo ago
freekmurze

pdf

by freekmurze

Comprehensive PDF manipulation toolkit for extracting text and tables, creating new PDFs, merging/splitting documents, and handling forms. When Claude needs to fill in a PDF form or programmatically process, generate, or analyze PDF documents at scale.

CLI Tools 950 4mo ago
am-will

gemini-computer-use

by am-will

Build and run Gemini 2.5 Computer Use browser-control agents with Playwright. Use when a user wants to automate web browser tasks via the Gemini Computer Use model, needs an agent loop (screenshot → function_call → action → function_response), or asks to integrate safety confirmation for risky UI actions.

CLI Tools 937 4mo ago
aAAaqwq

chrome-automation

by aAAaqwq

Chrome 浏览器自动化操作。当用户需要自动化浏览器操作、网页测试、数据抓取或 UI 自动化时使用此技能。

Debugging 63 3mo ago
vm0-ai

firecrawl

by vm0-ai

Firecrawl web scraping API via curl. Use this skill to scrape webpages, crawl websites, discover URLs, search the web, or extract structured data.

CLI Tools 63 5mo ago
quran

pdf

by quran

Comprehensive PDF manipulation toolkit for extracting text and tables, creating new PDFs,

CLI Tools 1.9K 4mo ago
qdhenry

extract-video-frames

by qdhenry

Extracts frames and timestamped audio segments from video files (GIF, MP4, MOV) at configurable intervals and stores them in a directory with a manifest file. Use when analyzing video content, preparing frames for visual review, extracting audio for transcription, or creating frame+audio sequences for another agent to process.

Agents 1.3K 3mo ago
einverne

chrome-devtools

by einverne

Browser automation, debugging, and performance analysis using Puppeteer CLI scripts. Use for automating browsers, taking screenshots, analyzing performance, monitoring network traffic, web scraping, form automation, and JavaScript debugging.

CLI Tools 117 5mo ago
einverne

cloudflare-browser-rendering

by einverne

Guide for implementing Cloudflare Browser Rendering - a headless browser automation API for screenshots, PDFs, web scraping, and testing. Use when automating browsers, taking screenshots, generating PDFs, scraping dynamic content, extracting structured data, or testing web applications. Supports REST API, Workers Bindings (Puppeteer/Playwright), MCP servers, and AI-powered automation. (project)

Auth 117 5mo ago
blacktop

ipsw

by blacktop

Apple firmware and binary reverse engineering with the ipsw CLI tool. Use when analyzing iOS/macOS binaries, disassembling functions in dyld_shared_cache, dumping Objective-C headers from private frameworks, downloading IPSWs or kernelcaches, extracting entitlements, analyzing Mach-O files, or researching Apple security. Triggers on requests involving Apple RE, iOS internals, kernel analysis, KEXT extraction, or vulnerability research on Apple platforms.

CLI Tools 56 5mo ago
BrianRWagner

brand-voice-extractor

by BrianRWagner

"Extract or build a distinct brand voice profile that AI agents can use to produce on-brand content every time. Two modes: Extract (analyze content you're proud of) or Build (construct a voice from scratch). Outputs a complete voice profile with personality traits, tone spectrum, vocabulary guide, rhythm patterns, and example phrases. Use this before any content creation skill for consistent, human-sounding output."

Performance 315 3mo ago
knoopx

retype

by knoopx

Refactors TypeScript codebases with AST-aware rename, extract, and reference finding. Use for moving functions between files, renaming across codebase, or finding all usages of a symbol.

CLI Tools 57 3mo ago
henkisdabro

webapp-testing

by henkisdabro

Toolkit for interacting with and testing local web applications using Playwright. Supports verifying frontend functionality, debugging UI behaviour, capturing browser screenshots, and viewing browser logs. Use when user asks to test a web app, verify UI, capture screenshots, check browser logs, or debug frontend issues.

Automation 63 3mo ago
trailofbits

openai-playwright

by trailofbits

Use when the task requires automating a real browser from the terminal (navigation, form

Automation 422 3mo ago