Scraping
Web scraping and data extraction
playwright
by devinschumacher
Canonical Playwright hub for E2E tests and ad-hoc browser automation. Use when the user explicitly mentions "Playwright", "@playwright/test", "npx playwright", "playwright.config.ts", "PWDEBUG", "trace viewer", or "toHaveScreenshot". Avoid using for generic browser automation unless Playwright is requested, and avoid using for pure web scraping.
extract-and-export
by Crawlio-app
Use this skill when the user asks to "download and export a site", "crawl and extract content", "archive a website", "export as WARC/ZIP/PDF", or wants a complete crawl-extract-export pipeline. Crawls the site, extracts structured content, and exports in the requested format.
cloudflare-browser-rendering
by brixtonpham
Guide for implementing Cloudflare Browser Rendering - a headless browser automation API for screenshots, PDFs, web scraping, and testing. Use when automating browsers, taking screenshots, generating PDFs, scraping dynamic content, extracting structured data, or testing web applications. Supports REST API, Workers Bindings (Puppeteer/Playwright), MCP servers, and AI-powered automation. (project)
bouncer
by timmyg
"Name's not on the list. Scans every skill, plugin, and MCP server you've got installed, checks their IDs at the door, and walks you through setting up anyone who showed up without credentials."
setup-scheduled-scraper
by sawyerh
Set up a local Playwright + Next.js scraper project with scheduled runs and an optional web viewer. Use when asked to build a project that scrapes specific URLs on a schedule, stores results locally (JSON), and optionally provides a Next.js App Router + Tailwind + Shadcn data viewer. Assumes macOS launchd automation by default.
cendoj-scraper
by cjescudero
Search court rulings in CENDOJ using Playwright automation and network interception (without extracting results from the DOM). Use when you need to look up rulings by free text, court, ROJ identifier, or ECLI identifier, and return structured JSON results from scripts/scraper.py.
extract
by Alexu0317-FATHER
"Scan the current session for cognition/thinking/preference signals and write to extract-buffer.md. Run in the target session (current or --resume'd). Manual invocation only."
nextfriday-types
by next-friday
Next Friday TypeScript patterns for props, interfaces, and return types. Use when defining types or writing function signatures.
complexity
by zhongjis
Use when reducing cognitive complexity, flattening nested code, or simplifying functions. Triggers on "reduce complexity", "simplify", "too nested".
playwright
by zhongjis
Browser automation via Playwright MCP tools. Use when an AI agent needs to interact with web browsers for: (1) web scraping and data extraction, (2) form interaction and submission, (3) browser testing and validation, (4) taking screenshots of web pages, or any other browser automation task.
skill-learn
by rxdaozhang
Consolidate actionable lessons after another skill is used and required multiple correction rounds. Use this skill at task closure to extract concrete failures from the current conversation, propose safe incremental updates to the original skill files, show a diff first, and only apply changes after explicit user confirmation. Works for Claude and Codex skill workflows.
firecrawl
by edxeth
Scrapes and crawls web pages, converting them to clean markdown or structured JSON for LLM consumption. Use when needing to extract content from URLs, crawl entire websites, map site structure, search the web with scraping, or extract structured data from pages. Best for web scraping, site crawling, URL discovery, and converting web content to LLM-ready formats.
extracting-keywords
by arielperez82
Extract keywords from documents using YAKE algorithm with support for
by CodingHeader
Comprehensive PDF manipulation toolkit for extracting text and tables, creating new PDFs, merging/splitting documents, and handling forms. Use when Claude needs to fill in a PDF form or programmatically process, generate, or analyze PDF documents at scale.
qa
by AgustinAlbonico
"Short alias for running E2E tests and manual QA. Use when you want to test the most recently implemented feature with /qa."
personal-context-agent
by Cursedpotential
"Extract and structure personal context from AI chat transcripts into themed markdown files. Use when (1) Processing Claude, Claude Code, or other AI conversation exports, (2) Building personalized AI assistants from chat history, (3) Creating context files for Claude Projects, GPTs, or Gems, (4) Consolidating scattered knowledge from multiple conversations. Optimized for Claude Haiku."
Testing
by tachfineamnay
Jest unit tests, Playwright E2E tests, fixtures, and CI integration.
personal-context-agent
by Cursedpotential
"Extract personal facts about a user from AI chat transcripts and structure into themed markdown files. Use when (1) Processing Claude, Claude Code, or ChatGPT conversation exports, (2) Building personalized AI context from chat history, (3) Creating context files for Claude Projects, GPTs, or Gems. Optimized for Claude Haiku."
by zhuxining
Comprehensive PDF manipulation toolkit for extracting text and tables, creating new PDFs, merging/splitting documents, and handling forms. Use when Claude needs to fill in a PDF form or programmatically process, generate, or analyze PDF documents at scale.
linkedin-scraper
by aspenas
Scrape LinkedIn profiles using the user's Chrome profile. Use when asked to find leads, scrape LinkedIn profiles, extract contact data from LinkedIn, or build prospect lists. Triggers include "find founders on LinkedIn", "scrape this LinkedIn profile", "get LinkedIn data for these people", "build a lead list from LinkedIn".
start-issue
by vkehfdl1
Start working on a Marshroom cart issue — creates branch, injects context, updates status to running
Playwright Web Scraping Skill
by gustavo-p0
Playwright Browser Automation
by yousufjoyian
Claude agent skills and workspace configuration
webapp-testing
by dvorkinguy
Toolkit for interacting with and testing local web applications using Playwright. Supports verifying frontend functionality, debugging UI behavior, capturing browser screenshots, and viewing browser logs.