Scraping

Web scraping and data extraction

e2e-bugfix

by penkzhou

This skill should be used when the user asks to "debug E2E tests", "fix Playwright failures", "fix Cypress tests", "analyze timeout errors", or mentions keywords like "Playwright", "Cypress", "Timeout exceeded", "locator", "selector", "flaky test". It provides the complete bugfix workflow knowledge including error classification, confidence scoring, and E2E-specific debugging techniques.

Debugging 2 7mo ago

tapestry

by Krosebrook

Unified content extraction and action planning. Use when user says "tapestry <URL>", "weave <URL>", "help me plan <URL>", "extract and plan <URL>", "make this actionable <URL>", or similar phrases indicating they want to extract content and create an action plan. Automatically detects content type (YouTube video, article, PDF) and processes accordingly.

Automation 2 8mo ago

refactor

by iulspop

Plans and executes safe refactoring with tests as a safety net. Use when restructuring code, extracting functions, renaming across files, or simplifying complex logic without changing behavior.

File Ops 1 5mo ago

pdf

by Jackiexiao

Use this skill whenever the user wants to work with PDF files: read/extract, merge/split, rotate, watermark, create, fill forms, encrypt/decrypt, image extraction, and OCR.

CLI Tools 1 5mo ago

remotion-best-practices

by xdrshjr

Best practices for Remotion - Video creation in React

Animation 1 5mo ago

crossplane-debug

by kanzifucius

Debug Crossplane compositions, functions (KCL, Go templates, patch-and-transform, auto-ready), and managed resources. Use when troubleshooting composition rendering issues, function errors, resource creation failures, dependency problems, or claim status issues. Supports remote cluster debugging when composition files are not available locally. Triggers on keywords like "crossplane", "composition", "XR", "claim", "function-kcl", "managed resource", or when debugging Kubernetes resources created by Crossplane.

CLI Tools 1 5mo ago

Brian — Knowledge Specialist (Concise)

by mmcmedia

McKinzie decides human sharing

Code Review 1 5mo ago

cloudflare-browser-rendering

by ma1orek

Add headless Chrome automation with Puppeteer/Playwright on Cloudflare Workers. Use when: taking screenshots, generating PDFs, web scraping, crawling sites, browser automation, or troubleshooting XPath errors, browser timeouts, binding not passed errors, session limits, page.evaluate __name errors, or waitForSelector timeout issues.

Cloud 1 5mo ago

yoitao-jimeng-sessionid

by yoitaoai

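Use when you need to obtain or refresh the sessionid cookie from the logged-in session on Jimeng (jimeng.jianying.com). Automatically opens the Jimeng website, checks the login status, and, if logged in, returns the sessionid value to the caller.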

Processing 1 5mo ago

aico-frontend-style-extraction

by yellinzero

Extract design tokens (colors, typography, spacing, effects) from a reference website or screenshot to create a project design system. Unique value: creates a standardized design-system.md file with all design tokens extracted systematically.

Use this skill when:
- User shares a reference website URL and wants to extract its style
- User provides a screenshot or image and asks to "extract design" or "extract style"
- Running /frontend.init and a design system must be created from a reference
- User asks to "create design system", "extract colors", or "extract typography"
- Consistent design tokens need to be established before starting frontend work

Methods: URL (via Playwright MCP screenshot) or direct screenshot analysis.
Output: always written to docs/reference/frontend/design-system.md

Code Gen 1 6mo ago

ogie

by AConfusedBoi

Extract OpenGraph, Twitter Cards, and metadata from URLs or HTML. Use when building link previews, SEO tools, or scraping webpage metadata.

Processing 1 5mo ago

playwright-cli

by mmcmedia

Browser automation via Playwright CLI. Open pages, interact with elements, take screenshots, and more. Ideal for coding agents and automated testing workflows.

Auth 1 5mo ago

social-fetcher

by JNHFlow21

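Unified fetching of social media content (Twitter/X, Xiaohongshu, Douyin). Uses Playwright with a persistent browser context and supports saving login state, so you can log in once and scrape repeatedly.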

Auth 1 4mo ago

playwright-local

by fefogarcia

Build browser automation and web scraping with Playwright on your local machine. Prevents 10 documented errors including CI timeout hangs, extension testing failures, and Ubuntu compatibility issues. Includes stealth mode for anti-bot bypass, authenticated sessions, infinite scroll handling, screenshot/PDF generation, and v1.57 Speedboard performance analysis. Use when: automating browsers, scraping protected sites, testing with real IPs, bypassing bot detection, generating screenshots/PDFs, or troubleshooting "target closed", "page.pause() hangs CI", "permission prompts block tests", or "Ubuntu 25.10 installation" errors.

Debugging 1 5mo ago

e2e-tests

by iulspop

Generates end-to-end tests using Playwright with the "given/should" prose format. Use when writing e2e tests for user flows, page interactions, or integration scenarios that exercise the full application stack.

Processing 1 5mo ago

skills

by vibeindex

Claude Code skills from Vibe Index

API Dev 1 5mo ago

redbook-creator-publish

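by yanquankun

Xiaohongshu post creation and publishing skill. Used to: (1) generate Xiaohongshu-style post content (title + body + tags); (2) obtain or generate images for the post; (3) automatically upload to the Xiaohongshu creator platform. Trigger words: 小红书创作 (Xiaohongshu creation), create redbook, 小红书 (Xiaohongshu), 红书 (Redbook), 笔记创作 (note creation), 帖子创作 (post creation).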

Processing 1 5mo ago

xiaohongshu-automation

by wrt820232

Xiaohongshu automation control - connects to the OpenClaw browser via Playwright CDP to publish posts, search, comment, and more.

Scraping 1 5mo ago

beauty-step1

by Within-7

Document content analysis and merging. Automatically invoked during step 1 of the beauty command to fully understand source document content, extract key information, and establish content structure.

Processing 1 5mo ago

qa-run

by ajaywadhara

8-agent QA loop: browser exploration via Playwright MCP, then analyze, plan, test, audit, heal, expand, snapshot. Quality gate score >= 85 to pass.

Debugging 1 5mo ago

website-crawler

by leobrival

High-performance web crawler for discovering and mapping website structure. Use when users ask to crawl a website, map site structure, discover pages, find all URLs on a site, analyze link relationships, or generate site reports. Supports sitemap discovery, checkpoint/resume, rate limiting, and HTML report generation.

CLI Tools 1 5mo ago

just-scrape

by Jackiexiao

CLI tool for AI-powered web scraping, data extraction, search, and crawling via ScrapeGraph AI. Use when the user needs to scrape websites, extract structured data from URLs, convert pages to markdown, crawl multi-page sites, search the web for information, automate browser interactions (login, click, fill forms), get raw HTML, discover sitemaps, or generate JSON schemas. Triggers on tasks involving: (1) extracting data from websites, (2) web scraping or crawling, (3) converting webpages to markdown, (4) AI-powered web search with extraction, (5) browser automation, (6) generating output schemas for scraping. The CLI is just-scrape (npm package just-scrape).

Processing 1 5mo ago

stock_ticker

by cliuxinxin

Get real-time stock prices and financial info for US stocks (like AAPL, TSLA, NVDA).

Automation 1 6mo ago