Scraping

Web scraping and data extraction

Showing 505-528 of 700 skills

video-downloader

by LeeFeee

通用视频下载工具。基于 yt-dlp 支持从 YouTube、Bilibili、Twitter/X、抖音、小红书等 1000+ 视频网站下载视频并保存到本地。首次使用会自动安装 yt-dlp 依赖。当用户提供视频链接、要求下载视频、或提到"保存视频"、"下载视频"时触发此技能。支持指定输出目录、选择视频质量、仅下载音频等选项。

CLI Tools 3 5mo ago

pdf

by Crumbgrabber

Comprehensive PDF manipulation toolkit for extracting text and tables,

CLI Tools 3 6mo ago

assessment

by borisghidaglia

Fitness and nutrition assessment. Activate when users want to evaluate their training or diet, identify gaps, get an initial assessment, or ask "what am I doing wrong?" or "where should I start?"

Automation 3 5mo ago

playwright-py-skill

by akaihola

Complete browser automation with Playwright. Auto-detects dev servers, writes clean test scripts to /tmp. Test pages, fill forms, take screenshots, check responsive design, validate UX, test login flows, check links, automate any browser task. Use when user wants to test websites, automate browser interactions, validate web functionality, or perform any browser-based testing.

CLI Tools 3 5mo ago

playwright-testing

by C0ntr0lledCha0s

Automatically activated when user works with Playwright tests, mentions Playwright configuration, asks about selectors/locators/page objects, or has files matching *.spec.ts in e2e or tests directories. Provides Playwright-specific expertise for E2E and integration testing.

Auth 3 8mo ago

webapp-testing

by vibery-studio

Toolkit for interacting with and testing local web applications using Playwright. Supports verifying frontend functionality, debugging UI behavior, capturing browser screenshots, and viewing browser logs.

Automation 3 6mo ago

playwright-frontend-testing

by liauw-media

"Use when testing frontend applications. AI-assisted browser testing with Playwright MCP. Fast, deterministic, no vision models needed."

Scraping 3 8mo ago

website-to-vite-scraper

by breverdbidder

Multi-provider website scraper that converts any website (including CSR/SPA) to deployable static sites. Uses Playwright, Apify RAG Browser, Crawl4AI, and Firecrawl for comprehensive scraping. Triggers on requests to clone, reverse-engineer, or convert websites.

Embeddings 5 7mo ago

PDF Processing

by dy9759

Extract text and tables from PDF files, fill forms, merge documents. Use when working with PDF files or when the user mentions PDFs, forms, or document extraction.

Processing 5 8mo ago

playwright-reviewing

by meriley

Review Playwright E2E tests for best practices violations. Detects mocked app data, explicit timeouts, CSS selectors, skipped tests, and assertion anti-patterns. Use when reviewing Playwright PRs or auditing test quality.

Code Review 5 6mo ago

pdf

by QuestForTech-Investments

Comprehensive PDF manipulation toolkit for extracting text and tables, creating new PDFs, merging/splitting documents, and handling forms. When Claude needs to fill in a PDF form or programmatically process, generate, or analyze PDF documents at scale.

CLI Tools 5 8mo ago

Playwright Browser Automation

by QuestForTech-Investments

Automation 5 8mo ago

playwright-writing

by meriley

Write reliable Playwright E2E tests following official best practices. Prioritizes user-facing locators, web-first assertions, and test isolation. NEVER mock application data. Avoid explicit waits unless component-specific. Use when writing, reviewing, or debugging Playwright tests.

Scraping 5 6mo ago

Extract Member Profiles

by lycfyi

To start fresh, delete the profile files manually before extracting

CLI Tools 5 6mo ago

malware-forensics

by SherifEldeeb

Analyze malware samples for forensic investigation. Use when investigating malware infections, determining malware capabilities, extracting IOCs, or understanding attack techniques. Supports static and dynamic analysis of executables, scripts, and documents.

Automation 5 6mo ago

@tank/bdd-e2e-testing

by tankpkg

"BDD end-to-end testing against real systems. Covers web apps (Playwright), libraries (pytest-bdd + Docker), APIs, CLIs, message queues. Gherkin writing, step definitions, Page Objects, Screenplay, 3-layer architecture, CI/CD, multi-language (TypeScript, Python, Java, .NET). Triggers: BDD test, Gherkin, Cucumber, feature file, Given When Then, playwright-bdd, pytest-bdd, Behave, Cucumber-JVM, Serenity BDD, Reqnroll, Example Mapping, Three Amigos, living documentation, BDD setup, BDD architecture."

API Dev 1 4mo ago

skill-seekers

by jakerains

Convert documentation websites, GitHub repositories, and PDFs into Claude AI skills. Use when creating Claude skills from docs, scraping documentation, packaging websites into skills, or converting repos/PDFs to Claude knowledge.

Prompts 1 6mo ago

browserless

by microlinkhq

Automate websites with browserless and Puppeteer for screenshots, PDFs, HTML/text extraction, URL status checks, and Lighthouse audits. Use when the user mentions browserless, @browserless/cli, headless Chrome automation, Puppeteer wrappers, website screenshots, PDF generation from URLs, or extracting rendered page content.

CLI Tools 1 5mo ago

@tank/figma-to-code

by tankpkg

Use when implementing UI from Figma designs with pixel-perfect fidelity. Covers the complete Figma-to-code pipeline: extracting design specs via Figma MCP tools (get_design_context, get_screenshot, get_variable_defs), translating every visual property to exact CSS (fonts, colors, spacing, shadows, gradients, border-radius, opacity), building design token systems, mapping Auto Layout to Flexbox/Grid, converting component variants to props, and verifying implementation against Figma screenshots. Requires: Figma desktop MCP (figma-desktop) for design extraction. Synthesizes Figma Dev Mode API (2024-2025), CSS Specifications (Color L4, Flexbox, Grid, Container Queries), WAI-ARIA Authoring Practices, Google Fonts Best Practices, and Playwright Visual Testing. Trigger phrases: "implement this Figma", "Figma to code", "pixel perfect", "match the design", "copy the Figma", "design implementation", "implement this design", "build from Figma", "Figma handoff", "translate design to code", "code this component from Figma", "design to HTML", "design to React", "design to Tailwind", "match Figma exactly", "implement UI from design", "Figma screenshot", "convert design", "make it look like Figma", "same as design", "design specs", "implement mockup", "from Figma file"

Accessibility 1 4mo ago

webapp-testing

by icartsh

Playwright를 사용하여 로컬 웹 애플리케이션과 상호작용하고 테스트하기 위한 툴킷입니다. 프런트엔드 기능 검증, UI 동작 디버깅, 브라우저 스크린샷 캡처 및 브라우저 로그 확인을 지원합니다.

CI/CD 4 7mo ago

yt-dlp

by lwmxiaobei

Download videos and extract audio from various platforms using yt-dlp. Use when user provides a video URL, asks to download a video, or when conversation contains video links from YouTube, Twitter/X, Vimeo, TikTok, Instagram, etc.

Processing 4 6mo ago

web-to-markdown

by otrebu

Batch-process web pages via headless Playwright browser, extract HTML, convert to markdown using Turndown, and save to timestamped scratchpad file. Use when user asks to "capture these pages as markdown", "save web content", "fetch and convert webpages", or needs clean markdown from HTML. All URLs from one prompt → single file at docs/web-captures/<timestamp>.md.

Docs Gen 4 8mo ago

airflow-dag-patterns

by EngineerWithAI

Build production Apache Airflow DAGs with best practices for operators, sensors, testing, and deployment. Use when creating data pipelines, orchestrating workflows, or scheduling batch jobs.

Automation 4 7mo ago

tapestry

by silvainfm

Unified content extraction and action planning. Use when user says "tapestry <URL>", "weave <URL>", "help me plan <URL>", "extract and plan <URL>", "make this actionable <URL>", or similar phrases indicating they want to extract content and create an action plan. Automatically detects content type (YouTube video, article, PDF) and processes accordingly.

Automation 2 9mo ago