Scraping

Web scraping and data extraction

Showing 505-528 of 697 skills
AConfusedBoi

ogie

by AConfusedBoi

Extract OpenGraph, Twitter Cards, and metadata from URLs or HTML. Use when building link previews, SEO tools, or scraping webpage metadata.

Processing 1 4mo ago
DUZ1287

web-auto-form

by DUZ1287

JSON 驱动的浏览器表单自动化工具,为 AI Agent 提供原生 function-calling 集成,支持表单填写、条件分支、数据提取与 PII 脱敏

Debugging 1 8d ago
tankpkg

@tank/figma-to-code

by tankpkg

Use when implementing UI from Figma designs with pixel-perfect fidelity. Covers the complete Figma-to-code pipeline: extracting design specs via Figma MCP tools (get_design_context, get_screenshot, get_variable_defs), translating every visual property to exact CSS (fonts, colors, spacing, shadows, gradients, border-radius, opacity), building design token systems, mapping Auto Layout to Flexbox/Grid, converting component variants to props, and verifying implementation against Figma screenshots. Requires: Figma desktop MCP (figma-desktop) for design extraction. Synthesizes Figma Dev Mode API (2024-2025), CSS Specifications (Color L4, Flexbox, Grid, Container Queries), WAI-ARIA Authoring Practices, Google Fonts Best Practices, and Playwright Visual Testing. Trigger phrases: "implement this Figma", "Figma to code", "pixel perfect", "match the design", "copy the Figma", "design implementation", "implement this design", "build from Figma", "Figma handoff", "translate design to code", "code this component from Figma", "design to HTML", "design to React", "design to Tailwind", "match Figma exactly", "implement UI from design", "Figma screenshot", "convert design", "make it look like Figma", "same as design", "design specs", "implement mockup", "from Figma file"

Accessibility 1 2mo ago
sumik5

developing-react

by sumik5

React 19.x development guide covering internals (rendering, reconciliation, Fiber), performance optimization (47+ react-doctor rules, memoization, bundle size), UI animation patterns (CSS transitions, easing, hover/touch), and React Testing Library (RTL queries, interactions, TDD patterns). Use when package.json contains 'react' (without 'next'), or when working on React-specific concerns in any framework. For Next.js-specific features (App Router, Server Components, Cache Components), use developing-nextjs instead. For E2E testing with Playwright, use testing-e2e-with-playwright. For general testing methodology, use testing-code.

Scraping 1 3mo ago
famaoai-creator

browser-navigator

by famaoai-creator

Automates browser actions using Playwright CLI. Can record, replay, and generate browser automation scenarios stored in the knowledge base. Useful for UI testing, data extraction, and visual auditing.

Code Gen 1 3mo ago
michelg10

PDF Processing

by michelg10

Comprehensive PDF manipulation toolkit for extracting text and tables,

CLI Tools 6 8mo ago
thoreinstein

refactor

by thoreinstein

Analyze code and suggest refactoring opportunities with blast radius assessment, risk evaluation, and recommended order of operations

Refactoring 3 4mo ago
1uokun

xiaohongshu-skill

by 1uokun

小红书内容发布技能,提供检查登录状态和发布图文内容的功能。不依赖MCP,使用内置JavaScript脚本执行小红书相关操作。

API Dev 3 4mo ago
icartsh

webapp-testing

by icartsh

Playwright를 사용하여 로컬 웹 애플리케이션과 상호작용하고 테스트하기 위한 툴킷입니다. 프런트엔드 기능 검증, UI 동작 디버깅, 브라우저 스크린샷 캡처 및 브라우저 로그 확인을 지원합니다.

CI/CD 3 5mo ago
tapania

5d-reflect

by tapania

"Extract learning from completed work to improve future cycles. Use when: (1) After VERIFY phase shows feature complete, (2) End of development iteration, (3) User asks for 'retrospective,' 'lessons learned,' or 'what went well/wrong,' (4) Before starting next major feature. This phase prevents repeating mistakes and compounds learning."

Code Gen 3 4mo ago
horuz-ai

pdf

by horuz-ai

Comprehensive PDF manipulation toolkit for extracting text and tables, creating new PDFs, merging/splitting documents, and handling forms. When Claude needs to fill in a PDF form or programmatically process, generate, or analyze PDF documents at scale.

CLI Tools 3 5mo ago
C0ntr0lledCha0s

playwright-testing

by C0ntr0lledCha0s

Automatically activated when user works with Playwright tests, mentions Playwright configuration, asks about selectors/locators/page objects, or has files matching *.spec.ts in e2e or tests directories. Provides Playwright-specific expertise for E2E and integration testing.

Auth 3 6mo ago
spitoglou

research-intelligence

by spitoglou

Extract insights, analyze claims, and synthesize knowledge from research content. Use when processing academic papers, articles, podcasts, videos, meeting transcripts, or any content where the goal is to extract wisdom, analyze arguments, summarize findings, or compile references. Triggers include "analyze this paper", "extract key insights", "summarize the research", "what are the main claims", "extract wisdom from", "compile references", "critique this argument".

Academic 3 4mo ago
bahayonghang

pdf-analysis

by bahayonghang

This skill should be used when the user asks to "解析PDF", "解读文档", "分析PDF文件", "PDF解读", "extract content from PDF", "analyze PDF document", "parse academic paper", or provides a PDF file path for content extraction and analysis. Provides comprehensive PDF document analysis and content extraction capabilities for WeChat content creation.

Academic 3 5mo ago
dy9759

PDF Processing

by dy9759

Extract text and tables from PDF files, fill forms, merge documents. Use when working with PDF files or when the user mentions PDFs, forms, or document extraction.

Processing 5 6mo ago
jakerains

skill-seekers

by jakerains

Convert documentation websites, GitHub repositories, and PDFs into Claude AI skills. Use when creating Claude skills from docs, scraping documentation, packaging websites into skills, or converting repos/PDFs to Claude knowledge.

Prompts 1 4mo ago
manastalukdar

e2e-generate

by manastalukdar

Generate end-to-end tests with Playwright browser automation

Code Gen 1 4mo ago
rarestg

cf-browser

by rarestg

Browse and scrape websites using Cloudflare's Browser Rendering REST API. Use when the agent needs to fetch rendered web content, extract structured data from pages, take screenshots, or scrape specific elements via CSS selectors. Triggers on tasks like "scrape this site", "get listings from this page", "extract data from this URL", "take a screenshot of this page", "browse this website", or any task requiring headless browser access to read, crawl, or extract information from live web pages. Also use when WebFetch is insufficient (JS-heavy sites, SPAs, pages requiring cookies, or when structured extraction is needed).

API Dev 1 3mo ago
DobroslavRadosavljevic

ogie

by DobroslavRadosavljevic

Extract OpenGraph, Twitter Cards, and metadata from URLs or HTML. Use when building link previews, SEO tools, or scraping webpage metadata.

Processing 1 4mo ago
otrebu

web-to-markdown

by otrebu

Batch-process web pages via headless Playwright browser, extract HTML, convert to markdown using Turndown, and save to timestamped scratchpad file. Use when user asks to "capture these pages as markdown", "save web content", "fetch and convert webpages", or needs clean markdown from HTML. All URLs from one prompt → single file at docs/web-captures/<timestamp>.md.

Docs Gen 4 7mo ago
QuestForTech-Investments

pdf

by QuestForTech-Investments

Comprehensive PDF manipulation toolkit for extracting text and tables, creating new PDFs, merging/splitting documents, and handling forms. When Claude needs to fill in a PDF form or programmatically process, generate, or analyze PDF documents at scale.

CLI Tools 4 6mo ago
QuestForTech-Investments

Playwright Browser Automation

by QuestForTech-Investments

Complete browser automation with Playwright. Auto-detects dev servers, writes clean test scripts to /tmp. Test pages, fill forms, take screenshots, check responsive design, validate UX, test login flows, check links, automate any browser task. Use when user wants to test websites, automate browser interactions, validate web functionality, or perform any browser-based testing.

Automation 4 6mo ago
silvainfm

tapestry

by silvainfm

Unified content extraction and action planning. Use when user says "tapestry <URL>", "weave <URL>", "help me plan <URL>", "extract and plan <URL>", "make this actionable <URL>", or similar phrases indicating they want to extract content and create an action plan. Automatically detects content type (YouTube video, article, PDF) and processes accordingly.

Automation 2 7mo ago
fast-gateway-protocol

ffmpeg-daemon

by fast-gateway-protocol

Fast video/audio processing via FGP daemon - 5-20x faster than spawning ffmpeg per operation. Use when user needs to convert videos, extract audio, trim clips, resize, add watermarks, or transcode. Triggers on "convert video", "extract audio", "trim video", "compress video", "ffmpeg", "video editing", "transcode".

CLI Tools 2 4mo ago