Scraping

Web scraping and data extraction

Showing 337-360 of 698 skills
AIDotNet

puppeteer

by AIDotNet

使用Puppeteer(Google)进行浏览器自动化和PDF生成。支持无头Chrome控制,用于网页爬虫、截图、PDF生成和自动化测试。

Scraping 80 4mo ago
AIDotNet

playwright

by AIDotNet

微软开发的跨浏览器自动化框架。使用单一API支持Chromium、Firefox和WebKit,用于测试、爬虫和自动化。

Scraping 80 4mo ago
wasintoh

Test Engineer Skill

by wasintoh

```

Debugging 79 5mo ago
michaelboeding

pdf

by michaelboeding

Comprehensive PDF manipulation toolkit for extracting text and tables, creating new PDFs, merging/splitting documents, and handling forms. When Claude needs to fill in a PDF form or programmatically process, generate, or analyze PDF documents at scale.

CLI Tools 13 4mo ago
martinholovsky

browser-automation

by martinholovsky

"Expert in browser automation using Chrome DevTools Protocol (CDP) and WebDriver. Specializes in secure web automation, testing, and scraping with proper credential handling, domain restrictions, and audit logging. HIGH-RISK skill due to web access and data handling."

Auth 38 6mo ago
Xueheng-Li

chat-history-summarizer

by Xueheng-Li

Extract and summarize Claude Code chat history into structured documentation. Use when the user asks to export, summarize, or document a conversation session, extract prompts and actions from chat logs, or create a record of what was accomplished in a session.

Auth 43 4mo ago
organvm-iv-taxis

pdf

by organvm-iv-taxis

Comprehensive PDF manipulation toolkit for extracting text and tables, creating new PDFs, merging/splitting documents, and handling forms. When Claude needs to fill in a PDF form or programmatically process, generate, or analyze PDF documents at scale.

CLI Tools 6 4mo ago
kernel

kernel-typescript-sdk

by kernel

Build browser automation scripts using the Kernel TypeScript SDK with Playwright, CDP, and remote browser management.

Code Gen 5 3mo ago
manutej

apache-airflow-orchestration

by manutej

Complete guide for Apache Airflow orchestration including DAGs, operators, sensors, XComs, task dependencies, dynamic workflows, and production deployment

Automation 57 7mo ago
YuniorGlez

e2e-testing-expert

by YuniorGlez

Senior End-to-End (E2E) Test Architect for 2026. Specialized in Playwright orchestration, visual regression testing, and high-performance CI/CD sharding. Expert in building resilient, auto-waiting test suites using the Page Object Model (POM), automated accessibility auditing (Axe-core), and deep-trace forensic debugging.

Accessibility 11 4mo ago
YuniorGlez

pdf-pro

by YuniorGlez

"Master of PDF engineering, specialized in AI-driven extraction, high-fidelity Generation (Puppeteer), and PDF 2.0 Security."

Scraping 11 4mo ago
joabgonzalez

e2e-testing

by joabgonzalez

"End-to-end testing patterns and best practices. Trigger: When writing or reviewing E2E tests for any layer."

CI/CD 6 3mo ago
joabgonzalez

playwright

by joabgonzalez

"Cross-browser E2E testing with Playwright. Trigger: When writing or running end-to-end tests with Playwright."

Auth 6 3mo ago
yoanbernabeu

supabase-extract-anon-key

by yoanbernabeu

Extract the Supabase anon/public API key from client-side code. This key is expected in client apps but important for RLS testing.

Processing 43 4mo ago
tinyfish-io

tinyfish

by tinyfish-io

Use TinyFish web agent to extract/scrape websites, extract data, and automate browser actions using natural language. Use when you need to extract/scrape data from websites, handle bot-protected sites, or automate web tasks.

API Dev 44 3mo ago
agentbay-ai

agentbay-monitor-skills

by agentbay-ai

舆情监控技能,最终产出舆情报告。当用户问「某事件/话题舆情如何」「舆论怎么样」「做舆情分析」「运行舆情分析」或按关键词/平台爬取并生成舆情报告时,使用本技能。约定:凡舆情相关意图即执行全流程(爬取→情感分析→生成报告)。爬取由本技能完成;情感分析由主 Agent 按提示词自主判断;报告由 generate_report 生成。

Agents 40 3mo ago
agentbay-ai

Web Scraper

by agentbay-ai

Linting 40 3mo ago
triggerdotdev

trigger-config

by triggerdotdev

Configure Trigger.dev projects with trigger.config.ts. Use when setting up build extensions for Prisma, Playwright, FFmpeg, Python, or customizing deployment settings.

Database 27 4mo ago
v1-io

complexity

by v1-io

Use when reducing cognitive complexity, flattening nested code, or simplifying functions. Triggers on "reduce complexity", "simplify", "too nested".

Code Review 5 4mo ago
bnadlerjr

using-playwright-cli

by bnadlerjr

Browser automation via playwright-cli (microsoft/playwright-cli), the AI-agent-focused CLI for controlling browsers through Bash commands. Covers element reference system, snapshot workflow, session management, cookies, storage, network interception, and content capture. Use when the user asks to automate a browser, scrape a webpage, fill a form, test a UI flow interactively, capture screenshots, manage cookies/storage, intercept network requests, or drive any browser interaction from the terminal. NOT for npx playwright test runner or codegen.

CLI Tools 5 3mo ago
v1-io

e2e-testing

by v1-io

Use when implementing E2E tests, debugging flaky tests, testing web applications with Playwright, or establishing E2E testing standards. Triggers on "e2e test", "end-to-end", "Playwright", "flaky test", "browser test".

Auth 5 4mo ago
ncklrs

remotion-best-practices

by ncklrs

Best practices for Remotion - Video creation in React

Animation 26 4mo ago
nonameplum

swift-async-stream-patterns

by nonameplum

Patterns and best practices for building robust AsyncStream and AsyncSequence types, learned from swift-async-algorithms.

Code Gen 13 3mo ago
run-llama

Extract structured data from unstructured files (PDF, PPTX, DOCX...)

by run-llama

Invoke this skill BEFORE implementing any structured data extraction from documents to learn the correct llama_cloud_services API usage. Required reading before writing extraction code. Requires llama_cloud_services package and LLAMA_CLOUD_API_KEY as an environment variable.

API Dev 176 7mo ago