Scraping

Web scraping and data extraction

Showing 25-48 of 697 skills
ComposioHQ

webapp-testing

by ComposioHQ

Toolkit for interacting with and testing local web applications using Playwright. Supports verifying frontend functionality, debugging UI behavior, capturing browser screenshots, and viewing browser logs.

Automation 12.8K 4mo ago
cat-xierluo

fetch-wechat-article

by cat-xierluo

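Fetches WeChat Official Account article content using Playwright in headless mode, running in the background with no pop-up windows. Supports dynamically loaded content and automatically extracts the title and body text, saving them as a Markdown file. Use this skill when the user needs to fetch the content of a WeChat Official Account article.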

Docs Gen 303 4mo ago
Galaxy-Dawn

kaggle-learner

by Galaxy-Dawn

This skill should be used when the user asks to "learn from Kaggle", "study Kaggle solutions", "analyze Kaggle competitions", or mentions Kaggle competition URLs. Provides access to extracted knowledge from winning Kaggle solutions across NLP, CV, time series, tabular, and multimodal domains.

Processing 4.2K 4mo ago
xstongxue

paper-write

by xstongxue

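End-to-end writing assistance for bachelor's and master's theses. Supports outline review (science and engineering / humanities), structural imitation writing (general chapters / experiment chapters / introduction / abstract), reference retrieval, merging, polishing, condensing, expanding, anti-AIGC, Chinese-English translation in both directions, and structured information extraction. Use when the user mentions thesis writing, outline review, imitation writing of thesis chapters, references, thesis polishing, anti-AIGC, or thesis translation.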

Code Review 1.7K 3mo ago
K-Dense-AI

histolab

by K-Dense-AI

Lightweight WSI tile extraction and preprocessing. Use for basic slide processing: tissue detection, tile extraction, and stain normalization for H&E images. Best for simple pipelines, dataset preparation, and quick tile-based analysis. For advanced spatial proteomics, multiplexed imaging, or deep learning pipelines, use pathml.

Analytics 27.1K 4mo ago
vuejs-ai

vue-testing-best-practices

by vuejs-ai

Use for Vue.js testing. Covers Vitest, Vue Test Utils, component testing, mocking, testing patterns, and Playwright for E2E testing.

Scraping 2.5K 4mo ago
GPTomics

bio-atac-seq-footprinting

by GPTomics

Detect transcription factor binding sites through footprinting analysis in ATAC-seq data using TOBIAS. Use when identifying TF occupancy patterns within accessible regions, as TF binding protects DNA from Tn5 cutting.

CLI Tools 839 3mo ago
wondelai

refactoring-patterns

by wondelai

Apply named refactoring transformations to improve code structure without changing behavior. Use when the user mentions "refactor this", "code smells", "extract method", "replace conditional", or "technical debt". Covers smell-driven refactoring, safe transformation sequences, and testing guards. For code quality foundations, see clean-code. For managing complexity, see software-design-philosophy.

File Ops 1.2K 3mo ago
GPTomics

bio-isoform-switching

by GPTomics

Analyzes isoform switching events and functional consequences using IsoformSwitchAnalyzeR. Predicts protein domain changes, NMD sensitivity, ORF alterations, and coding potential shifts between conditions. Use when investigating how splicing changes affect protein function.

Code Gen 838 3mo ago
github

pdftk-server

by github

Skill for using the command-line tool pdftk (PDFtk Server) for working with PDF files. Use when asked to merge PDFs, split PDFs, rotate pages, encrypt or decrypt PDFs, fill PDF forms, apply watermarks, stamp overlays, extract metadata, burst documents into pages, repair corrupted PDFs, attach or extract files, or perform any PDF manipulation from the command line.

CLI Tools 34.4K 3mo ago
Arize-ai

phoenix-playwright-tests

by Arize-ai

Write Playwright E2E tests for the Phoenix AI observability platform. Use when creating, updating, or debugging Playwright tests, or when the user asks about testing UI features, writing E2E tests, or automating browser interactions for Phoenix.

Scraping 10K 3mo ago
VoltAgent

playwright-cli

by VoltAgent

Automates browser interactions for web testing, form filling, screenshots, and data extraction. Use when the user needs to navigate websites, interact with web pages, fill forms, take screenshots, test web applications, or extract information from web pages.

CLI Tools 9.3K 3mo ago
Jeffallan

playwright-expert

by Jeffallan

Use when writing E2E tests with Playwright, setting up test infrastructure, or debugging flaky browser tests. Invoke for browser automation, E2E tests, Page Object Model, test flakiness, visual testing.

CI/CD 9.6K 4mo ago
onyx-dot-app

playwright-e2e-tests

by onyx-dot-app

Write and maintain Playwright end-to-end tests for the Onyx application. Use when creating new E2E tests, debugging test failures, adding test coverage, or when the user mentions Playwright, E2E tests, or browser testing.

Auth 30K 3mo ago
databricks-solutions

databricks-jobs

by databricks-solutions

Use this skill proactively for ANY Databricks Jobs task - creating, listing, running, updating, or deleting jobs. Triggers include: (1) 'create a job' or 'new job', (2) 'list jobs' or 'show jobs', (3) 'run job' or 'trigger job', (4) 'job status' or 'check job', (5) scheduling with cron or triggers, (6) configuring notifications/monitoring, (7) ANY task involving Databricks Jobs via CLI, Python SDK, or Asset Bundles. ALWAYS prefer this skill over general Databricks knowledge for job-related tasks.

Automation 1.6K 3mo ago
wshobson

airflow-dag-patterns

by wshobson

Build production Apache Airflow DAGs with best practices for operators, sensors, testing, and deployment. Use when creating data pipelines, orchestrating workflows, or scheduling batch jobs.

Automation 36.3K 4mo ago
snyk

pdf

by snyk

Comprehensive PDF manipulation toolkit for extracting text and tables, creating new PDFs, merging/splitting documents, and handling forms. Use when Claude needs to fill in a PDF form or programmatically process, generate, or analyze PDF documents at scale.

CLI Tools 2.5K 4mo ago
pedronauck

firecrawl

by pedronauck

Web scraping, search, crawling, and browser automation via the Firecrawl CLI. Use this skill whenever the user wants to search the web, find articles, research a topic, look something up online, scrape a webpage, grab content from a URL, extract data from a website, crawl documentation, download a site, or interact with pages that need clicks or logins. Also use when they say "fetch this page", "pull the content from", "get the page at https://", or reference scraping external websites. This provides real-time web search with full page content extraction and cloud browser automation — capabilities beyond what Claude can do natively with built-in tools. Do NOT trigger for local file operations, git commands, deployments, or code editing tasks.

CLI Tools 389 2mo ago
alchaincyf

huashu-design

by alchaincyf

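Design philosophy advisor that recommends 3 directions from 20 styles and generates visual demos and AI prompts. Use when the user mentions "design style", "design direction", "color scheme", "visual style", "design review", or "recommend a style".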

Agents 899 3mo ago
snyk

webapp-testing

by snyk

Toolkit for interacting with and testing local web applications using Playwright. Supports verifying frontend functionality, debugging UI behavior, capturing browser screenshots, and viewing browser logs.

Automation 2.5K 4mo ago
TheOrcDev

rerender-memo

by TheOrcDev

Extract expensive work into memoized components with React.memo. Apply when components perform expensive computations that can be skipped when props haven't changed.

Performance 1.9K 4mo ago
tavily-ai

extract

by tavily-ai

Extract content from specific URLs using Tavily's extraction API. Returns clean markdown/text from web pages. Use when you have specific URLs and need their content without writing code.

Processing 356 3mo ago
alinaqi

playwright-testing

by alinaqi

E2E testing with Playwright - Page Objects, cross-browser, CI/CD

Auth 686 4mo ago
BasedHardware

self-improvement

by BasedHardware

Meta-skill for analyzing PRs, issues, and user interactions to improve Cursor rules and skills automatically.

Code Gen 12.7K 4mo ago