content-moderation-api

Content moderation API integration using OpenAI Moderation, Perspective API, and others

a5c-ai 1,569 92 Updated 4mo ago

Install

npx skillscat add a5c-ai/babysitter/content-moderation-api

Install via the SkillsCat registry.

SKILL.md

Content Moderation API Skill

Capabilities

Integrate OpenAI Moderation API
Set up Perspective API for toxicity detection
Configure moderation thresholds
Implement content filtering pipelines
Design moderation response handling
Create moderation logging and reporting

Target Processes

content-moderation-safety
system-prompt-guardrails

Implementation Details

Moderation APIs

OpenAI Moderation: Hate, violence, self-harm, sexual content
Perspective API: Toxicity, insult, profanity, threat
Azure Content Safety: Text and image moderation
Llama Guard: Open-weight safety classifier from Meta
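
The services above return scores in different shapes, so a pipeline usually normalizes them into one mapping before applying thresholds. The following is a minimal sketch, not an official client. It assumes the Perspective `comments:analyze` JSON layout (`attributeScores.<ATTRIBUTE>.summaryScore.value`) and an OpenAI moderation result represented as a plain dict with a `category_scores` key. The function names are hypothetical and chosen only for illustration.

```python
from typing import Any, Dict, Iterable


def build_perspective_request(text: str, attributes: Iterable[str]) -> Dict[str, Any]:
    """Build the JSON body for a Perspective `comments:analyze` request."""
    return {
        "comment": {"text": text},
        # Each requested attribute maps to an (empty) options object.
        "requestedAttributes": {name: {} for name in attributes},
    }


def parse_perspective_scores(response: Dict[str, Any]) -> Dict[str, float]:
    """Flatten a Perspective response into {lowercased_attribute: score}."""
    scores: Dict[str, float] = {}
    for name, payload in response.get("attributeScores", {}).items():
        scores[name.lower()] = float(payload["summaryScore"]["value"])
    return scores


def parse_openai_scores(result: Dict[str, Any]) -> Dict[str, float]:
    """Flatten one OpenAI moderation result (assumed to be a dict) into {category: score}."""
    return {name: float(value) for name, value in result.get("category_scores", {}).items()}
```

Because both parsers return the same `{category: score}` shape, the threshold and action logic downstream does not need to know which service produced the scores.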

Configuration Options

API credentials and endpoints
Category thresholds
Action policies (block, warn, flag)
Logging configuration
Fallback behavior
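
One way to hold these options together is a small config object with a decision function. This is a sketch under stated assumptions. The class and field names are hypothetical, the threshold values are placeholders rather than recommendations, and the severity order (flag, then warn, then block) is an assumption the source does not specify.

```python
from dataclasses import dataclass, field
from enum import Enum
from typing import Dict, Optional


class Action(str, Enum):
    ALLOW = "allow"
    FLAG = "flag"
    WARN = "warn"
    BLOCK = "block"


# Assumed severity order, used to pick the strictest action across categories.
_SEVERITY = [Action.ALLOW, Action.FLAG, Action.WARN, Action.BLOCK]


@dataclass
class ModerationConfig:
    # Placeholder per-category cutoffs; tune these against your own data.
    thresholds: Dict[str, float] = field(
        default_factory=lambda: {"toxicity": 0.8, "threat": 0.5}
    )
    # Action to take when a category meets or exceeds its threshold.
    actions: Dict[str, Action] = field(
        default_factory=lambda: {"toxicity": Action.WARN, "threat": Action.BLOCK}
    )
    default_action: Action = Action.FLAG   # category has a threshold but no explicit action
    fallback_action: Action = Action.FLAG  # used when no scores are available (API failure)


def decide(scores: Optional[Dict[str, float]], config: ModerationConfig) -> Action:
    """Return the strictest action triggered by the scores, or the fallback if scores is None."""
    if scores is None:
        return config.fallback_action
    decision = Action.ALLOW
    for category, score in scores.items():
        threshold = config.thresholds.get(category)
        if threshold is None or score < threshold:
            continue  # unconfigured categories and sub-threshold scores are ignored
        action = config.actions.get(category, config.default_action)
        if _SEVERITY.index(action) > _SEVERITY.index(decision):
            decision = action
    return decision
```

Passing `None` for `scores` models the fallback case: the caller catches the API error and lets the configured `fallback_action` decide what happens to the content.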

Best Practices

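Set thresholds per category rather than using one global cutoff
Handle edge cases gracefully (empty input, API errors and timeouts)
Log every moderation decision together with the scores that produced it
Review thresholds regularly against the logged decisions
Layer multiple moderation checks so one failing or missing classifier does not leave content unchecked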
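
The practices above can be combined in a small layered pipeline. This is a minimal sketch, assuming each layer is a callable that returns `{category: score}`. The `moderate` function, its parameters, and the single shared threshold are hypothetical simplifications. Only scores are logged, not the text itself.

```python
import logging
from typing import Callable, Dict, List, Tuple

logger = logging.getLogger("moderation")

# A layer is any callable that maps text to {category: score}.
Scorer = Callable[[str], Dict[str, float]]


def moderate(
    text: str,
    layers: List[Tuple[str, Scorer]],
    threshold: float = 0.8,
    fail_closed: bool = False,
) -> Dict[str, object]:
    """Run text through each layer in order; stop at the first layer that flags it."""
    if not text.strip():
        # Edge case: nothing to moderate.
        return {"allowed": True, "layer": None, "reason": "empty"}

    for name, scorer in layers:
        try:
            scores = scorer(text)
        except Exception:
            # A failing layer is logged; fail_closed decides whether to reject or move on.
            logger.exception("layer %s failed", name)
            if fail_closed:
                return {"allowed": False, "layer": name, "reason": "error"}
            continue

        hits = sorted(c for c, s in scores.items() if s >= threshold)
        logger.info("layer=%s scores=%s hits=%s", name, scores, hits)
        if hits:
            return {"allowed": False, "layer": name, "reason": ",".join(hits)}

    return {"allowed": True, "layer": None, "reason": "clean"}
```

Ordering cheap local checks before remote API calls keeps latency and cost down. Whether to set `fail_closed` depends on whether a missed violation or a wrongly rejected message is the more costly mistake for the application.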

Dependencies

openai
google-api-python-client (Perspective API, via the commentanalyzer discovery service)
azure-ai-contentsafety