agent-vision
by rvanbaalen
"Use agent-vision to see and interact with the user's screen. Gives you eyes (screenshots) and hands (mouse, keyboard, element clicking) on any macOS window. Use this skill whenever you need to visually inspect a UI, take screenshots for feedback loops, click buttons, fill forms, navigate applications, or do anything that requires seeing what's on screen. Also use when the user mentions agent-vision, visual feedback, screen capture for development, or UI interaction."