SkillsCat
SkillsCat
Trending
Docs
TongmingLAIC

TongmingLAIC

@TongmingLAIC Organization

GitHub
1 Skills
220 Total Stars
June 2026 Joined

Public Skills

ako4all

by TongmingLAIC

Drive an agentic loop that iteratively optimizes a GPU kernel for maximum speedup. Use this skill whenever the user wants to optimize / speed up / benchmark a GPU kernel (CUDA, Triton, TileLang, C++, Python), mentions AKO / AKO4ALL / AKO4X / agentic kernel optimization, asks to "make this kernel faster", or has a kernel they want measured against a PyTorch reference. The skill handles setup, profiling (ncu), correctness checking, iteration logging, and git commits. Bootstraps a workspace in any directory the user points at.

CLI Tools 220 3d ago
SkillsCat
SkillsCat

An open platform for discovering, sharing, and installing AI agent skills.

Discover

  • Trending
  • Recently Added
  • Top Rated
  • Categories

Resources

  • Docs
  • Claude Code Docs
  • Claude Code GitHub
  • Cursor

Legal

  • Privacy Policy
  • Terms of Service

© 2026 SkillsCat. Open source under AGPL-3.0.

Docs · Privacy · Terms