Search Skills | Shipables

4 results for “evals”

v0.1.0

Evaluate, score, and systematically improve prompts in the codebase. Identifies weak prompts, generates test cases, scores outputs, and proposes optimized versions. Use when the user says "improve this prompt", "why is the AI doing X", "eval my prompts", or "optimize the agent".

Agent Skills

0/wk

Ngeeslin/poker-strategy

v1.0.1

Texas Hold'em poker intelligence — hand evaluation, pot odds, position strategy, and live player profiling for AI agents

Agent Skills

0/wkUpdated 3 months ago

Ngeeslininanalytics

Trolleroof/autolab-mani-skill

v1.0.0

Guides agents through autonomous ManiSkill and VSLAM evaluation, tuning, verification, memory storage, and summary writing using the Autolab MCP server and repo tooling.

Agent Skills

0/wkUpdated 3 months ago

Trolleroofinrobotics

yashpatel5400/pde-design-optimizer

v1.0.0

Iteratively optimize thermal designs by solving 2D heat equations (Poisson PDE). Parameterize heat source placement and material conductivity, simulate temperature distributions, evaluate performance metrics, and propose improvements autonomously. Use when the user asks to design, optimize, or analyze heat sinks, thermal layouts, cooling systems, or any steady-state thermal problem on a 2D domain.

Agent Skills

0/wkUpdated 3 months ago

yashpatel5400