llm-evaluation

by wshobson

29985

Updated 3/3/2026

Implement comprehensive evaluation strategies for LLM applications using automated metrics, human feedback, and benchmarking. Use when testing LLM performance, measuring AI application quality, or establishing evaluation frameworks.

Related Skills

shellcheck-configuration

wshobson

bats-testing-patterns

wshobson

bash-defensive-patterns

wshobson