skill-up: a command-line framework for benchmarking LLM agent learning curves and toolbox growth
skill-up is a command-line evaluation framework that quantifies the learning curves of large language model (LLM) agents by tracking three things: adaptation speed, error correction, and toolbox growth. It targets AI developers who need reproducible, comparable benchmarks for autonomous agents.
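To make the three tracked quantities concrete, here is a minimal sketch of how they might be computed from per-episode results. It does not show skill-up's actual data format or metric definitions, which are not described here. The `Episode` record, the function names, and the rolling-window threshold rule are all assumptions made for illustration.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class Episode:
    # Hypothetical per-episode record; not skill-up's actual schema.
    success: bool          # did the agent complete the task?
    errors_made: int       # errors observed during the episode
    errors_corrected: int  # errors the agent recovered from on its own
    tools_available: int   # size of the agent's toolbox at episode end


def episodes_to_threshold(episodes: List[Episode], window: int = 3,
                          threshold: float = 0.8) -> Optional[int]:
    """Assumed proxy for adaptation speed: the number of episodes seen when
    the success rate over the last `window` episodes first reaches
    `threshold`. Returns None if it never does."""
    for end in range(window, len(episodes) + 1):
        recent = episodes[end - window:end]
        rate = sum(e.success for e in recent) / window
        if rate >= threshold:
            return end
    return None


def error_correction_rate(episodes: List[Episode]) -> Optional[float]:
    """Fraction of all observed errors that were corrected.
    Returns None when no errors were made at all."""
    made = sum(e.errors_made for e in episodes)
    corrected = sum(e.errors_corrected for e in episodes)
    return corrected / made if made else None


def toolbox_growth(episodes: List[Episode]) -> int:
    """Change in toolbox size between the first and last episode."""
    if not episodes:
        return 0
    return episodes[-1].tools_available - episodes[0].tools_available


# Made-up illustrative data, not real benchmark results.
demo = [
    Episode(False, 3, 1, 2),
    Episode(False, 2, 1, 2),
    Episode(True, 1, 1, 3),
    Episode(True, 1, 1, 4),
    Episode(True, 0, 0, 4),
]

if __name__ == "__main__":
    print("episodes to threshold:", episodes_to_threshold(demo))
    print("error correction rate:", error_correction_rate(demo))
    print("toolbox growth:", toolbox_growth(demo))
```

A rolling window is only one possible way to define adaptation speed. Fitting a curve to the per-episode success rate would be another, and the sketch makes no claim about which approach skill-up takes.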