Test, lint, and report on AI agent tool use before shipping.
ToolSmith v1.0.0 is the stable public local CLI baseline for testing and linting AI agent tool use.
Before shipping your AI agent, test whether it knows how to use its tools.
ToolSmith helps developers define tool files, define task files, run local evals, score tool selection, inspect failure categories, lint confusing definitions, generate reports, compare runs, and import basic OpenAPI specs into ToolSmith tool definitions.
ToolSmith remains local-first and mock-agent-based in v1.0.0.
--fail-under and saved-run comparison.toolsmith --help
toolsmith --version
toolsmith init
toolsmith lint <path>
toolsmith eval <path>
toolsmith report
toolsmith compare <baseline-run> <current-run>
toolsmith import openapi <path> --out <path>
Local development usage remains:
npm run dev -- <command>
ToolSmith v1.0.0 does not:
Real model providers and real side-effect integrations are future optional work only.
tools.json and tasks.json do not yet have separately published JSON Schema files.v1.0.0 preparation does not publish to npm, push to GitHub, create release tags, or enable GitHub Pages deployment.
Future npm publishing requires explicit approval and package name availability checks.
v1.1.0 is planned as optional real model provider work. It should remain opt-in and must not add real tool side effects by default.