ToolSmith

Test, lint, and report on AI agent tool use before shipping.

Project on GitHub: landon-personal/toolsmith

ToolSmith v1.0.0 Release Notes

ToolSmith v1.0.0 is the stable, public baseline of the local CLI for testing and linting AI agent tool use.

Before shipping your AI agent, test whether it knows how to use its tools.

Summary

ToolSmith helps developers define tool files, define task files, run local evals, score tool selection, inspect failure categories, lint confusing definitions, generate reports, compare runs, and import basic OpenAPI specs into ToolSmith tool definitions.

ToolSmith remains local-first and mock-agent-based in v1.0.0.
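To make "mock-agent-based" and "score tool selection" concrete, here is a minimal TypeScript sketch of the idea. It is an illustration under assumptions, not ToolSmith's implementation: the `ToolDef`, `Task`, and `EvalResult` shapes, the `keywordAgent` heuristic, and the `scoreToolSelection` function are all hypothetical names invented for this example and do not reflect ToolSmith's actual file schema or API.

```typescript
// Hypothetical shapes for illustration only; NOT ToolSmith's real schema.
interface ToolDef {
  name: string;
  description: string;
}

interface Task {
  prompt: string;
  expectedTool: string;
}

// A mock agent is a pure local function: no model calls, no side effects.
type MockAgent = (prompt: string, tools: ToolDef[]) => string | null;

interface EvalResult {
  total: number;
  correct: number;
  accuracy: number;
  failures: { prompt: string; expected: string; actual: string | null }[];
}

// Assumed heuristic: pick the tool whose description shares the most words
// with the prompt. Returns null when no description overlaps at all.
const keywordAgent: MockAgent = (prompt, tools) => {
  const words = new Set(prompt.toLowerCase().split(/\W+/).filter(Boolean));
  let best: string | null = null;
  let bestScore = 0;
  for (const tool of tools) {
    const score = tool.description
      .toLowerCase()
      .split(/\W+/)
      .filter((w) => words.has(w)).length;
    if (score > bestScore) {
      best = tool.name;
      bestScore = score;
    }
  }
  return best;
};

// Run every task through the agent and record where the selected tool
// differs from the expected one.
function scoreToolSelection(
  agent: MockAgent,
  tools: ToolDef[],
  tasks: Task[],
): EvalResult {
  const failures: EvalResult["failures"] = [];
  for (const task of tasks) {
    const actual = agent(task.prompt, tools);
    if (actual !== task.expectedTool) {
      failures.push({ prompt: task.prompt, expected: task.expectedTool, actual });
    }
  }
  const correct = tasks.length - failures.length;
  const accuracy = tasks.length === 0 ? 0 : correct / tasks.length;
  return { total: tasks.length, correct, accuracy, failures };
}

const tools: ToolDef[] = [
  { name: "get_weather", description: "Look up the current weather forecast for a city" },
  { name: "send_email", description: "Send an email message to a recipient" },
];

const tasks: Task[] = [
  { prompt: "What is the weather in Paris?", expectedTool: "get_weather" },
  { prompt: "Email the report to Dana", expectedTool: "send_email" },
  { prompt: "Book a flight to Tokyo", expectedTool: "book_flight" },
];

const result = scoreToolSelection(keywordAgent, tools, tasks);
console.log(`${result.correct}/${result.total} tasks selected the expected tool`);
console.log(result.failures);
```

In this sketch the third task fails because no matching tool is defined, so the mock agent falls back to the closest-looking description. An eval of tool selection is meant to surface exactly that kind of mismatch before any real model or real side effect is involved.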

Major Features

Stable Commands

toolsmith --help
toolsmith --version
toolsmith init
toolsmith lint <path>
toolsmith eval <path>
toolsmith report
toolsmith compare <baseline-run> <current-run>
toolsmith import openapi <path> --out <path>

Local development usage remains:

npm run dev -- <command>
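As a rough illustration of what comparing a baseline run against a current run can involve, the TypeScript sketch below diffs two sets of per-task pass/fail results. The `RunResults` and `RunComparison` shapes and the `compareRuns` function are assumptions made for this example; they are not the output format or internals of `toolsmith compare`.

```typescript
// Hypothetical run shape: task id -> whether the expected tool was selected.
// This is NOT ToolSmith's actual run file format.
type RunResults = Record<string, boolean>;

interface RunComparison {
  regressions: string[]; // passed in baseline, failed in current
  fixes: string[];       // failed in baseline, passed in current
  added: string[];       // present only in current
  removed: string[];     // present only in baseline
}

function compareRuns(baseline: RunResults, current: RunResults): RunComparison {
  const cmp: RunComparison = { regressions: [], fixes: [], added: [], removed: [] };

  for (const [id, passed] of Object.entries(current)) {
    if (!(id in baseline)) {
      cmp.added.push(id);
    } else if (baseline[id] && !passed) {
      cmp.regressions.push(id);
    } else if (!baseline[id] && passed) {
      cmp.fixes.push(id);
    }
  }

  for (const id of Object.keys(baseline)) {
    if (!(id in current)) {
      cmp.removed.push(id);
    }
  }

  return cmp;
}

const baselineRun: RunResults = { t1: true, t2: false, t3: true };
const currentRun: RunResults = { t1: false, t2: true, t3: true, t4: true };

const comparison = compareRuns(baselineRun, currentRun);
console.log(comparison);
```

Separating regressions from fixes, rather than reporting only an overall score change, keeps a newly broken task from being hidden by an unrelated improvement elsewhere.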

Safety Model

ToolSmith v1.0.0 does not include real model providers or real side-effect integrations. Both are future optional work only.

Known Limitations

Publishing Status

Preparing the v1.0.0 release does not publish to npm, push to GitHub, create release tags, or enable GitHub Pages deployment.

Future npm publishing requires explicit approval and package name availability checks.

Next Planned Version

v1.1.0 is planned as optional real model provider work. It should remain opt-in and must not add real tool side effects by default.