Tools: add AI Eval Forge (Core) and Agent Stack (Agent Harnesses) #11

Open
MukundaKatta wants to merge 1 commit into Vvkmnn:main from MukundaKatta:add-ai-eval-forge-and-agent-stack

Conversation

@MukundaKatta

Two additions:

Core Frameworks → AI Eval Forge: mixed-check regression testing for LLM and agent workflows. It combines deterministic checks, rubric-style review, and artifact tracking, and is backed by a preprint on Zenodo (DOI 10.5281/zenodo.20044318) and SSRN.

Application and Agent Harnesses → Agent Stack: five small zero-dependency npm libraries that address concrete reliability gaps for tool-using agents (agentvet, agentguard, agentsnap, agentfit, agentcast). Inserted alphabetically between Agentrial and Athina.

Both projects are MIT-licensed, published on npm under @mukundakatta, and listed on the official MCP Registry.

