Skip to content
@evalops

EvalOps

EvalOps is an AI testing and monitoring platform that helps engineering teams ship reliable AI features with confidence.

Popular repositories Loading

  1. cognitive-dissonance-dspy cognitive-dissonance-dspy Public

    A multi-agent LLM system for detecting and resolving cognitive dissonance.

    Python 276 22

  2. dspy-0to1-guide dspy-0to1-guide Public

    A comprehensive 0-to-1 guide for building self-improving LLM applications with DSPy framework

    Python 211 15

  3. deep-code-reasoning-mcp deep-code-reasoning-mcp Public

    A Model Context Protocol (MCP) server that provides advanced code analysis and reasoning capabilities powered by Google's Gemini AI

    TypeScript 105 13

  4. dspy-micro-agent dspy-micro-agent Public

    Minimal agent runtime built with DSPy modules and a thin Python loop. Includes CLI, FastAPI server, and eval harness with OpenAI/Ollama support.

    Python 71 6

  5. dspy-advanced-prompting dspy-advanced-prompting Public

    State-of-the-art prompting techniques implementation with DSpy - Manager-style prompts, role personas, meta-prompting, and more

    Python 53 3

  6. orbit-agent orbit-agent Public

    A brutally honest "high‑orbit" startup advisor you can text or run from the CLI. Built with DSPy, it provides opinionated, YC-style advice and financial tools for founders.

    Python 20

Repositories

Showing 10 of 28 repositories
  • service-runtime Public

    Shared Go runtime helpers for EvalOps services

    evalops/service-runtime’s past year of commit activity
    Go 0 0 13 0 Updated Apr 14, 2026
  • proto Public

    Canonical protobuf definitions for EvalOps cross-service contracts — 14 packages covering identity, metering, governance, approvals, entities, memory, prompts, skills, events, and more

    evalops/proto’s past year of commit activity
    TypeScript 0 0 1 0 Updated Apr 14, 2026
  • agent-mcp Public

    Unified MCP server for external agent integration — single config line gives any MCP-capable agent identity, governance, approvals, metering, and operating rules

    evalops/agent-mcp’s past year of commit activity
    Go 1 0 1 0 Updated Apr 14, 2026
  • lark Public

    Native macOS desktop client for Claude computer use - a floating pill UI that lets Claude control your computer through natural language

    evalops/lark’s past year of commit activity
    TypeScript 0 MIT 0 0 0 Updated Apr 14, 2026
  • ensemble-tap Public

    Ingest SaaS webhooks, polls, and CDC into NATS JetStream + ClickHouse — gives Ensemble continuous awareness of customer business systems

    evalops/ensemble-tap’s past year of commit activity
    Go 0 0 2 1 Updated Apr 14, 2026
  • mocktopus Public

    🐙 Multi-armed mocks for LLM apps - Drop-in replacement for OpenAI/Anthropic APIs for deterministic testing

    evalops/mocktopus’s past year of commit activity
    Python 6 MIT 0 1 0 Updated Apr 14, 2026
  • asb Public

    Agents-first secret broker control plane in Go

    evalops/asb’s past year of commit activity
    Go 0 Apache-2.0 0 22 0 Updated Apr 14, 2026
  • mcp-firewall Public

    A small MCP (Model Context Protocol) firewall that proxies JSON-RPC and enforces allow/deny policies for tools, resources, prompts, and methods

    evalops/mcp-firewall’s past year of commit activity
    Go 0 MIT 0 0 0 Updated Apr 14, 2026
  • mcp-openapi Public

    OpenAPI 3.x to MCP server bridge in TypeScript with stdio, StreamableHTTP, and SSE transports

    evalops/mcp-openapi’s past year of commit activity
    TypeScript 2 MIT 0 0 0 Updated Apr 14, 2026
  • shared-memory-mcp Public

    Shared Memory MCP server for agentic teams - solving coordination tax with 6x token efficiency

    evalops/shared-memory-mcp’s past year of commit activity
    TypeScript 7 MIT 1 0 0 Updated Apr 14, 2026

People

This organization has no public members. You must be a member to see who’s a part of this organization.

Top languages

Loading…

Most used topics

Loading…