Top AI Demos #24: Deterministic agent pipelines, code leaks and agent memory

Joe Heitzeberg • Founder at AI Tinkerers • ⏱️ 1 min read

Creating space for leading builders to share ideas, grow, and make an impact.

This week, builders are diving deep into agent orchestration, developer tooling, and system memory. We saw Kerry Ritter’s Harness Engineering tackle deterministic agent pipelines, while Jeff Rhatigan’s Wisdom Layer introduced an architecture for persistent memory and reflection.

In developer tooling, Daniel Motles shared insights from Claude Code: Source Leak Analysis, and Matt Walters presented GitKB: Distributed Knowledge Graph Protocol for collaborative coding.

We also saw systems enhance continuity, like Thang Chung’s MCP: Adaptive Tool Orchestration and Jason Gardner’s hiveWiki: Collaborative Agent Wiki.

Top 5 Picks (April 27)

1 TOP PICK

Claude Code: Source Leak Analysis

Daniel Motles

Manager, AI Acceleration at Qumulo

📍 AI Tinkerers - Seattle • Apr 13

Cooking

TECH STACK

Loading tech tags...

PROJECT LINKS

slideshare.net

@dmotles

2 RUNNER UP

Wisdom Layer: Agent Cognitive Architecture

Jeff Rhatigan

Cognitive Architect at Rhatigan Labs

📍 AI Tinkerers - Manchester New Hampshire • Apr 15

Cooking

TECH STACK

Loading tech tags...

PROJECT LINKS

wisdomlayer.ai

github.com

3 COMMUNITY FAVORITE

GitKB: Distributed Knowledge Graph Protocol

Matt Walters

Founder at GitKB

📍 AI Tinkerers - Austin • Apr 16

Cooking

TECH STACK

Loading tech tags...

PROJECT LINKS

gitkb.com

@iammattwalters

4 STANDOUT

Harness Engineering: Deterministic Agent Pipelines

Kerry Ritter

Founder/CTO at Zipper

📍 AI Tinkerers - St. Louis • Apr 14

Cooking

TECH STACK

Loading tech tags...

PROJECT LINKS

5 NOTABLE

Parallel AI Agent Development

Jamilton Alonso Quintero Osorio

CTO/CPO at SaleAds

📍 AI Tinkerers - Medellín • Apr 16

Cooking

TECH STACK

Loading tech tags...

PROJECT LINKS

saleads.ai

More Great Builds

Quick hits from the community — demos worth bookmarking:

hiveWiki: Collaborative Agent Wiki

Jason Gardner • AI Tinkerers - Upstate NY • Apr 14

Jason Gardner presented hiveWiki.ai, a wiki-based collaboration layer where independent agents, Claude Chat, a Cursor Agent, and a human share the same usable “wiki artifact” for transparent coordination via MCP. The demo showed how agents could draft, critique, and deploy a feature while keeping the context consistent and reviewable. For anyone building toward more autonomous workflows without the brittleness, this felt like a practical path to agent memory and human-in-the-loop clarity people kept coming back to.

Loading tech tags...

hivewiki.ai

HausHavn: Local AI Dev Team

Justin Bergeron • AI Tinkerers - Chicago • Apr 14

Justin Bergeron demoed his Mac Mini AI setup that worked like a scrappy local “dev team” for HausHavn, splitting responsibilities across architecture, coding, QA, and PM, then orchestrating handoffs into a coherent software workflow. The project emphasizes role design over one-shot prompting, using local agent coordination and practical automation to keep tasks moving. It stood out because it tackled the messy reality of making agents useful as process infrastructure, not magic, and (people kept saying this felt immediately adaptable). We liked how it points toward agentic workflows that can scale cost-effectively on modest hardware.

Loading tech tags...

haushavn.com

paperclip.ing

Video

IBM Granite: Visualizing MoE Routing

Bob Prendergast • AI Tinkerers - Manchester New Hampshire • Apr 15

Bob Prendergast showcased a real-time MoE neural telemetry engine that visualizes Mixture of Experts routing decisions from IBM Granite 3.0. The demo used a PyTorch forward hook to capture 40-dimensional routing weights from Layer 20 and render a live laptop heatmap of expert activation. After that, the model reflected on what it was doing by interpreting its own routing pattern, and people seemed to really enjoy that loop. For builders chasing efficient, local inference and agentic introspection, it felt like a practical blueprint for turning sparse activation into something you can actually debug, demo, and productize.

Loading tech tags...

Multi-Agent Sales Research Engine

Alula Zeryihun • AI Tinkerers - Houston • Apr 14

Alula Zeryihun presented a multi-agent sales engine that kills guesswork in outbound by running seven specialized agents before any outreach happens: identity, profile, activity, market, network, scoring, then synthesis. The workflow grounds decisions in live web data, passing structured JSON contracts between stages to keep messaging context-aware while controlling cost and latency. It felt especially relevant to builders because it turned a “single-shot prompt” into a decision pipeline, and people seemed to love the clarity of the orchestration. The demo also fits the current shift toward more agentic, local-first automation for regulated, real-world work.

Loading tech tags...

zuluengine.run

Boomerang: Agentic Browser Infrastructure

Henry George • AI Tinkerers - Cologne • Apr 16

Henry George from Super44 presented Boomerang, a custom agentic browser infra built as a replacement for tools like browserbase and Firecrawl, focused on reliable web automation. The system handles stealth-style browsing and orchestrates agent web flows while working through messy realities like Cloudflare friction, so the agents fail less and recover better. Given his hands-on serverless, event-driven background, it felt like an infrastructure case study you can actually reuse. We liked it because builders (people seemed to) want practical guidance on agentic web usage, right when high-speed local inference and long-horizon autonomy make this layer increasingly necessary.

Loading tech tags...

Anthropic Ambassador Chile

Felipe Pacheco • AI Tinkerers - Santiago • Apr 16

Felipe Pacheco from Poweredia presented Anthropic Ambassador Chile, a local-first push to help Latin America move from AI consumers to AI builders. The demo brought together Claude community leadership, Bendita IA’s hands-on builder ecosystem, and his Agent OS approach to orchestrating intelligent agents with clear organization-level governance. It tied fintech experimentation to practical tooling, and it landed well with the community, with feedback hinting that people wanted more “how to build” paths. We liked it because it turns agentic ambition into repeatable developer momentum.

Loading tech tags...

benditaia.cl

poweredia.com

Claude Code: Reverse Engineering Plugins

J.D Nicholls • AI Tinkerers - Medellín • Apr 16

J.D Nicholls spotlighted “Ingeniería Inversa de Plugins de Claude Code” and showed how the repo notebooklm-ai-plugin turns NotebookLM into a terminal-first workflow by reverse engineering the free RPC path using browser cookies plus Chrome CDP. He implemented the plugin in TypeScript and Bun, packaging multiple artifact types with a single .claude-plugin manifest for easy distribution. We liked how people leaned in to try it immediately (people loved it). It fit the broader agentic tools trend: practical orchestration you can actually run, adapt, and ship.

Loading tech tags...

github.com

MCP: Adaptive Tool Orchestration

Thang Chung • AI Tinkerers - Ho Chi Minh City • Apr 18

Thang Chung from Tiger Tribe presented a demo on adaptive tool orchestration with on-demand discovery and code mode execution. The project runs an MCP-based transform that does contextual tool resolution, then generates controlled C# workflows using the GitHub Copilot C#/.NET SDK and the MCP .NET SDK to chain calls with less prompt bloat. We liked how it shifted “pick from everything” into “query, narrow scope, then execute safely,” and the audience feedback quietly leaned positive. It also points toward a more product-ready, enterprise-friendly agent pattern where tool catalogs stay lean while orchestration stays deterministic.

Loading tech tags...

github.com

Autonomous Background Coding Agents

Alain Krok • AI Tinkerers - Houston • Apr 14

Alain Krok from AWS shared an autonomous background coding agent demo that runs long horizon tasks without constant supervision. The agent kept working in the background, managing state and task progression while tackling multi-step coding changes like an “always on” engineer. Given his systems plus full-stack experience in C++ and cloud tooling, he focused on the real engineering friction: reliability, focus, and safe continuation across long runs. People also seemed to like how pragmatic it felt for builders staring at codebase-scale work, and it pointed to the next shift toward dependable agentic execution.

Loading tech tags...

github.com

Pista Inteligente: Horse Race Ranking

Juan Campos Gomez • AI Tinkerers - Santiago • Apr 16

Juan Campos Gomez presentó Pista Inteligente: un pipeline end-to-end que convierte PDFs y texto de carreras en CSV con Claude como ETL estructurado, luego entrena un LightGBM LambdaRank con Walk-Forward Validation para ordenar por probabilidad de victoria y calibra con isotonic regression para que el score sea interpretable. Todo corrió en Python con mlflow para tracking de experimentos. Lo que gustó (people loved it) fue lo reutilizable de tres ideas: prompts con header fijo para colapsar ambigüedad, ranking vs regresión y calibración sin reentrenar. Además, el caso real para datos hípicos en Chile lo vuelve cercano y con potencial de producto tipo “predicciones con explicación” para apuestas o gestión.

Loading tech tags...

github.com

pista-inteligente.vercel.app

🏆 Hackathon Spotlight

Recent AI Tinkerers Hackathon Winners

🗓️ Hackathon: Benchmarking Small Language Models in the Real World
Paris • Apr 18, 2026

🥇 1st Darkwood

Team Darkwood engineered a high-performance RAG pipeline that enables the Qwen2.5-Coder-3B model to generate production-grade Polars code by combining dynamic few-shot retrieval with a specialized rule-based prompting strategy for strict syntax reliability. This collaborative trio features Mirza Marotsaha, CEO of Latro.Link, alongside fullstack developer Mathieu Ledru and aspiring engineer Victor-eliejah Garnier.

🏅 Winner bluebull

Bluebull optimized Qwen2.5-Coder-7B with 4-bit quantization and a clever self-repair loop to achieve a perfect 16/16 correctness score on Polars code generation within a strict 4GB VRAM footprint. Led by Vasiliki Doropoulou, a front-end developer at Ublo specializing in AI-driven web applications.

🏅 Winner Muon

Muon engineered a high-efficiency FastAPI pipeline that achieves 70% accuracy in generating executable Polars code by combining a 4B-parameter model with a clever "Pseudo-RAG" routing system to inject targeted API snippets. Led by Imane Momayiz, a freelance ML engineer with four years of experience who built and benchmarked the entire production-ready system as a solo hacker.

🎬 Latest Content

How to Ship Complex Features 10x Faster with AI Agents | Dex Horthy (HumanLayer)

One-Shot • Mar 04

Dex Horthy (HumanLayer) breaks down the “12 Factor Agents” approach to shipping multi-step agentic workflows faster: structured outputs, ...

Watch Now →

How to Run Open-Source LLMs Locally on a Mac with MLX-LM

Deep Dive Series • Jun 12

Run open-source LLMs locally on Apple Silicon with Apple’s MLX-LM: `pip install mlx-lm`, then `load()` a Hugging Face model and call `gen...

💼 Top Job Matches

Matched based on your meetup activity and profile

Founding Applied AI Lead

Paxos Health • New York & Toronto • $110k - $175k (varies w/ location/level); generous equity

Stanford-founded Seed-stage healthcare AI startup with >$5M in VC funding and AI agents deployed in production with cu...

Apply Now →

AI Engineer: Build Frontier Agents Reinventing Financial Modelling

Dex • London (5 days on-site) • £250,000

Frontier AI engineering role building the AI tooling layer for complex financial modelling.

Apply Now →

Software Engineer (All Levels) — Applied AI

Jakib AI • Columbus, OH

Jakib is a profitable, growing applied AI firm embedded with operator-led companies in logistics, manufacturing, and c...

Apply Now →

View All Jobs Post a Job

You are one of 95,000+ readers from Apple, Amazon, Microsoft, Google, Nvidia, OpenAI, Anthropic, Cohere, ElevenLabs, Scale AI, Groq, Mistral AI, and others — spanning frontier labs, big tech, startups, and top universities.

Top AI Demos #24: Deterministic agent pipelines, code leaks and agent memory