Top AI Demos #24: Deterministic agent pipelines, code leaks and agent memory [AI Tinkerers - Post-Training] .

Top AI Demos #24: Deterministic agent pipelines, code leaks and agent memory

AI Tinkerers

Top AI Demos #24: Deterministic agent pipelines, code leaks and agent memory

Issue #24 · Week of April 27

Joe Heitzeberg
Joe Heitzeberg • Founder at AI Tinkerers • ⏱️ 1 min read
Creating space for leading builders to share ideas, grow, and make an impact.

This week, builders are diving deep into agent orchestration, developer tooling, and system memory. We saw Kerry Ritter’s Harness Engineering tackle deterministic agent pipelines, while Jeff Rhatigan’s Wisdom Layer introduced an architecture for persistent memory and reflection.

In developer tooling, Daniel Motles shared insights from Claude Code: Source Leak Analysis, and Matt Walters presented GitKB: Distributed Knowledge Graph Protocol for collaborative coding.

We also saw systems enhance continuity, like Thang Chung’s MCP: Adaptive Tool Orchestration and Jason Gardner’s hiveWiki: Collaborative Agent Wiki.

Top 5 Picks (April 27)
1 TOP PICK

Claude Code: Source Leak Analysis

Profile photo

Daniel Motles

Manager, AI Acceleration at Qumulo

Cooking
TECH STACK
Loading tech tags...
2 RUNNER UP

Wisdom Layer: Agent Cognitive Architecture

Profile photo

Jeff Rhatigan

Cognitive Architect at Rhatigan Labs

Cooking
TECH STACK
Loading tech tags...
3 COMMUNITY FAVORITE

GitKB: Distributed Knowledge Graph Protocol

Profile photo

Matt Walters

Founder at GitKB

Cooking
TECH STACK
Loading tech tags...
4 STANDOUT

Harness Engineering: Deterministic Agent Pipelines

Profile photo

Kerry Ritter

Founder/CTO at Zipper

Cooking
TECH STACK
Loading tech tags...
PROJECT LINKS
5 NOTABLE

Parallel AI Agent Development

Profile photo

Jamilton Alonso Quintero Osorio

CTO/CPO at SaleAds

Cooking
TECH STACK
Loading tech tags...
PROJECT LINKS
saleads.ai

More Great Builds
Quick hits from the community — demos worth bookmarking:
Jason Gardner presented hiveWiki.ai, a wiki-based collaboration layer where independent agents, Claude Chat, a Cursor Agent, and a human share the same usable “wiki artifact” for transparent coordination via MCP. The demo showed how agents could draft, critique, and deploy a feature while keeping the context consistent and reviewable. For anyone building toward more autonomous workflows without the brittleness, this felt like a practical path to agent memory and human-in-the-loop clarity people kept coming back to.
Loading tech tags...
Profile photo
Justin BergeronAI Tinkerers - Chicago • Apr 14
Justin Bergeron demoed his Mac Mini AI setup that worked like a scrappy local “dev team” for HausHavn, splitting responsibilities across architecture, coding, QA, and PM, then orchestrating handoffs into a coherent software workflow. The project emphasizes role design over one-shot prompting, using local agent coordination and practical automation to keep tasks moving. It stood out because it tackled the messy reality of making agents useful as process infrastructure, not magic, and (people kept saying this felt immediately adaptable). We liked how it points toward agentic workflows that can scale cost-effectively on modest hardware.
Loading tech tags...
Bob Prendergast showcased a real-time MoE neural telemetry engine that visualizes Mixture of Experts routing decisions from IBM Granite 3.0. The demo used a PyTorch forward hook to capture 40-dimensional routing weights from Layer 20 and render a live laptop heatmap of expert activation. After that, the model reflected on what it was doing by interpreting its own routing pattern, and people seemed to really enjoy that loop. For builders chasing efficient, local inference and agentic introspection, it felt like a practical blueprint for turning sparse activation into something you can actually debug, demo, and productize.
Loading tech tags...
Alula Zeryihun presented a multi-agent sales engine that kills guesswork in outbound by running seven specialized agents before any outreach happens: identity, profile, activity, market, network, scoring, then synthesis. The workflow grounds decisions in live web data, passing structured JSON contracts between stages to keep messaging context-aware while controlling cost and latency. It felt especially relevant to builders because it turned a “single-shot prompt” into a decision pipeline, and people seemed to love the clarity of the orchestration. The demo also fits the current shift toward more agentic, local-first automation for regulated, real-world work.
Loading tech tags...
Henry George from Super44 presented Boomerang, a custom agentic browser infra built as a replacement for tools like browserbase and Firecrawl, focused on reliable web automation. The system handles stealth-style browsing and orchestrates agent web flows while working through messy realities like Cloudflare friction, so the agents fail less and recover better. Given his hands-on serverless, event-driven background, it felt like an infrastructure case study you can actually reuse. We liked it because builders (people seemed to) want practical guidance on agentic web usage, right when high-speed local inference and long-horizon autonomy make this layer increasingly necessary.
Loading tech tags...
Profile photo
Felipe PachecoAI Tinkerers - Santiago • Apr 16
Felipe Pacheco from Poweredia presented Anthropic Ambassador Chile, a local-first push to help Latin America move from AI consumers to AI builders. The demo brought together Claude community leadership, Bendita IA’s hands-on builder ecosystem, and his Agent OS approach to orchestrating intelligent agents with clear organization-level governance. It tied fintech experimentation to practical tooling, and it landed well with the community, with feedback hinting that people wanted more “how to build” paths. We liked it because it turns agentic ambition into repeatable developer momentum.
Loading tech tags...
J.D Nicholls spotlighted “Ingeniería Inversa de Plugins de Claude Code” and showed how the repo notebooklm-ai-plugin turns NotebookLM into a terminal-first workflow by reverse engineering the free RPC path using browser cookies plus Chrome CDP. He implemented the plugin in TypeScript and Bun, packaging multiple artifact types with a single .claude-plugin manifest for easy distribution. We liked how people leaned in to try it immediately (people loved it). It fit the broader agentic tools trend: practical orchestration you can actually run, adapt, and ship.
Loading tech tags...
Thang Chung from Tiger Tribe presented a demo on adaptive tool orchestration with on-demand discovery and code mode execution. The project runs an MCP-based transform that does contextual tool resolution, then generates controlled C# workflows using the GitHub Copilot C#/.NET SDK and the MCP .NET SDK to chain calls with less prompt bloat. We liked how it shifted “pick from everything” into “query, narrow scope, then execute safely,” and the audience feedback quietly leaned positive. It also points toward a more product-ready, enterprise-friendly agent pattern where tool catalogs stay lean while orchestration stays deterministic.
Loading tech tags...
Alain Krok from AWS shared an autonomous background coding agent demo that runs long horizon tasks without constant supervision. The agent kept working in the background, managing state and task progression while tackling multi-step coding changes like an “always on” engineer. Given his systems plus full-stack experience in C++ and cloud tooling, he focused on the real engineering friction: reliability, focus, and safe continuation across long runs. People also seemed to like how pragmatic it felt for builders staring at codebase-scale work, and it pointed to the next shift toward dependable agentic execution.
Loading tech tags...
Juan Campos Gomez presentó Pista Inteligente: un pipeline end-to-end que convierte PDFs y texto de carreras en CSV con Claude como ETL estructurado, luego entrena un LightGBM LambdaRank con Walk-Forward Validation para ordenar por probabilidad de victoria y calibra con isotonic regression para que el score sea interpretable. Todo corrió en Python con mlflow para tracking de experimentos. Lo que gustó (people loved it) fue lo reutilizable de tres ideas: prompts con header fijo para colapsar ambigüedad, ranking vs regresión y calibración sin reentrenar. Además, el caso real para datos hípicos en Chile lo vuelve cercano y con potencial de producto tipo “predicciones con explicación” para apuestas o gestión.
Loading tech tags...

🏆 Hackathon Spotlight
Recent AI Tinkerers Hackathon Winners
🥇 1st Darkwood
Team Darkwood engineered a high-performance RAG pipeline that enables the Qwen2.5-Coder-3B model to generate production-grade Polars code by combining dynamic few-shot retrieval with a specialized rule-based prompting strategy for strict syntax reliability. This collaborative trio features Mirza Marotsaha, CEO of Latro.Link, alongside fullstack developer Mathieu Ledru and aspiring engineer Victor-eliejah Garnier.
🏅 Winner bluebull
Bluebull optimized Qwen2.5-Coder-7B with 4-bit quantization and a clever self-repair loop to achieve a perfect 16/16 correctness score on Polars code generation within a strict 4GB VRAM footprint. Led by Vasiliki Doropoulou, a front-end developer at Ublo specializing in AI-driven web applications.
🏅 Winner Muon
Muon engineered a high-efficiency FastAPI pipeline that achieves 70% accuracy in generating executable Polars code by combining a 4B-parameter model with a clever "Pseudo-RAG" routing system to inject targeted API snippets. Led by Imane Momayiz, a freelance ML engineer with four years of experience who built and benchmarked the entire production-ready system as a solo hacker.

🎬 Latest Content

How to Ship Complex Features 10x Faster with AI Agents | Dex Horthy (HumanLayer)

One-Shot • Mar 04
Dex Horthy (HumanLayer) breaks down the “12 Factor Agents” approach to shipping multi-step agentic workflows faster: structured outputs, ...
Watch Now →

How to Run Open-Source LLMs Locally on a Mac with MLX-LM

Deep Dive Series • Jun 12
Run open-source LLMs locally on Apple Silicon with Apple’s MLX-LM: `pip install mlx-lm`, then `load()` a Hugging Face model and call `gen...
Read More →

💼 Top Job Matches
Matched based on your meetup activity and profile
Paxos Health • New York & Toronto • $110k - $175k (varies w/ location/level); generous equity
Stanford-founded Seed-stage healthcare AI startup with >$5M in VC funding and AI agents deployed in production with cu...
Apply Now →
Dex • London (5 days on-site) • £250,000
Frontier AI engineering role building the AI tooling layer for complex financial modelling.
Apply Now →
Jakib AI • Columbus, OH
Jakib is a profitable, growing applied AI firm embedded with operator-led companies in logistics, manufacturing, and c...
Apply Now →

You are one of 95,000+ readers from Apple, Amazon, Microsoft, Google, Nvidia, OpenAI, Anthropic, Cohere, ElevenLabs, Scale AI, Groq, Mistral AI, and others — spanning frontier labs, big tech, startups, and top universities.

Ready for more?

Check out other posts from this blog.

View all posts