⚡Local AI: Browser Agents + Ultra Low Latency Meeting Assist
Issue #3 · Week of August 25
We pulled the standouts from AI Tinkerers meetups around the globe from the last two weeks: lots of agent orchestration and programmatic prompt optimization, with nods to on-device agents matching the recent push toward efficient, tool-savvy systems. Henry Mao (Singapore) shows MCP-first integration in Tool Calls are the New Clicks; Mahir Isikli (Berlin) compiles prompts with DSPy, not vibes; Rakesh Kumar (Chicago) runs private agents in the browser and Enrico Foschi showed how to achieve ultra low latency with locally-run voice apps. We scored for tech merit and survey feedback—real, vetted, practical builds. Read on.
Smithery: Tool Calls
The future of the internet will be dominated by tool calls, not clicks. We're building Smithery to orchestrate the new era of AI-native services for AI agents. Henry Mao's demo showcased an AI agent leveraging multiple tools to perform a task autonomously and discussed the rising ecosystem of AI-native services (MCPs), and explained the technical underpinnings and approaches for enabling search and routing across MCPs to support agent tool discovery.
TECH STACK
|
DSPy Live Prompt Optimization
It was a demo of turning ad-hoc prompts into a repeatable optimization flow with DSPy. Mahir Isikli, an ML engineer, started with a brittle LLM pipeline, showed baseline accuracy, latency, and cost, then defined Signatures and Modules and ran an optimizer to generate candidates and re-tune prompts. He walked through the code and dataset wiring from the repo. Attendees left with a copyable recipe for prompt tuning; survey feedback hinted at its practical value for builders in our network.
|
AWS: AgentCore & Kiro
Pandurang Nayak pitched a demo on Recent Announcements from AWS, showcasing Kiro (an agentic coding IDE), AgentCore (new AWS agentic services), and Strands SDK for building agents. It highlighted cloud-native, production-friendly tooling that lowers barriers for agentic LLM apps, with practical demos and accessible repos. Pandurang, a seasoned AWS leader for ASEAN startups, conveyed solid execution. Feedback from the community hinted (people loved it), and the work points to a clear path for startup-scale automation and dev-prod workflows.
TECH STACK
|
VeoSpark: Veo 3 Ad Pipeline
Lewis (lekt9), an AI engineer and open-source contributor with projects like alBERT-launcher, demoed VeoSpark, a conversational agent for AI video creation that reimagines workflows with tools like Veo 3 and Kling. Instead of prompt engineering, creators direct videos through natural conversation—starting with a script, then refining details like “make the boss more confused in scene 2”—while the system preserves what works. Under the hood, entity-based JSON outputs enable partial regeneration, context preservation, and intelligent stitching, reducing fatigue from repeated full regenerations. Early user tests showed high activation and iteration, and the demo drew strong enthusiasm, pointing to a scalable product for teams that want to direct videos like stories, not prompts. See also: videothon.getvideos.app - a competition for making the best video - credits provided.
TECH STACK
|
ClueWing
Enrico presented his side project, ClueWing, a fully local meeting assistant, using local c++ modules for transcription (whisper.cpp - 30ms latency), llm processing (llama.cpp - 250ms processing on macbook m1 air 8gb), and a Flutter app able to get an overall realtime meeting coaching. All at 20% cpu usage.
TECH STACK
PROJECT LINKS
|
How to Ship Complex Features 10x Faster with AI Agents | Dex Horthy (HumanLayer)
How to Run Open-Source LLMs Locally on a Mac with MLX-LM
You are one of 95,000+ readers from Anthropic, OpenAI, Google, Microsoft, Meta, Apple, Amazon, Nvidia, Netflix, Stripe, Databricks, Snowflake, and others — spanning frontier labs, big tech, startups, and top universities.
⚡Local AI: Browser Agents + Ultra Low Latency Meeting Assist