Ben Zanghi
Home | Portfolio | Publications | Resume

Publications

Insights and perspectives on AI, digital health, and emerging technologies

Engram: Cognitive-Inspired AI Memory Architecture
February 14, 2026 · 15 min read

Building a production memory system for conversational AI based on cognitive science principles: dual storage, temporal validity, uncertainty tracking, and entity deduplication.

AI Research
AgentEvolve: When LLM Orchestration Helps — and When It Hurts
February 14, 2026 · 12 min read

Evolutionary search over AI agent orchestration patterns. 18 experiments across 6 models (7B–405B parameters) reveal that orchestration effectiveness depends entirely on model scale.

AI Research
Claudia Voice: Two-Tier Conversational AI Architecture
February 14, 2026 · 10 min read

Building a production voice assistant with local GLM routing and semantic memory. 85% of queries are handled at 700 ms latency, with smart home integration and Engram memory.

AI Systems
PersonaPlex-MLX: Porting a Neural Codec to Apple Silicon
February 2026 · 10 min read

Porting Kyutai's Mimi neural audio codec to Apple Silicon using MLX. On-device speech encoding and decoding with no cloud dependency, running at native Metal GPU speeds.

AI Research