LLM | Steph Locke

LLM

PrismML — Concentrating …

Intelligence Density: PrismML focuses on “intelligence density,” building ultra-dense models that maximize performance while minimizing size and energy consumption.
1-Bit Bonsai Architecture: The company has launched the first commercially viable 1-bit weight LLM family (Bonsai 8B, 4B, …
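To give a feel for why 1-bit weights shrink a model, here is a minimal sketch of generic sign binarization with a single scale factor, plus back-of-envelope memory arithmetic. It is not PrismML's method, which the summary above does not describe. The helper names (`binarize`, `pack_bits`, `dequantize`, `weight_bytes`) are hypothetical, "8B" is assumed to mean 8 billion parameters, and the estimate ignores scale factors, embeddings, and activations.

```python
def binarize(weights):
    """Sign-binarize weights; one shared scale = mean absolute value (assumed scheme)."""
    scale = sum(abs(w) for w in weights) / len(weights)
    bits = [1 if w >= 0 else 0 for w in weights]
    return bits, scale


def pack_bits(bits):
    """Pack 0/1 values into bytes, 8 weights per byte (least significant bit first)."""
    packed = bytearray((len(bits) + 7) // 8)
    for i, b in enumerate(bits):
        if b:
            packed[i // 8] |= 1 << (i % 8)
    return bytes(packed)


def dequantize(packed, n, scale):
    """Recover approximate weights: +scale for a set bit, -scale otherwise."""
    return [scale if (packed[i // 8] >> (i % 8)) & 1 else -scale for i in range(n)]


def weight_bytes(num_params, bits_per_weight):
    """Raw weight storage in bytes, ignoring all overhead."""
    return num_params * bits_per_weight // 8


weights = [0.4, -0.2, 0.1, -0.5, 0.3, -0.1, 0.2, -0.6, 0.8]
bits, scale = binarize(weights)
packed = pack_bits(bits)          # 9 weights fit in 2 bytes
restored = dequantize(packed, len(weights), scale)

# Assumption: "8B" means 8 billion parameters.
fp16_gb = weight_bytes(8_000_000_000, 16) / 1e9     # 16.0
one_bit_gb = weight_bytes(8_000_000_000, 1) / 1e9   # 1.0
```

Under these assumptions the raw weights drop from roughly 16 GB at 16 bits each to roughly 1 GB at 1 bit each, which is the kind of size reduction the "intelligence density" framing points at.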

AI, Machine Learning, LLM, Edge AI, Open Source

LLM Optimization Gist - …

LLM Efficiency: Technical exploration of optimizing Large Language Model inference and performance.
Hardware Acceleration: Insights into leveraging specific hardware architectures for faster model execution.
Implementation Details: Detailed breakdown of memory management and compute kernels …

AI, Machine Learning, LLM, Performance, Developer Tools

LLM Wiki: Persistent …

Core argument: RAG retrieves from raw documents on every query, so it rediscovers knowledge from scratch each time; a persistent wiki is better.
Persistent compounding: The wiki is a “persistent, compounding artifact,” so synthesis becomes durable rather than disposable.
Human role: …
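The contrast between per-query retrieval and a persistent wiki can be sketched in a few lines. This is an illustrative toy under stated assumptions, not the design from the linked post: `synthesize` is a hypothetical stand-in for an LLM summarization call, and the `PerQueryRAG` and `PersistentWiki` classes are invented here only to count how often synthesis work is repeated.

```python
from collections import defaultdict


def synthesize(topic, text):
    """Hypothetical stand-in for an LLM call: keeps only the first sentence."""
    return text.split(".")[0].strip() + "."


class PerQueryRAG:
    """Re-reads and re-synthesizes the raw documents on every query."""

    def __init__(self, corpus):
        self.corpus = corpus          # list of (topic, document) pairs
        self.synthesis_calls = 0

    def query(self, topic):
        notes = []
        for doc_topic, document in self.corpus:
            if doc_topic == topic:
                notes.append(synthesize(doc_topic, document))
                self.synthesis_calls += 1
        return " ".join(notes)


class PersistentWiki:
    """Synthesizes once at ingest time; queries read the stored pages."""

    def __init__(self):
        self.pages = defaultdict(list)   # topic -> accumulated notes
        self.synthesis_calls = 0

    def ingest(self, topic, document):
        note = synthesize(topic, document)
        self.synthesis_calls += 1
        if note not in self.pages[topic]:
            self.pages[topic].append(note)

    def query(self, topic):
        return " ".join(self.pages.get(topic, []))


corpus = [
    ("rag", "RAG retrieves raw chunks per query. More detail follows."),
    ("rag", "A wiki stores synthesized notes. More detail follows."),
]

rag = PerQueryRAG(corpus)
wiki = PersistentWiki()
for topic, document in corpus:
    wiki.ingest(topic, document)

rag_answers = [rag.query("rag") for _ in range(3)]
wiki_answers = [wiki.query("rag") for _ in range(3)]
```

Both toy systems return the same answer, but after three queries the per-query version has performed six synthesis calls while the wiki has performed two, all at ingest time. That difference is the "durable rather than disposable" point in miniature.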

Knowledge Management, AI, LLM, RAG, Second Brain