<?xml version="1.0" encoding="UTF-8" ?> <?xml-stylesheet type="text/xsl" href="rss.xsl"?><rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:dc="http://purl.org/dc/elements/1.1/"> <channel> <title>Squish</title><description>Fast local LLMs on Apple Silicon: sub-second model loads, faster than Ollama on long prompts. OpenAI and Ollama-compatible.</description><link>https://squish.run/</link><atom:link href="https://squish.run/feed_rss_updated.xml" rel="self" type="application/rss+xml" /> <docs>https://github.com/konjoai/squish</docs><language>en</language> <pubDate>Sun, 28 Jun 2026 11:56:43 -0000</pubDate> <lastBuildDate>Sun, 28 Jun 2026 11:56:43 -0000</lastBuildDate> <ttl>1440</ttl> <generator>MkDocs RSS plugin - v1.19.0</generator> <image> <url>None</url> <title>Squish</title> <link>https://squish.run/</link> </image> <item> <title>I Couldn&#39;t Find a Local LLM Tool Fast Enough, So I Built My Own</title> <author>Wesley Scholl</author> <category>Benchmarks</category> <category>apple-silicon</category> <category>benchmarks</category> <category>local-llm</category> <category>mlx</category> <category>quantization</category> <category>rust</category> <description>A local LLM inference server for Apple Silicon, up to 9.8× faster than Ollama on long prompts, with the honest benchmarks.</description> <link>https://squish.run/blog/local-llm-fast-enough/</link> <pubDate>Fri, 26 Jun 2026 03:16:00 +0000</pubDate> <source url="https://squish.run/feed_rss_updated.xml">Squish</source><guid isPermaLink="true">https://squish.run/blog/local-llm-fast-enough/</guid> <enclosure url="https://squish.run/assets/blog/chart-speed.png" type="image/png" length="None" /> </item> </channel> </rss>