arstechnica.com/apple/2026/03/running-local-models-on-macs-gets-faster-with-ollamas-mlx-support

Running local models on Macs gets faster with Ollama's MLX support - Ars Technica

This is the newest public snapshot for this URL and the best place to start reviewing the page.

Apr 1, 2026, 12:46 AM

Source URL

https://arstechnica.com/apple/2026/03/running-local-models-on-macs-gets-faster-with-ollamas-mlx-support/

About this page

This page reports that Ollama, a runtime system for local large language models, has introduced support for Apple's open-source MLX machine learning framework. Combined with improved caching performance and Nvidia's NVFP4 format support, this significantly enhances performance on Apple Silicon Macs. As developers face API rate limits and high subscription costs, local model experimentation is accelerating, particularly for coding tasks. The new MLX support is available in preview as Ollama 0.19, currently supporting Alibaba's Qwen3.5, and requires at least 32GB RAM on Apple Silicon-equipped systems.

Open latest saved version Open oldest saved version Open full history

Total saves

Latest save

Apr 1, 2026, 12:46 AM

First save

Apr 1, 2026, 12:46 AM

Open latest saved version Open oldest saved version Newest first Oldest first

Page 1

Saved versions

Running local models on Macs gets faster with Ollama's MLX support - Ars Technica

This page reports that Ollama, a runtime system for local large language models, has introduced supp...

4/1/2026

arstechnica.com/apple/2026/03/running-local-models-on-macs-gets-faster-with-ollamas-mlx-support web archives are listed here. You can still review the saved screenshot and HTML even if the original page disappears.

Save another page Search archives