Running MiniMax M2.5 Locally on NVIDIA DGX Spark

Michael Mueller | Feb 2026 | 10 min read

AI LLM NVIDIA DGX Spark MiniMax Open Source Local Inference llama.cpp

How I got a 230B-parameter open model running on desktop hardware using NVIDIA DGX Spark, Unsloth quantization, and llama.cpp, matching cloud API performance without cloud dependencies.
