HyperAIHyperAI

Command Palette

Search for a command to run...

Osaurus brings local and cloud AI to Mac

Osaurus, an open-source Mac-only LLM server, has emerged to address the growing need for a software layer that manages diverse AI models while prioritizing user data privacy. Founded by Terence Pae and Sam Yoo, the project originated from Pae's previous venture, Dinoki, which aimed to create an AI assistant similar to the digital Clippy. Users expressed reluctance to pay for token usage with cloud services when they could run powerful models locally. This feedback prompted Pae, a former engineer at Tesla and Netflix, to build a tool that allows users to switch seamlessly between local and cloud models while keeping files, memory, and tools on their own hardware. Today, Osaurus functions as a flexible harness, connecting various AI models and workflows through a single, consumer-friendly interface. Unlike developer-focused tools like OpenClaw or Hermes that often require terminal knowledge and may present security risks, Osaurus operates within a hardware-isolated virtual sandbox. This design ensures that AI interactions are limited in scope, safeguarding the user's computer and sensitive data. The platform supports a wide array of locally hosted models, including MiniMax M2.5, Gemma 4, Qwen3.6, GPT-OSS, Llama, and DeepSeek V4, as well as Apple's on-device foundation models and Liquid AI's LFM family. For cloud integration, users can access services from OpenAI, Anthropic, Gemini, xAI, and others. As a full Model Context Protocol server, Osaurus enables MCP-compatible clients to access user tools. It currently ships with over twenty native plugins for common macOS tasks such as email, calendar, vision, file management, web browsing, and code repositories. A recent update also introduced voice capabilities. Since its launch nearly a year ago, the project has recorded over 112,000 downloads. Running local AI models remains resource-intensive, requiring at least 64 GB of RAM for standard operation and up to 128 GB for larger models like DeepSeek V4. However, Pae predicts that the efficiency of local AI will improve significantly, noting that intelligence per wattage is on an upward trajectory. He highlights that modern local models can now execute tools, write code, and control applications, capabilities that were previously unattainable. Currently participating in the Alliance startup accelerator in New York, Osaurus is exploring future expansions into the business sector, particularly for industries like healthcare and law where data privacy is critical. The founders believe that as local AI capabilities grow, they could reduce the global demand for energy-intensive cloud data centers. Pae argues that deploying on-premises solutions like a Mac Studio offers cloud-like functionality with substantially lower power consumption, providing a sustainable alternative to centralized infrastructure.

Related Links