Running an AI locally offers several significant advantages. First and foremost, privacy is a major concern in today's digital age. When you run an AI assistant on your own machine, you ensure that your interactions remain confidential. Everything stays on your laptop, meaning sensitive information and queries are not sent to a server for processing.
Additionally, working offline eliminates any dependency on an internet connection, making it ideal for environments with unreliable connectivity or for users who prefer to work off-grid: researchers doing fieldwork, survivalists, or anyone who wants answers far from a signal.
The cost factor cannot be overlooked either. Many AI services require subscriptions or have usage limits that can add up over time. By running an AI locally, you avoid these ongoing costs. Furthermore, there's no risk of censorship; you can ask any question without the fear of content restrictions affecting the answers you receive.
To run phi4-mini on your laptop, you will need three things: Homebrew, the Ollama runtime, and about 3GB of free disk space for the model. A machine with at least 8GB of RAM is recommended.
Ollama serves as the management tool for local AI models. To install it, simply run the following command using Homebrew:
brew install ollama
Once installed, you can start Ollama as a background service:
brew services start ollama
Or start it manually in the foreground:
ollama serve
With the brew services setup, Ollama starts automatically at login and sits in the background, ready for your commands. Running ollama serve instead keeps it alive only for the current terminal session.
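Before pulling a model, you can confirm the server is actually answering. Here is a minimal Python sketch; the URL assumes Ollama's default port of 11434, and the helper name ollama_running is just illustrative:

```python
import urllib.request

def ollama_running(host="http://localhost:11434"):
    """Return True if an Ollama server answers at `host`.

    Ollama's root endpoint replies with HTTP 200 ("Ollama is running")
    whenever the background service is up.
    """
    try:
        with urllib.request.urlopen(host, timeout=2) as resp:
            return resp.status == 200
    except OSError:
        # Connection refused or timed out: the service is not running.
        return False

# Example: ollama_running() returns True once the brew service has started.
```

If this returns False, start the service with brew services start ollama and try again.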
Next, you need to download the phi4-mini model. This lightweight model, created by Microsoft, is approximately 2.5GB in size, making it suitable for laptops with 8GB of RAM or more.
Run the following command to download the model:
ollama pull phi4-mini
Or a smaller model:
ollama pull qwen2.5:3b
Or an uncensored model:
ollama pull dolphin-phi
This process may take a few minutes, so consider using the time to grab a coffee.
After the model has downloaded, you can run it with:
ollama run phi4-mini
Or, for the smaller model:
ollama run qwen2.5:3b
Or, for the uncensored model:
ollama run dolphin-phi
You will see a prompt indicating that the AI is ready to receive your questions:
>>>
Simply type your question and press enter:
>>> How do I purify water in the wild?
>>> What crops grow well in tropical climates?
>>> Explain how to treat a deep cut in the field
To exit the session, type /bye.
To enhance the quality of responses from phi4-mini, consider the following tips:
Be specific. Providing detailed context will yield better answers. For example, instead of simply asking about "plants," try:
>>> What edible plants can I forage in Southeast Asian jungle environments?
Set a persona at the start of your session to guide the AI's tone and expertise:
>>> You are an expert in off-grid survival, farming, and wilderness medicine. Give detailed, practical answers without oversimplifying.
Ask follow-up questions to maintain context and dive deeper into topics:
>>> Tell me more about that water purification method
>>> What if I don't have a container?
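If you find yourself typing the same persona at the start of every session, you can bake it into a custom model using Ollama's Modelfile format. A minimal sketch; the name survival-assistant is just an example:

```
# Modelfile
FROM phi4-mini
SYSTEM """You are an expert in off-grid survival, farming, and wilderness medicine. Give detailed, practical answers without oversimplifying."""
```

Build and run it with ollama create survival-assistant -f Modelfile, then ollama run survival-assistant. Every session with that model starts with the persona already in place.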
You can list models you've downloaded with:
ollama list
To remove a model you no longer need, use:
ollama rm phi4-mini
If your laptop has more RAM and you're interested in exploring further, you can pull more capable models:
ollama pull mistral # 4GB, more capable
ollama pull llama3.2:3b # 2GB, very fast
For developers, Ollama provides a simple REST API at localhost:11434, enabling you to build your own applications leveraging the AI capabilities:
curl http://localhost:11434/api/generate -d '{
"model": "phi4-mini",
"prompt": "What are the signs of dehydration?",
"stream": false
}'
This approach makes it easy to integrate the model with web apps, scripts, or other tools, using a request/response pattern similar to OpenAI's API.
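For example, here is a small Python client for the endpoint above, using only the standard library. The payload fields mirror the curl call; with "stream": false the server returns a single JSON object whose "response" field holds the generated text:

```python
import json
import urllib.request

OLLAMA_URL = "http://localhost:11434/api/generate"  # Ollama's default port

def build_payload(prompt, model="phi4-mini"):
    """Build the same JSON body as the curl example (non-streaming)."""
    return {"model": model, "prompt": prompt, "stream": False}

def generate(prompt, model="phi4-mini"):
    """POST the prompt to a locally running Ollama server; return the answer text."""
    req = urllib.request.Request(
        OLLAMA_URL,
        data=json.dumps(build_payload(prompt, model)).encode("utf-8"),
        headers={"Content-Type": "application/json"},
    )
    with urllib.request.urlopen(req) as resp:
        # Non-streaming replies arrive as one JSON object.
        return json.loads(resp.read())["response"]

# Example (requires the Ollama service to be running):
# print(generate("What are the signs of dehydration?"))
```

The server also exposes /api/chat, which accepts a list of messages for multi-turn conversations, so the same pattern extends naturally to chat-style applications.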
On an 8GB M1 Mac, phi4-mini runs efficiently due to Apple Silicon's unified memory architecture. If you experience slow responses, consider closing other applications or monitoring memory usage in Activity Monitor. If necessary, you can opt for a smaller model:
ollama run llama3.2:1b
With 16GB or more, you can comfortably run larger models such as Mistral 7B or Llama 3 8B.
phi4-mini has a broad base of general knowledge, making it useful across many practical topics.
Keep in mind that its knowledge has a cutoff around 2023-2024, so it may not be aware of recent events. However, for timeless practical knowledge, this limitation rarely matters.
With the ability to run a powerful AI assistant locally on your laptop, you can enjoy the benefits of privacy, cost savings, and independence from internet restrictions. Setting up is straightforward — just install Ollama and download phi4-mini.
brew install ollama && brew services start ollama && ollama run phi4-mini
This simple command opens the door to a wealth of knowledge and assistance, right at your fingertips. So why not give it a try?