Storage options

Recommended Configuration

Node	Boot Device	Storage	Size
clusterclaw (RPi5)	MicroSD	OS + Docker + OpenClaw + ThreadWeaver	128GB
clustercrush (Orin Nano)	MicroSD	OS (JetPack)	128GB
clustercrush (Orin Nano)	NVMe M.2	Models + llama.cpp	256GB

Filesystem: ext4 (default for both Raspbian and JetPack)

Component	Size
Raspbian Desktop (stripped + hardened)	~3.4GB
Docker engine	~500MB
OpenClaw container	~1.2GB
ThreadWeaver container	~800MB
Node.js 24	~230MB
Blinkt! LED daemon	~1MB
System overhead + logs	~500MB
Total used	~6.6GB
Free on 128GB card	~115GB

Workload is mostly reads (Docker image layers, serving web UI)
No heavy writes (logs are capped, no database)
High-endurance microSD cards (Samsung PRO Endurance, SanDisk MAX Endurance) handle the write load
M.2 NVMe via HAT adds cost and case height for speed that isn’t noticeable in this use case

The Orin Nano uses two storage devices:

Component	Size
JetPack Desktop (stripped + hardened)	~18GB
System utilities + security tools	~1GB
System overhead + logs	~500MB
Total used	~20GB
Free on 128GB card	~100GB

NVMe Size	Fits	Use Case
256GB	~20 Q4 models	Standard deployment
512GB	~40+ Q4 models	Multi-model testing, larger quantizations
1TB	Extensive model library	Research, dataset storage

Only one model runs at a time on the 8GB Orin Nano (GPU VRAM limit). Use model-switch to swap between them.

Model	Size (Q4_K_M)	Context	Gen Speed	Use Case
Llama 3.2 1B	0.7GB	128K	~80 t/s	Ultra-fast routing, classification
Llama 3.2 3B	1.9GB	128K	~18 t/s	Primary agent model
Phi-3.5 Mini 3.8B	2.3GB	128K	~17 t/s	Strong reasoning
Qwen 2.5 3B	2.0GB	32K	~18 t/s	Code / structured output
Llama 3.1 8B	4.7GB	128K	~10 t/s	Highest quality; slower
Moondream2	1.9GB	2K	~15 t/s	Vision (lightweight)
Llama 3.2 Vision 11B	6.2GB	128K	~7 t/s	Best vision + language; tight fit

# On clustercrush:
sudo update-clustercrush.sh add-model https://huggingface.co/.../model.gguf
sudo model-switch new-model.gguf