ZALVYUM-CORE v0.8.4 KERNEL · ONLINE ARCH: ARM64 · APPLE SILICON

The sovereign agentic OS, in technical detail.

Zalvyum is a local-first cognitive operating system with a quad-hemispheric memory architecture, deterministic process isolation, and zero cloud dependency by default. Built for evaluators who need to know what’s actually running.

RUNTIME: MLX · Apple Silicon
ORCHESTRATOR: Node.js · WebSocket
MEMORY STORE: SQLite · 4 hemispheres
DEFAULT MODE: Sovereign · 100% local

A cognitive engine with four hemispheres.

Each hemisphere is a specialized memory and reasoning substrate. They share a cognitive bus. Together they replace the “forget everything every conversation” behavior that makes chatbots useless for serious work.

ZALVYUM KERNEL · COGNITIVE LAYER
HEM_01
VECTOR
Episodic memory

Cosine-similarity retrieval over every interaction, plus full-text search. The hippocampus: every line of code and every meeting summary stays retrievable.
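A minimal sketch of what cosine-similarity retrieval means in practice, assuming each stored interaction already has an embedding vector. The function names, the three-dimensional toy vectors, and the sample memories are hypothetical and exist only for illustration; this is not Zalvyum's implementation.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def top_k(query_vec, memories, k=2):
    """Return the k (score, text) pairs most similar to the query vector."""
    scored = [(cosine_similarity(query_vec, vec), text) for text, vec in memories]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return scored[:k]


# Hypothetical toy embeddings; real embeddings have hundreds of dimensions.
MEMORIES = [
    ("Fixed null pointer in invoice parser", [0.9, 0.1, 0.0]),
    ("Q3 client meeting summary", [0.0, 0.2, 0.9]),
    ("Refactored invoice export", [0.8, 0.3, 0.1]),
]
```

Calling `top_k([1.0, 0.0, 0.0], MEMORIES)` ranks the two invoice-related memories ahead of the meeting summary, because their vectors point in nearly the same direction as the query.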

HEM_02
GRAPH
Logical reasoning

Native GraphRAG. Converts data into Subject → Relationship → Object triples. Multi-hop reasoning across business entities, clients, dependencies.
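To make the triple model concrete, here is a small sketch of multi-hop reasoning as a breadth-first search over Subject → Relationship → Object triples. The entities, relationship names, and `find_path` helper are hypothetical; the actual GraphRAG engine is not documented here.

```python
from collections import defaultdict, deque

# Hypothetical business triples: (subject, relationship, object).
TRIPLES = [
    ("Acme Corp", "is_client_of", "Sales Team"),
    ("Acme Corp", "uses", "Billing Service"),
    ("Billing Service", "depends_on", "Tax Engine"),
    ("Tax Engine", "depends_on", "Rates Feed"),
]


def build_index(triples):
    """Map each subject to its outgoing (relationship, object) edges."""
    index = defaultdict(list)
    for subject, relationship, obj in triples:
        index[subject].append((relationship, obj))
    return index


def find_path(triples, start, goal, max_hops=4):
    """Breadth-first search; returns the shortest chain of triples or None."""
    index = build_index(triples)
    queue = deque([(start, [])])
    seen = {start}
    while queue:
        node, path = queue.popleft()
        if node == goal:
            return path
        if len(path) >= max_hops:
            continue
        for relationship, obj in index[node]:
            if obj not in seen:
                seen.add(obj)
                queue.append((obj, path + [(node, relationship, obj)]))
    return None
```

`find_path(TRIPLES, "Acme Corp", "Rates Feed")` returns a three-hop chain through the billing service and the tax engine, which is the kind of dependency question a flat vector search cannot answer on its own.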

HEM_03
PROCEDURAL
Metacognition

The cerebellum. Harvests technical failures into immutable axioms. Once a fix is learned, the matching axiom is applied on later runs so the same error does not recur.
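One way to picture an immutable axiom store, as a sketch only: each axiom is a frozen record keyed by an error signature, and the first recorded fix is never overwritten. The class names and the signature format are assumptions, not the product's data model.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Axiom:
    """An immutable lesson: when this error appears, apply this fix."""
    error_signature: str
    fix: str


class AxiomStore:
    def __init__(self):
        self._axioms = {}

    def learn(self, error_signature, fix):
        """Record a fix once; later calls for the same signature keep the original."""
        return self._axioms.setdefault(error_signature, Axiom(error_signature, fix))

    def lookup(self, error_signature):
        """Return the stored axiom for a signature, or None if nothing was learned."""
        return self._axioms.get(error_signature)
```

Before running a generated tool, an orchestrator could call `lookup` with the last failure's signature and apply the stored fix instead of retrying blindly.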

HEM_04
ADMINISTRATIVE
Structured vault

Agentic ERP/CRM engine. Dynamic JSON SQLite tables. Generates Markdown reports, tables, and structured exports on demand.
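A minimal sketch of the idea behind a structured vault: records with flexible JSON fields stored in SQLite and rendered as a Markdown table on demand. The `records` table, its columns, and the helper names are hypothetical; JSON is parsed in Python so the example does not depend on any SQLite extension.

```python
import json
import sqlite3


def create_vault(conn):
    """Create a hypothetical records table with a free-form JSON payload column."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS records ("
        "id INTEGER PRIMARY KEY, kind TEXT NOT NULL, payload TEXT NOT NULL)"
    )


def add_record(conn, kind, fields):
    """Store a record; fields is any JSON-serializable dict."""
    conn.execute(
        "INSERT INTO records (kind, payload) VALUES (?, ?)",
        (kind, json.dumps(fields)),
    )


def markdown_report(conn, kind, columns):
    """Render all records of one kind as a Markdown table; missing fields stay blank."""
    rows = conn.execute(
        "SELECT payload FROM records WHERE kind = ? ORDER BY id", (kind,)
    ).fetchall()
    lines = [
        "| " + " | ".join(columns) + " |",
        "| " + " | ".join("---" for _ in columns) + " |",
    ]
    for (payload,) in rows:
        fields = json.loads(payload)
        lines.append("| " + " | ".join(str(fields.get(c, "")) for c in columns) + " |")
    return "\n".join(lines)
```

Because the payload is schemaless, a new field such as a due date can be added to later records without a migration; older rows simply render an empty cell.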

RUNTIME · INFERENCE LAYER
MLX RUNTIME
Local model engine. Default: Qwen 2.5 7B. Runs natively on Apple Silicon unified memory.
NODE.JS ORCHESTRATOR
WebSocket task queue, process supervision, a V8 sandbox for self-generated (autopoietic) tools, and a 15-second SIGKILL guillotine on runaway processes.
SQLITE COGNITIVE STORE
Single-file durable storage. WAL mode. Hot vacuum during the REM cycle. Restore to new hardware in under one hour.
HARDWARE
APPLE SILICON · UNIFIED MEMORY · 16GB MIN · SCALES TO M-SERIES ULTRA
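For readers unfamiliar with the SQLite terms above, this sketch opens a database in write-ahead-log (WAL) mode and runs a prune-then-vacuum pass of the kind a nightly consolidation cycle might perform. The `interactions` table, its columns, and the retention cutoff are assumptions made for the example.

```python
import sqlite3


def open_store(path):
    """Open (or create) the database and switch it to WAL journaling.

    Returns the connection and the journal mode SQLite reports back.
    """
    conn = sqlite3.connect(path)
    mode = conn.execute("PRAGMA journal_mode=WAL").fetchone()[0]
    conn.execute(
        "CREATE TABLE IF NOT EXISTS interactions ("
        "id INTEGER PRIMARY KEY, created_at INTEGER NOT NULL, body TEXT NOT NULL)"
    )
    conn.commit()
    return conn, mode


def consolidate(conn, keep_after):
    """Delete interactions older than keep_after, then reclaim the freed pages."""
    removed = conn.execute(
        "DELETE FROM interactions WHERE created_at < ?", (keep_after,)
    ).rowcount
    conn.commit()            # VACUUM cannot run inside an open transaction
    conn.execute("VACUUM")
    return removed
```

WAL mode lets readers continue while a writer is active, which is why it suits a store that is queried during the day and compacted overnight. While the database is open, SQLite keeps `-wal` and `-shm` companion files next to the main file.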

Where every byte goes. And where it doesn’t.

The default mode is sovereign — cognitive processing happens entirely on customer hardware. Two opt-in modes extend this, on the operator’s explicit terms.

SOVEREIGN MODE
DEFAULT
Operator → WebSocket → Zalvyum Kernel → MLX (local) → Response
                                              ↕
                                          SQLite (local)

Zero outbound network calls. The kernel never reaches the public internet. Optimal for fully isolated deployments: regulated finance, healthcare, legal.

CONNECTED MODE
OPT-IN PER INTEGRATION
Operator → Kernel → MLX → Response
                  ↓
              Authorized integrations
              · ERP API (Tango, ContPaqi, SAP B1)
              · File storage
              · Scheduled exports

Cognitive work stays local. Outbound calls go only to integrations explicitly authorized during onboarding. No frontier model APIs are called.

AUGMENTED MODE
OPT-IN PER QUERY
Operator → Kernel ──→ MLX (local) ──┐
                       │                         ↓
                       ├──→ OpenAI ───────→ Synthesis
                       ├──→ Anthropic ───→ Synthesis
                       └──→ Google ──────→ Synthesis

The operator explicitly invokes Augmented Mode for a specific query. The query is sent to the selected frontier providers in parallel, and the local engine synthesizes the final response. The operator pays providers directly; Zymbiotech never brokers payments or stores frontier responses.
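The three modes can be read as one outbound-traffic rule: a host is reachable only if it was authorized at onboarding or named for the current query. The sketch below encodes that reading; the `EgressPolicy` class and the example hostnames are hypothetical, and the real enforcement point is not described in this document.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class EgressPolicy:
    """Outbound allow-list. An empty policy corresponds to Sovereign Mode."""
    authorized_integrations: frozenset = frozenset()

    def allows(self, host, query_providers=frozenset()):
        """Allow a host only if it is an onboarded integration (Connected Mode)
        or a provider the operator selected for this one query (Augmented Mode)."""
        return host in self.authorized_integrations or host in query_providers


# Hypothetical hosts, for illustration only.
sovereign = EgressPolicy()
connected = EgressPolicy(frozenset({"erp.example.internal"}))
```

With this shape, Augmented Mode never widens the standing allow-list: provider hosts are passed per call, so the next query starts from the default again.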

What happens between “Enter” and the answer.

01
SYNAPSE

Operator prompt + compressed history + visual context travel through WebSocket to the Node.js TaskQueueManager.

02
SEMAPHORE

The orchestrator evaluates system thermal state and routes: sequential lock for local MLX, parallel dispatch for cloud APIs if Augmented Mode is active.
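The routing rule in this step can be sketched with an asyncio lock: local generations take turns, while cloud calls for the same query fan out concurrently. This is a Python illustration of the pattern, not the Node.js orchestrator itself; the class names are hypothetical, the provider call is a stub, and the thermal-state check is left out.

```python
import asyncio


class LocalEngine:
    """Stub for the local model; tracks how many generations overlap."""

    def __init__(self):
        self.active = 0
        self.max_active = 0

    async def generate(self, prompt):
        self.active += 1
        self.max_active = max(self.max_active, self.active)
        await asyncio.sleep(0.01)      # stands in for inference time
        self.active -= 1
        return f"local:{prompt}"


async def call_provider(name, prompt):
    """Stub for a cloud provider call; no network traffic is made."""
    await asyncio.sleep(0.01)
    return f"{name}:{prompt}"


class Router:
    def __init__(self, engine):
        self._engine = engine
        self._lock = None              # created lazily inside the running event loop

    async def _local(self, prompt):
        async with self._lock:         # sequential lock: one local job at a time
            return await self._engine.generate(prompt)

    async def route(self, prompt, providers=()):
        if self._lock is None:
            self._lock = asyncio.Lock()
        jobs = [self._local(prompt)] + [call_provider(p, prompt) for p in providers]
        return await asyncio.gather(*jobs)   # results keep the order of jobs
```

Passing an empty `providers` list models a query without Augmented Mode: only the locked local path runs.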

03
COGNITION

The active engine queries the four hemispheres in parallel. Vector retrieves precedent. Graph reasons across relationships. Procedural applies axioms. Administrative pulls structured data.
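As an illustration of the parallel fan-out, the sketch below submits one query to four stub hemispheres on a thread pool and collects the results by name. The stub functions and their return strings are placeholders; how the kernel actually merges hemisphere output is not specified here.

```python
from concurrent.futures import ThreadPoolExecutor


# Placeholder hemisphere queries; each would hit its own store in a real system.
def query_vector(prompt):
    return f"precedent for {prompt!r}"


def query_graph(prompt):
    return f"relationships for {prompt!r}"


def query_procedural(prompt):
    return f"axioms for {prompt!r}"


def query_administrative(prompt):
    return f"structured data for {prompt!r}"


HEMISPHERES = {
    "vector": query_vector,
    "graph": query_graph,
    "procedural": query_procedural,
    "administrative": query_administrative,
}


def query_hemispheres(prompt):
    """Run all four hemisphere queries concurrently and return results keyed by name."""
    with ThreadPoolExecutor(max_workers=len(HEMISPHERES)) as pool:
        futures = {name: pool.submit(fn, prompt) for name, fn in HEMISPHERES.items()}
        return {name: future.result() for name, future in futures.items()}
```

Threads fit here because each lookup is dominated by I/O against a store; the total latency approaches that of the slowest hemisphere instead of the sum of all four.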

04
EXECUTION

If the response requires a tool, code is generated and run in a V8 virtual machine sandbox. 15-second SIGKILL on hanging processes. Async network calls capped at 10 seconds.

05
RESPONSE

The synthesized answer streams back to the operator via WebSocket. Tables, CSV exports, voice (Kokoro TTS), or plain Markdown depending on context.

06
CONSOLIDATION

At 03:00 the REM Sleep Protocol runs locally. New interactions are consolidated, failure logs are forged into new axioms, irrelevant memory is pruned, and SQLite is vacuumed.

Defense in depth, by design.

PROCESS ISOLATION

All generated code runs in detached process groups inside V8 virtual machine sandboxes. Network calls have a 10-second timeout. Hanging processes are SIGKILL’d at 15 seconds without operator intervention.
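The detached-process-group pattern can be sketched in a few lines. This is a Python stand-in for what the document attributes to the Node.js orchestrator, assuming a POSIX system: the child starts in its own session, and on timeout the whole group receives SIGKILL so that grandchildren cannot survive. The `run_isolated` name and return shape are hypothetical, and the V8 sandbox itself is not modeled.

```python
import os
import signal
import subprocess
import sys


def run_isolated(argv, timeout_s=15.0):
    """Run a command in its own process group; SIGKILL the group on timeout.

    Returns (returncode, stdout_bytes, was_killed).
    """
    proc = subprocess.Popen(
        argv,
        stdout=subprocess.PIPE,
        stderr=subprocess.PIPE,
        start_new_session=True,   # child becomes leader of a new session and group
    )
    try:
        out, _ = proc.communicate(timeout=timeout_s)
        return proc.returncode, out, False
    except subprocess.TimeoutExpired:
        try:
            # The group id equals the child's pid because it leads its own session.
            os.killpg(proc.pid, signal.SIGKILL)
        except ProcessLookupError:
            pass                  # the process exited between the timeout and the kill
        out, _ = proc.communicate()
        return proc.returncode, out, True
```

Killing the group instead of the single pid matters when generated code spawns helpers: a plain kill on the parent would leave those helpers running.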

KERNEL SHIELD

The system blocks write access to its own source code and core database files. Self-modification attempts are intercepted and denied at the kernel layer. The OS cannot mutate itself.
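A write guard of this kind usually comes down to resolving the target path and refusing anything inside a protected root. The sketch below shows that check; the `guard_write` function, the `WriteDenied` exception, and the example directories are hypothetical and say nothing about where Zalvyum actually stores its files.

```python
from pathlib import Path


class WriteDenied(PermissionError):
    """Raised when a write targets a protected location."""


def guard_write(target, protected_roots):
    """Return the resolved target path, or raise WriteDenied if it is protected.

    Resolving first collapses '..' segments and symlinks, so a path cannot
    sneak into a protected root through an indirect spelling.
    """
    resolved = Path(target).resolve()
    for root in protected_roots:
        root_resolved = Path(root).resolve()
        if resolved == root_resolved or root_resolved in resolved.parents:
            raise WriteDenied(f"write to {resolved} is blocked by {root_resolved}")
    return resolved


# Hypothetical protected locations, for illustration only.
PROTECTED = ["/opt/zalvyum/kernel", "/opt/zalvyum/data/core.db"]
```

A tool sandbox would call `guard_write` before opening any file for writing and surface `WriteDenied` to the operator instead of retrying.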

MEMORY PRIVACY LOCK

The REM consolidation cycle is forced through the local MLX engine. Sensitive business memory is never sent to a cloud model, even when Augmented Mode is enabled for the running query.

ZERO-TRUST REMOTE ACCESS

Optional encrypted tunnel (Cloudflare-style) lets the operator access their Zalvyum instance from anywhere via mobile or web. End-to-end encrypted. Zymbiotech cannot read your data through the tunnel.

What you need, by workload.

Zalvyum runs on Apple Silicon unified memory. RAM is the primary constraint: the model's key-value cache grows linearly with context length, so the usable local context window is bounded by available memory.
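A back-of-the-envelope calculation shows why context length tracks memory. The key-value cache stores one key and one value vector per layer, per KV head, per token. The shape numbers below (28 layers, 4 KV heads, head dimension 128, 16-bit values) are assumptions chosen to resemble a 7B-class model with grouped-query attention; check them against the model card before sizing hardware.

```python
def kv_cache_bytes(tokens, layers, kv_heads, head_dim, bytes_per_value=2):
    """Memory for keys and values across all layers for a given number of tokens."""
    return 2 * layers * kv_heads * head_dim * bytes_per_value * tokens


def max_context_tokens(budget_bytes, layers, kv_heads, head_dim, bytes_per_value=2):
    """Largest context that fits in a memory budget reserved for the KV cache."""
    per_token = kv_cache_bytes(1, layers, kv_heads, head_dim, bytes_per_value)
    return budget_bytes // per_token


# Assumed model shape (not taken from this document).
SHAPE = dict(layers=28, kv_heads=4, head_dim=128)

per_token = kv_cache_bytes(1, **SHAPE)        # 57,344 bytes, i.e. 56 KiB per token
cache_32k = kv_cache_bytes(32_768, **SHAPE)   # 1,879,048,192 bytes, i.e. 1.75 GiB
```

Under these assumptions, doubling the context doubles the cache, and that memory comes on top of the model weights and whatever the rest of the system needs.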

TIER  HARDWARE                     RAM      BEST FOR
MIN   Mac mini M1 / M2             16 GB    Small ops, 1–10 users, basic ERP queries
REC   Mac mini M2 Pro / M4         32 GB    SMBs of 10–25, multi-source queries, large context
PRO   Mac Studio M2 Max / M4 Max   64 GB    SMBs of 25–50, heavy multi-user, larger models
MAX   Mac Studio M2 / M3 Ultra     128 GB+  Multi-team, very large knowledge bases

We size this during onboarding based on your actual workload. We can supply pre-configured hardware as part of the deployment.

When things break.

AUTOMATIC BACKUPS

Hot SQLite snapshots every hour. Daily full backup to operator-controlled storage. Backups are encrypted at rest. Restore to new hardware in under 60 minutes.
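A hot snapshot of a live SQLite database can be taken with SQLite's online backup API, which Python exposes as `Connection.backup`. The sketch below copies a database while it remains open for use. The `snapshot` helper and file paths are hypothetical, and encryption at rest would be a separate step not shown here.

```python
import sqlite3


def snapshot(source_path, dest_path):
    """Copy a live SQLite database to dest_path using the online backup API."""
    src = sqlite3.connect(source_path)
    dst = sqlite3.connect(dest_path)
    try:
        src.backup(dst)    # produces a consistent copy even if other connections are writing
    finally:
        dst.close()
        src.close()
    return dest_path
```

Copying the file with an ordinary file copy is not a safe substitute while the database is in use, because recent writes may still sit in the write-ahead log.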

CRASH SUPERVISION

Node.js orchestrator supervises all subprocesses. Crashes are logged and surfaced via the procedural hemisphere, which forges new axioms to prevent recurrence.

MODEL FALLBACK

If the local model exceeds its context window or hits a 3-minute watchdog timeout, the orchestrator reports the constraint to the operator with the option to escalate to Augmented Mode.
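The fallback logic can be sketched as a pre-check plus a timeout wrapper. In this Python illustration the `Outcome` record, the token counts, and the `generate` callable are hypothetical; only the 3-minute default comes from the text above.

```python
import asyncio
from dataclasses import dataclass


@dataclass
class Outcome:
    status: str                # "ok", "context_exceeded", or "timeout"
    text: str = ""
    can_escalate: bool = False # whether to offer Augmented Mode to the operator


async def generate_with_watchdog(generate, prompt_tokens, context_limit, watchdog_s=180.0):
    """Run a local generation under a watchdog; report constraints instead of failing silently.

    generate is a zero-argument callable returning a coroutine that yields the answer text.
    """
    if prompt_tokens > context_limit:
        return Outcome("context_exceeded", can_escalate=True)
    try:
        text = await asyncio.wait_for(generate(), timeout=watchdog_s)
    except asyncio.TimeoutError:
        return Outcome("timeout", can_escalate=True)
    return Outcome("ok", text=text)
```

The function never escalates on its own: it returns a flag and leaves the decision to the operator, matching the opt-in-per-query rule for Augmented Mode.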

UPDATES

Kernel updates are installed manually by the operator. Nothing auto-updates without explicit approval. Updates are signed and verified before installation.

Want to go deeper?

Schedule a 30-minute call with the engineering team. We’ll cover architecture, security model, integration patterns, and answer any technical question you have. NDA available on request.