ZALVYUM-CORE v0.8.4 │ KERNEL · ONLINE │ ARCH: ARM64 · APPLE SILICON

The sovereign agentic OS, in technical detail.

Zalvyum is a local-first cognitive operating system with a quad-hemispheric memory architecture, deterministic process isolation, and zero cloud dependency by default. Built for evaluators who need to know what’s actually running.

Technical Q&A call → Read the architecture

RUNTIME

MLX · Apple Silicon

ORCHESTRATOR

Node.js · WebSocket

MEMORY STORE

SQLite · 4 hemispheres

DEFAULT MODE

Sovereign · 100% local

KERNEL ARCHITECTURE

A cognitive engine with four hemispheres.

Each hemisphere is a specialized memory and reasoning substrate. They share a cognitive bus. Together they replace the “forget everything every conversation” behavior that makes chatbots useless for serious work.

ZALVYUM KERNEL · COGNITIVE LAYER

HEM_01

VECTOR

Episodic memory

Cosine-similarity retrieval over every interaction. Full-text search. The hippocampus — never loses a line of code or a meeting summary.

HEM_02

GRAPH

Logical reasoning

Native GraphRAG. Converts data into Subject → Relationship → Object triples. Multi-hop reasoning across business entities, clients, dependencies.

HEM_03

PROCEDURAL

Metacognition

The cerebellum. Harvests technical failures into immutable axioms. Once it learns a fix, it rewires its logic and never repeats the error.

HEM_04

ADMINISTRATIVE

Structured vault

Agentic ERP/CRM engine. Dynamic JSON SQLite tables. Generates Markdown reports, tables, and structured exports on demand.

↓

RUNTIME · INFERENCE LAYER

MLX RUNTIME

Local model engine. Default: Qwen 2.5 7B. Runs natively on Apple Silicon unified memory.

NODE.JS ORCHESTRATOR

WebSocket task queue, process supervision, V8 sandbox for autopoietic tools, 15-second SIGKILL guillotine on runaway processes.

SQLITE COGNITIVE STORE

Single-file durable storage. WAL mode. Hot vacuum during REM cycle. Restore in under one hour to new hardware.

↓

HARDWARE

APPLE SILICON · UNIFIED MEMORY · 16GB MIN · SCALES TO M-SERIES ULTRA

DATA FLOW & SOVEREIGNTY

Where every byte goes. And where it doesn’t.

The default mode is sovereign — cognitive processing happens entirely on customer hardware. Two opt-in modes extend this, on the operator’s explicit terms.

SOVEREIGN MODE

DEFAULT

Operator → WebSocket → Zalvyum Kernel → MLX (local) → Response
                                              ↕
                                          SQLite (local)

Zero network calls. The kernel never reaches the public internet. Optimal for fully isolated deployments — regulated finance, healthcare, legal.

CONNECTED MODE

OPT-IN PER INTEGRATION

Operator → Kernel → MLX → Response
                  ↓
              Authorized integrations
              · ERP API (Tango, ContPaqi, SAP B1)
              · File storage
              · Scheduled exports

Cognitive work stays local. Outbound calls go only to integrations explicitly authorized during onboarding. No frontier model APIs called.

AUGMENTED MODE

OPT-IN PER QUERY

Operator → Kernel ──→ MLX (local) ──┐
                       │                         ↓
                       ├──→ OpenAI ───────→ Synthesis
                       ├──→ Anthropic ───→ Synthesis
                       └──→ Google ──────→ Synthesis

Operator explicitly invokes Augmented Mode for a specific query. The query is sent to selected frontier providers in parallel. Local engine synthesizes a final response. Operator pays providers directly — Zymbiotech never brokers payments or stores frontier responses.

REQUEST LIFECYCLE

What happens between “Enter” and the answer.

SYNAPSE

Operator prompt + compressed history + visual context travel through WebSocket to the Node.js TaskQueueManager.

SEMAPHORE

The orchestrator evaluates system thermal state and routes: sequential lock for local MLX, parallel dispatch for cloud APIs if Augmented Mode is active.

COGNITION

The active engine queries the four hemispheres in parallel. Vector retrieves precedent. Graph reasons across relationships. Procedural applies axioms. Administrative pulls structured data.

EXECUTION

If the response requires a tool, code is generated and run in a V8 virtual machine sandbox. 15-second SIGKILL on hanging processes. Async network calls capped at 10 seconds.

RESPONSE

The synthesized answer streams back to the operator via WebSocket. Tables, CSV exports, voice (Kokoro TTS), or plain Markdown depending on context.

CONSOLIDATION

At 03:00 AM the REM Sleep Protocol runs locally. New interactions are consolidated, failure logs forge new axioms, irrelevant memory is pruned, SQLite is vacuumed.

SECURITY ARCHITECTURE

Defense in depth, by design.

PROCESS ISOLATION

All generated code runs in detached process groups inside V8 virtual machine sandboxes. Network calls have a 10-second timeout. Hanging processes are SIGKILL’d at 15 seconds without operator intervention.

KERNEL SHIELD

The system blocks write access to its own source code and core database files. Self-modification attempts are intercepted and denied at the kernel layer. The OS cannot mutate itself.

MEMORY PRIVACY LOCK

The REM consolidation cycle is forced through the local MLX engine. Sensitive business memory is never sent to a cloud model, even when Augmented Mode is enabled for the running query.

ZERO-TRUST REMOTE ACCESS

Optional encrypted tunnel (Cloudflare-style) lets the operator access their Zalvyum instance from anywhere via mobile or web. End-to-end encrypted. Zymbiotech cannot read your data through the tunnel.

HARDWARE SIZING

What you need, by workload.

Zalvyum runs on Apple Silicon unified memory. RAM is the primary constraint. Local model context window scales linearly with available memory.

TIER

HARDWARE

RAM

BEST FOR

MIN

Mac mini M1 / M2

16 GB

Small ops, 1–10 users, basic ERP queries

REC

Mac mini M2 Pro / M4

32 GB

SMBs of 10–25, multi-source queries, large context

PRO

Mac Studio M2 Max / M4 Max

64 GB

SMBs of 25–50, heavy multi-user, larger models

MAX

Mac Studio M2 / M3 Ultra

128 GB+

Multi-team, very large knowledge bases

We size this during onboarding based on your actual workload. We can supply pre-configured hardware as part of the deployment.

RELIABILITY & RECOVERY

When things break.

AUTOMATIC BACKUPS

Hot SQLite snapshots every hour. Daily full backup to operator-controlled storage. Backups are encrypted at rest. Restore to new hardware in under 60 minutes.

CRASH SUPERVISION

Node.js orchestrator supervises all subprocesses. Crashes are logged and surfaced via the procedural hemisphere, which forges new axioms to prevent recurrence.

MODEL FALLBACK

If the local model exceeds context window or hits a 3-minute watchdog timeout, the orchestrator reports the constraint to the operator with the option to escalate to Augmented Mode.

UPDATES

Kernel updates are pushed manually by the operator. Nothing auto-updates without explicit approval. Updates are signed and verified before installation.

TECHNICAL EVALUATION

Want to go deeper?

Schedule a 30-minute call with the engineering team. We’ll cover architecture, security model, integration patterns, and answer any technical question you have. NDA available on request.

Schedule technical call → Apply for Early Access