Skip to content

Promise vs Code Scorecard

Transparency notice: this page lists every major claim HART OS makes and whether the codebase actually delivers it. OSS contributors and evaluators should be able to verify each item themselves.

Result: 18 / 20 DELIVERED, 2 PARTIAL

Scoring Key

Status Meaning
DELIVERED Feature exists, tests pass, code path exercised
PARTIAL Core logic exists but has a known gap (documented below)

The 20 Claims

1. Recipe Pattern (CREATE / REUSE)

Status: DELIVERED

  • create_recipe.py decomposes prompts into flows and actions, saves JSON recipes.
  • reuse_recipe.py replays saved recipes without LLM calls.
  • Storage: prompts/{prompt_id}_{flow_id}_recipe.json.
  • Tests: test_pipeline_lifecycle_functional.py (22 tests) validates save/load round-trip and state machine.

2. ActionState Machine

Status: DELIVERED

  • lifecycle_hooks.py implements ASSIGNED -> IN_PROGRESS -> STATUS_VERIFICATION_REQUESTED -> COMPLETED/ERROR -> TERMINATED.
  • Invalid transitions are rejected; thread-safe under concurrent access.
  • Tests: test_pipeline_lifecycle_functional.py covers valid paths, invalid transitions, and concurrent state changes.

3. 90/9/1 Revenue Split

Status: DELIVERED

  • Canonical constants in integrations/agent_engine/revenue_aggregator.py: REVENUE_SPLIT_USERS=0.90, REVENUE_SPLIT_INFRA=0.09, REVENUE_SPLIT_CENTRAL=0.01.
  • Imported by ad_service.py, hosting_reward_service.py, finance_tools.py.
  • Tests: test_revenue_functional.py (6 tests) validates the split math against real SQLite.

4. Spark Settlement & Metered API Cost Recovery

Status: DELIVERED

  • revenue_aggregator.py settle_metered_api_costs() converts USD to Spark at SPARK_PER_USD=100.
  • budget_gate.py records metered usage; MeteredAPIUsage table in models.py.
  • Tests: test_revenue_functional.py covers settlement, self-task skip, write-off, and env overrides.

5. Federation Protocol (Equal Weighting)

Status: DELIVERED

  • integrations/agent_engine/federated_aggregator.py uses log1p(interactions) with floor=1.0.
  • No tier multipliers -- data quality, not hardware, determines weight.
  • HMAC signing with tamper detection; stale/future delta rejection.
  • Tests: test_federation_functional.py (23 tests) covers 3-node convergence, domination prevention, guardrail hash enforcement.

6. EventBus (Local + WAMP Bridge)

Status: DELIVERED

  • core/platform/events.py with Crossbar WAMP bridge via autobahn.
  • Topic mapping: theme.changed <-> com.hartos.event.theme.changed.
  • _from_wamp flag prevents echo loops.
  • Tests: test_message_bus_functional.py (8 tests) covers pub/sub, wildcards, thread safety.

7. Guardrail Immutability

Status: DELIVERED

  • security/hive_guardrails.py -- 10-class guardrail network, _FrozenValues, module-level __setattr__ guard.
  • 33 constitutional rules, re-verified every 300 s.
  • Protected files list includes origin_attestation.py and LICENSE.
  • Tests: test_security_modules_functional.py validates hash chain integrity and tamper detection.

8. Master Key (Ed25519 Kill Switch)

Status: DELIVERED

  • security/master_key.py -- public key verification flow, boot signature check.
  • security/key_delegation.py -- 3-tier certificate chain (central -> regional -> local).
  • Private key excluded from all code paths (AI exclusion zone enforced in CLAUDE.md).

9. Input Sanitization & DLP

Status: DELIVERED

  • security/sanitize.py -- SQL-escape, HTML-escape, path-traversal stripping, input validation.
  • security/dlp_engine.py -- PII scan/redact (email, phone, SSN, credit card, IP).
  • Tests: test_security_modules_functional.py (141 tests) -- 7 security modules exercised.

10. Action Classifier (Destructive Detection)

Status: DELIVERED

  • security/action_classifier.py -- pattern matching for destructive commands (rm, mkfs, dd, kill).
  • PREVIEW_PENDING / APPROVED states for human confirmation.
  • Tests: 19 classifier tests + 3 preview edge-case tests in the security functional suite.

11. Immutable Audit Log

Status: DELIVERED

  • security/immutable_audit_log.py -- SHA-256 hash chain, AuditLogEntry table.
  • Deletion or modification breaks chain; tamper detection catches it.
  • Tests: genesis hash, chain continuation, deletion detection, 100-entry chain validation, detail-JSON redaction.

12. Channel-to-Device Control

Status: DELIVERED

  • integrations/channels/device_control_tool.py -- any channel adapter routes commands to the user's local device via PeerLink.
  • SAME_USER trust only; fleet-command fallback; embedded handler for GPIO/serial.
  • Tests: test_device_control_functional.py (16 tests).

Status: DELIVERED

  • core/peer_link/ -- 7 files. Trust levels: SAME_USER, PEER (E2E), RELAY (E2E).
  • Connection budget: flat=10, regional=50, central=200.
  • Wired into bootstrap, gossip, federation, compute_mesh, world_model_bridge.

14. Agent Personality & Resonance Tuning

Status: DELIVERED

  • core/agent_personality.py -- 37 tests. core/resonance_tuner.py -- EMA tuning, oscillation detection, federation delta export.
  • core/resonance_profile.py -- 8-dimensional continuous floats.
  • 100 tests total (63 resonance + 37 personality).

15. Better Tomorrow Seed Goal

Status: DELIVERED

  • integrations/agent_engine/goal_seeding.py line 812 -- bootstrap_better_tomorrow.
  • Scans community needs, developer requests, contributor wellbeing, environmental impact.
  • Scores by lives_impacted x urgency x feasibility / cost.
  • requires_human_approval: True -- never auto-spends. Humans decide.

16. Autonomous Upgrade Pipeline

Status: DELIVERED

  • upgrade_orchestrator.py -- 7-stage pipeline (BUILD -> TEST -> AUDIT -> BENCHMARK -> SIGN -> CANARY -> DEPLOY).
  • is_upgrade_safe() blocks on 5% regression.
  • hart-update-service.py -- OTA systemd service (daily).

17. Anti-Fork Protection (BSL-1.1 + Origin Attestation)

Status: DELIVERED

  • LICENSE -- BSL-1.1 with anti-rebranding, master key integrity clause. Auto-converts to Apache 2.0 on 2030-01-01.
  • security/origin_attestation.py -- cryptographic origin proof, brand marker verification, federation handshake attestation.
  • 34 tests in test_origin_attestation.py.

18. Compute Democracy (Logarithmic Scaling)

Status: DELIVERED

  • Federation uses log1p(interactions) so no single entity controls >5% influence.
  • Budget gate enforces per-model Spark costs; local models cost 0 Spark.
  • Hosting reward uses weighted scoring across GPU hours, inferences, energy, API costs.

19. VLM Computer-Use Loop

Status: PARTIAL

  • integrations/vision/local_loop.py -- Qwen 3.5 VLM with action parsing, bbox handling, safety-gate stubs.
  • Tests: test_vlm_loop_functional.py (17 tests) validates control flow and action parsing.
  • Gap: The iteration loop uses a fixed 30-iteration cap instead of a goal-aware termination condition. Redesign plan documented in MEMORY.md sections A2-A8.

20. Offline TTS (LuxTTS / Pocket TTS)

Status: PARTIAL

  • integrations/service_tools/luxtts_tool.py -- sherpa-onnx ZipVoice-Distill INT8, zero-shot voice cloning, 24 kHz output.
  • integrations/service_tools/pocket_tts_tool.py -- 8 built-in voices, in-process TTS.
  • Fallback chain in model_bus_service.py: MakeItTalk cloud -> Pocket TTS offline.
  • Gap: INT8 quantization produces audible quality artifacts in some voice-cloning scenarios. Under investigation.

Summary

Category Delivered Partial Total
Core pipeline 4 0 4
Economics & federation 3 0 3
Security 5 0 5
Infrastructure 4 0 4
AI capabilities 2 2 4
Total 18 2 20

The two PARTIAL items have working code paths and passing tests. The gaps are in output quality (TTS) and loop termination strategy (VLM), not missing implementations.