███╗   ██╗███████╗███╗   ███╗ ██████╗ ██████╗ ██╗███████╗
████╗  ██║██╔════╝████╗ ████║██╔═══██╗██╔══██╗██║██╔════╝
██╔██╗ ██║█████╗  ██╔████╔██║██║   ██║██████╔╝██║███████╗
██║╚██╗██║██╔══╝  ██║╚██╔╝██║██║   ██║██╔══██╗██║╚════██║
██║ ╚████║███████╗██║ ╚═╝ ██║╚██████╔╝██║  ██║██║███████║
╚═╝  ╚═══╝╚══════╝╚═╝     ╚═╝ ╚═════╝ ╚═╝  ╚═╝╚═╝╚══════╝

COMING SOON

PRE-LAUNCH ALPHA

The goal: Nemoris agents don't just respond — they remember, reflect, learn from mistakes, sleep and wake up sharper, keep their promises, and anticipate what you'll need before you ask.

Everything built so far:

Feature

Per-turn debug snapshots: every turn now stores a redacted snapshot of input, prompt assembly, context hydration, provider call, and output. Inspect with /trace input, /trace prompt, /trace output, or /trace json.

Feature

Terminal trace inspector: nemoris trace --view prompt|output|json --offset N for full turn replay from the CLI.

Feature

Gateway GET /trace endpoint: trace snapshots available over the local HTTP gateway for external tooling and eval pipelines.

Feature

Checklist reply resolution: structured task completion tracking with acceptance criteria for multi-step agent work.

Feature

Progress pings: long-running operations emit periodic status updates instead of going silent.

Feature

Skill audit log: lifecycle tracking for skill load, execution, and failure events.

Feature

Broadcast message tool: agents can message multiple peers in a single operation.

Feature

Job prep and manual job state: deterministic preparation steps before scheduled jobs, with manual trigger support.

Fix

Initiative engine scoring hardened: temporal pattern observation wired correctly, method name mismatch fixed, anticipatory task surfacing more reliable.

Fix

Stream buffer reliability: edge cases around partial tool-use blocks and message boundary detection resolved.

Improvement

Context inspector now falls back to latest stored trace snapshot when live context is unavailable.

Improvement

Public hygiene checker script added for pre-publish safety checks.

Improvement

3,049 tests passing across unit, reliability, and dogfood suites.

Feature

Streaming preview: progressive Telegram message updates during LLM generation. Rate-limited edits, cursor indicator, code fence safety.

Feature

User model extractor and store: adaptive behaviour based on learned user preferences, communication style, and working patterns.

Feature

Context composer: intelligent prompt assembly that optimises token budget across identity, memory, and conversation context.

Feature

Active thread state: tracks the current conversation thread for coherent multi-turn interactions.

Feature

Coaching clarification: when requests are ambiguous, the agent asks targeted questions instead of guessing.

Feature

Trace-based learning loop: mines turn traces to identify improvement opportunities and skill gaps.

Feature

Browse tool: agents can fetch and extract content from web pages directly.

Feature

Skill management tool: agents can list, inspect, and manage installed skills at runtime.

Feature

11 new bundled skills: agent-review, business-advisor, frontend-design, implementation-safety, lemonsqueezy, product-autopilot, reddit-engage, release-handoff, ux-flow-audit, verification-evidence, webapp-testing.

Feature

New operator commands: /think, /focus, /model, /recall for quick mode switches and history search.

Improvement

Removed OpenClaw delivery adapters — fully standalone runtime with no legacy dependencies.

Improvement

2,451 tests passing across unit, reliability, and dogfood suites.

Feature

Official Anthropic SDK: replaced hand-rolled HTTP/SSE with @anthropic-ai/sdk. Automatic client caching, typed error classification, and native streaming.

Feature

Unified auth-profiles: Anthropic API keys, setup tokens, and OAuth tokens all stored in a single auth-profiles.json with file-lock safety.

Fix

Model selector drill-down: tapping a provider group now correctly shows individual models instead of silently dropping the inline keyboard.

Fix

Sticky OAuth mode: #isOAuthMode() no longer reads stale disk profiles — only activates for bearer-style tokens, preventing unnecessary Claude Code identity prefix injection.

Fix

Auxiliary lane auth: sleep-cycle, turn evaluator, and session compactor now use the correct NEMORIS_ANTHROPIC_API_KEY environment variable.

Fix

Persisted model override cleared: a stale chat_sessions.model_override was silently routing all turns to Haiku instead of the configured Sonnet primary.

Improvement

SDK error classification maps 401, 429, 5xx, and connection errors to Nemoris recovery categories for smarter circuit-breaker behavior.
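
As a rough illustration of the mapping described above, the sketch below sorts an error into a recovery category by HTTP status. The category names and the `ProviderError` shape are hypothetical; the changelog does not list the real ones.

```typescript
// Hypothetical category names, chosen only for this sketch.
type RecoveryCategory = "auth" | "rate_limit" | "provider_outage" | "network" | "unknown";

// Hypothetical error shape: an HTTP status, or a flag for connection-level failures.
interface ProviderError {
  status?: number;
  isConnectionError?: boolean;
}

function classifyError(err: ProviderError): RecoveryCategory {
  if (err.isConnectionError) return "network";
  if (err.status === 401) return "auth";
  if (err.status === 429) return "rate_limit";
  if (err.status !== undefined && err.status >= 500 && err.status <= 599) {
    return "provider_outage";
  }
  return "unknown";
}
```

A circuit breaker could then treat `rate_limit` and `provider_outage` as retryable with backoff, while `auth` would fail fast, though that policy is an assumption rather than something the entry states.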

Feature

Tiered Cognitive Memory: Implementation of CoALA-inspired Working, Semantic, Episodic, and Procedural memory layers.

Feature

Autonomous Sleep Cycles: Runtime-led memory consolidation, reflection synthesis, and daily planning during quiet hours.

Feature

Temporal Pattern Learning: Active mining of episodic streams to detect and anticipate recurring user needs.

Feature

Commitment Ledger: First-class tracking of user promises, pending obligations, and proactive follow-up triggers.

Feature

Reflective Learning: Synthetic insight generation that promotes raw observations into durable semantic facts.

Feature

Procedural Store: Verified skill library with lifecycle tracking (generate → test → verify → store).

Security

Cross-agent memory isolation and protected config paths hardened for stable multi-agent environments.

Improvement

Circadian Adaptation: Context-aware behavior modulation based on time-of-day and user communication norms.

Improvement

Idle-Time Maintenance: Curiosity runtime performs non-blocking session compaction and skill discovery when the operator is away.

Feature

Anticipatory Intelligence: CommitmentLedger, TemporalPatternDetector, and InitiativeEngine surface overdue and upcoming tasks proactively.

Feature

Procedural Learning: ProceduralStore, ReflexionMemory, and EvalRubric for verified skill generation and self-critique.

Feature

23 bundled starter skills covering code review, deployment, research, and common operator workflows.

Feature

Context7 MCP server wired as default for all agents — live documentation lookup in every turn.

Feature

New operator commands: /mind, /goals, /commitments, /sleep, /reflections, and /cost.

Feature

35 built-in tools (up from 30) and multi-bot Telegram support.

Fix

Streaming fixes: tool_use block sanitisation and input_json_delta accumulation for partial tool calls.
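
To make the accumulation step concrete, here is a minimal sketch that joins `input_json_delta` fragments per content block and parses them only once the block closes. The event shapes follow the Anthropic streaming format; `collectToolCalls` and `ToolCall` are hypothetical names, not the Nemoris implementation.

```typescript
// Simplified subset of the streaming events relevant to tool calls.
type StreamEvent =
  | { type: "content_block_start"; index: number; content_block: { type: string; id?: string; name?: string } }
  | { type: "content_block_delta"; index: number; delta: { type: string; partial_json?: string } }
  | { type: "content_block_stop"; index: number };

interface ToolCall {
  id: string;
  name: string;
  input: unknown;
}

function collectToolCalls(events: StreamEvent[]): ToolCall[] {
  // Buffers for tool_use blocks that have started but not yet stopped.
  const pending = new Map<number, { id: string; name: string; json: string }>();
  const done: ToolCall[] = [];
  for (const ev of events) {
    if (ev.type === "content_block_start") {
      if (ev.content_block.type === "tool_use") {
        pending.set(ev.index, { id: ev.content_block.id ?? "", name: ev.content_block.name ?? "", json: "" });
      }
    } else if (ev.type === "content_block_delta") {
      const block = pending.get(ev.index);
      if (block && ev.delta.type === "input_json_delta") {
        block.json += ev.delta.partial_json ?? "";
      }
    } else {
      const block = pending.get(ev.index);
      if (block) {
        pending.delete(ev.index);
        // An empty buffer means the tool was called without arguments.
        done.push({ id: block.id, name: block.name, input: block.json === "" ? {} : JSON.parse(block.json) });
      }
    }
  }
  // Blocks that never received a stop event are dropped rather than parsed.
  return done;
}
```

Dropping unterminated blocks is one possible reading of "sanitisation"; a real runtime might instead surface them as errors.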

Fix

Cross-agent error routing: failures now route to the correct agent session instead of leaking across contexts.

Feature

Turn Evaluator: lightweight Haiku quality gate after heavy turns, with /eval command to toggle per-agent.

Feature

Cross-agent context isolation: session IDs scoped per agent, preventing memory bleed in single-bot mode.

Feature

Soft cap with wrap-up nudge: agents summarise at 30 tool calls instead of hard-stopping.

Feature

Sprint contracts: acceptance criteria on commitments for structured task completion.

Improvement

Context reset on task switch: fresh session when switching between agents.

Improvement

Shell exec timeout raised from 5s to 30s default, configurable up to 5 minutes.
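
The default and ceiling above can be expressed as a small clamp. This is a sketch only; `resolveExecTimeout` is a hypothetical helper and the real configuration key is not named in the changelog.

```typescript
const DEFAULT_TIMEOUT_MS = 30 * 1000; // 30s default, per the entry above
const MAX_TIMEOUT_MS = 5 * 60 * 1000; // 5 minute ceiling, per the entry above

// Hypothetical helper: pick the effective timeout for a shell command.
function resolveExecTimeout(requestedMs?: number): number {
  if (requestedMs === undefined || !Number.isFinite(requestedMs) || requestedMs <= 0) {
    return DEFAULT_TIMEOUT_MS;
  }
  return Math.min(requestedMs, MAX_TIMEOUT_MS);
}
```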

Fix

Eval pattern narrowed: Python scripts no longer blocked by security policy.

Fix

Ollama token counting fixed: prompt_eval_count and eval_count now tracked correctly.
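
For context, Ollama reports token counts on the final response object as `prompt_eval_count` (input) and `eval_count` (output). The sketch below normalises them into a usage record; `TokenUsage` and `usageFromOllama` are hypothetical names, and treating a missing count as zero is an assumption.

```typescript
// Only the fields this sketch needs from Ollama's final response object.
interface OllamaFinalChunk {
  done: boolean;
  prompt_eval_count?: number;
  eval_count?: number;
}

interface TokenUsage {
  inputTokens: number;
  outputTokens: number;
  totalTokens: number;
}

function usageFromOllama(chunk: OllamaFinalChunk): TokenUsage {
  // Assumption: a missing count is recorded as zero instead of failing the turn.
  const inputTokens = chunk.prompt_eval_count ?? 0;
  const outputTokens = chunk.eval_count ?? 0;
  return { inputTokens, outputTokens, totalTokens: inputTokens + outputTokens };
}
```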

Improvement

Compaction thresholds raised to 0.80/0.92 for Sonnet 4.6; persona continuity preserved across summaries.
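
The entry does not say what each threshold triggers. One plausible reading, sketched below purely as an assumption, is that 0.80 of the context window starts compaction and 0.92 forces it. `compactionAction` and its return values are hypothetical.

```typescript
type CompactionAction = "none" | "compact" | "force_compact";

const SOFT_THRESHOLD = 0.8;  // assumed meaning: begin compacting
const HARD_THRESHOLD = 0.92; // assumed meaning: compact before the next turn

function compactionAction(usedTokens: number, contextWindow: number): CompactionAction {
  const ratio = usedTokens / contextWindow;
  if (ratio >= HARD_THRESHOLD) return "force_compact";
  if (ratio >= SOFT_THRESHOLD) return "compact";
  return "none";
}
```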

Fix

Ollama reasoning trace filter: <think> blocks stripped from user-facing responses.
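
A filter of this kind can be as small as one regular expression. The sketch below is not the Nemoris code; `stripThinkBlocks` is a hypothetical name, and it only handles complete `<think>...</think>` pairs.

```typescript
// Remove every complete <think>...</think> block, including multi-line ones.
function stripThinkBlocks(text: string): string {
  return text.replace(/<think>[\s\S]*?<\/think>/g, "").trim();
}
```

A streaming implementation would also need to deal with a `<think>` tag whose closing tag has not arrived yet, which this sketch leaves out.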

Fix

Context replay sanitisation: tool_use blocks stripped from replayed turns.

Feature

Identity Interview: 5-archetype onboarding that generates personalised SOUL.md + USER.md + OPERATING.md — no LLM needed.

Feature

agentskills.io Importer: import/export skills from the 69K+ ecosystem via nemoris skill import.

Improvement

2,397 tests passing across unit, reliability, and dogfood suites — zero regressions.

Feature

Organic intelligence v0.2: trust progression system with 5 trust levels and proactive triggers wired into the runtime.

Feature

4-suite reliability harness: 340 deterministic dogfood tests covering durability, self-healing, delivery, transport, and provider failover.

Feature

Dogfood lifecycle harness: 10-phase end-to-end smoke test for full runtime verification.

Feature

Proactive triggers: the runtime initiates agent turns based on time, memory, and trust context without user prompting.

Improvement

Reliability tests are now the required CI gate; unit tests are informational.

Fix

SQLite handle cleanup in tests to prevent EBUSY on Windows.

Fix

Replaced vendored smol-toml tarball with registry dependency — fixes broken npm install for consumers.

Security

Sanitized test paths and hardened .gitignore. Removed internal planning docs from tracking.

Improvement

Windows support: Ollama install detection and cross-platform path handling.

Feature

48h raw context window with point-in-time snapshots and rollback.

Feature

Curiosity Engine: idle-time memory deduplication, session compaction, and skill proposals.

Feature

Frustration detection: agents halt on error loops (3+ same error) and ask for help.
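
The "3+ same error" rule could be tracked with a simple streak counter. The sketch below assumes the errors must be consecutive and identical, which the entry does not spell out; `createErrorLoopDetector` is a hypothetical name.

```typescript
function createErrorLoopDetector(threshold = 3) {
  let lastError: string | null = null;
  let streak = 0;
  return {
    // Pass the error message of a failed step, or null after a successful one.
    // Returns true when the agent should halt and ask for help.
    record(error: string | null): boolean {
      if (error === null) {
        lastError = null;
        streak = 0;
        return false;
      }
      streak = error === lastError ? streak + 1 : 1;
      lastError = error;
      return streak >= threshold;
    },
  };
}
```

A success or a different error resets the streak, so only a genuine loop on the same failure trips the halt.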

Feature

Interrupt responsiveness: /stop halts mid-flight operations within 2-3 seconds.

Feature

Preference learning: approve an action once and the gate skips it next time; approvals persist across restarts.

Feature

Scope escalation simplified: /approve /path grants read/write access persistently.

Feature

Workflow engine: TOML pipelines with approval gates, resume-on-restart, and sandboxed interpolation.

Feature

Flight recorder: TurnTrace SQLite logging with /trace for turn replay and search.

Feature

Patch generation: apply_patch and generate_patch tools for atomic file updates.

Feature

Structure config, swap providers, and patch live settings without restarting the runtime.

Improvement

JIT tool loading: tools load on demand instead of pre-compiling at runtime start for faster startup.

Feature

Local-first doctor diagnoses Full Disk Access permissions, port conflicts, and system health.

Fix

Six wiring fixes across the approval gate, tool context, turn traces, auto-resume, OpenRouter, and inline approvals.

Feature

Telegram inline approval buttons for quick approve and deny without context switching.

Feature

MessageQueue delivery modes: debounce, immediate, and batch cadence per chat.

Feature

SKILL.md open standard: compatible with Claude Code, Cursor, and Codex CLI. Skills in ~/.claude/skills/ work across tools.

Feature

Bundled browser skill: agent-browser CLI for web automation, screenshots, form filling, and content extraction.

Feature

nemoris dogfood: 49-check runtime verification CLI. Zero API calls, zero tokens. JSON mode for CI.

Feature

Learning loop: SelfCritic scores every turn, PatternLedger detects recurring requests, SkillProposer generates skills with operator approval.

Feature

35 built-in tools including rollback, show_changes, request_tool (JIT discovery), create_agent, create_skill, and create_mcp.

Feature

Core architecture: Active Memory, Delivery Guarantees, Task Contracts.

Feature

Providers: Anthropic (direct + prompt caching), OpenRouter (100+ models), Ollama (local).

Feature

Telegram integration: slash commands, reactions, vision, inline keyboards.

Feature

Self-healing Nurse system: health probes, automatic repair, rule promotion.

Feature

Exec approval gate: human-in-the-loop for shell commands.

Feature

MCP consumer: connect external MCP servers as native tools via config/mcp.toml.

Feature

Session search: FTS5 full-text search across conversation history.

Feature

Context compaction: DAG-based session summarisation.

Feature

Active recall: semantic memory with salience scoring and embeddings.

Feature

Multi-agent: task contract triggers and completion pings.

Feature

Scheduled jobs: cron-triggered and ad-hoc in unified queue.

Feature

Cross-platform: macOS (launchd), Linux (systemd), Windows (PM2).

Feature

Interactive setup wizard with provider OAuth, Telegram wiring, and model selection.

Feature

Migration CLI: nemoris migrate imports agents, jobs, and memory from prior runtimes.

Security

SSRF protection on all URL-intake surfaces.
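
As a simplified illustration of what an SSRF check on a URL-intake surface might involve, the sketch below rejects non-HTTP schemes, localhost names, IPv6 literals, and private or link-local IPv4 literals. It is a hypothetical helper, not the Nemoris check, and it does not resolve DNS, so a hostname that points at an internal address would still pass.

```typescript
// True for loopback, private, link-local, and "this network" IPv4 literals.
function isPrivateIPv4(host: string): boolean {
  const m = /^(\d{1,3})\.(\d{1,3})\.(\d{1,3})\.(\d{1,3})$/.exec(host);
  if (!m) return false;
  const a = Number(m[1]);
  const b = Number(m[2]);
  return (
    a === 0 ||
    a === 10 ||
    a === 127 ||
    (a === 169 && b === 254) ||
    (a === 172 && b >= 16 && b <= 31) ||
    (a === 192 && b === 168)
  );
}

function isUrlAllowed(raw: string): boolean {
  let url: URL;
  try {
    url = new URL(raw);
  } catch {
    return false; // not a parseable absolute URL
  }
  if (url.protocol !== "http:" && url.protocol !== "https:") return false;
  const host = url.hostname.toLowerCase();
  if (host === "localhost" || host.endsWith(".localhost")) return false;
  if (host.startsWith("[")) return false; // IPv6 literals are rejected wholesale in this sketch
  return !isPrivateIPv4(host);
}
```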

Security

Input sanitisation with injection detection and boundary tagging.

Feature

Per-turn debug snapshots: every turn now stores a redacted snapshot of input, prompt assembly, context hydration, provider call, and output. Inspect with /trace input, /trace prompt, /trace output, or /trace json.

Feature

Terminal trace inspector: nemoris trace --view prompt|output|json --offset N for full turn replay from the CLI.

Feature

Gateway GET /trace endpoint: trace snapshots available over the local HTTP gateway for external tooling and eval pipelines.

Feature

Checklist reply resolution: structured task completion tracking with acceptance criteria for multi-step agent work.

Feature

Progress pings: long-running operations emit periodic status updates instead of going silent.

Feature

Skill audit log: lifecycle tracking for skill load, execution, and failure events.

Feature

Broadcast message tool: agents can message multiple peers in a single operation.

Feature

Job prep and manual job state: deterministic preparation steps before scheduled jobs, with manual trigger support.

Fix

Initiative engine scoring hardened: temporal pattern observation wired correctly, method name mismatch fixed, anticipatory task surfacing more reliable.

Fix

Stream buffer reliability: edge cases around partial tool-use blocks and message boundary detection resolved.

Improvement

Context inspector now falls back to latest stored trace snapshot when live context is unavailable.

Improvement

Public hygiene checker script added for pre-publish safety checks.

Improvement

3,049 tests passing across unit, reliability, and dogfood suites.

Feature

Streaming preview: progressive Telegram message updates during LLM generation. Rate-limited edits, cursor indicator, code fence safety.

Feature

User model extractor and store: adaptive behaviour based on learned user preferences, communication style, and working patterns.

Feature

Context composer: intelligent prompt assembly that optimises token budget across identity, memory, and conversation context.

Feature

Active thread state: tracks the current conversation thread for coherent multi-turn interactions.

Feature

Coaching clarification: when requests are ambiguous, the agent asks targeted questions instead of guessing.

Feature

Trace-based learning loop: mines turn traces to identify improvement opportunities and skill gaps.

Feature

Browse tool: agents can fetch and extract content from web pages directly.

Feature

Skill management tool: agents can list, inspect, and manage installed skills at runtime.

Feature

11 new bundled skills: agent-review, business-advisor, frontend-design, implementation-safety, lemonsqueezy, product-autopilot, reddit-engage, release-handoff, ux-flow-audit, verification-evidence, webapp-testing.

Feature

New operator commands: /think, /focus, /model, /recall for quick mode switches and history search.

Improvement

Removed OpenClaw delivery adapters — fully standalone runtime with no legacy dependencies.

Improvement

2,451 tests passing across unit, reliability, and dogfood suites.

Feature

Official Anthropic SDK: replaced hand-rolled HTTP/SSE with @anthropic-ai/sdk. Automatic client caching, typed error classification, and native streaming.

Feature

Unified auth-profiles: Anthropic API keys, setup tokens, and OAuth tokens all stored in a single auth-profiles.json with file-lock safety.

Fix

Model selector drill-down: tapping a provider group now correctly shows individual models instead of silently dropping the inline keyboard.

Fix

Sticky OAuth mode: #isOAuthMode() no longer reads stale disk profiles — only activates for bearer-style tokens, preventing unnecessary Claude Code identity prefix injection.

Fix

Auxiliary lane auth: sleep-cycle, turn evaluator, and session compactor now use the correct NEMORIS_ANTHROPIC_API_KEY environment variable.

Fix

Persisted model override cleared: a stale chat_sessions.model_override was silently routing all turns to Haiku instead of the configured Sonnet primary.

Improvement

SDK error classification maps 401, 429, 5xx, and connection errors to Nemoris recovery categories for smarter circuit-breaker behavior.

Feature

Tiered Cognitive Memory: Implementation of CoALA-inspired Working, Semantic, Episodic, and Procedural memory layers.

Feature

Autonomous Sleep Cycles: Runtime-led memory consolidation, reflection synthesis, and daily planning during quiet hours.

Feature

Temporal Pattern Learning: Active mining of episodic streams to detect and anticipate recurring user needs.

Feature

Commitment Ledger: First-class tracking of user promises, pending obligations, and proactive follow-up triggers.

Feature

Reflective Learning: Synthetic insight generation that promotes raw observations into durable semantic facts.

Feature

Procedural Store: Verified skill library with lifecycle tracking (generate → test → verify → store).

Security

Cross-agent memory isolation and protected config paths hardened for stable multi-agent environments.

Improvement

Circadian Adaptation: Context-aware behavior modulation based on time-of-day and user communication norms.

Improvement

Idle-Time Maintenance: Curiosity runtime performs non-blocking session compaction and skill discovery when the operator is away.

Feature

Anticipatory Intelligence: CommitmentLedger, TemporalPatternDetector, and InitiativeEngine surface overdue and upcoming tasks proactively.

Feature

Procedural Learning: ProceduralStore, ReflexionMemory, and EvalRubric for verified skill generation and self-critique.

Feature

23 bundled starter skills covering code review, deployment, research, and common operator workflows.

Feature

Context7 MCP server wired as default for all agents — live documentation lookup in every turn.

Feature

New operator commands: /mind, /goals, /commitments, /sleep, /reflections, and /cost.

Feature

35 built-in tools (up from 30) and multi-bot Telegram support.

Fix

Streaming fixes: tool_use block sanitisation and input_json_delta accumulation for partial tool calls.

Fix

Cross-agent error routing: failures now route to the correct agent session instead of leaking across contexts.

Feature

Turn Evaluator: lightweight Haiku quality gate after heavy turns, with /eval command to toggle per-agent.

Feature

Cross-agent context isolation: session IDs scoped per agent, preventing memory bleed in single-bot mode.

Feature

Soft cap with wrap-up nudge: agents summarise at 30 tool calls instead of hard-stopping.

Feature

Sprint contracts: acceptance criteria on commitments for structured task completion.

Improvement

Context reset on task switch: fresh session when switching between agents.

Improvement

Shell exec timeout raised from 5s to 30s default, configurable up to 5 minutes.

Fix

Eval pattern narrowed: Python scripts no longer blocked by security policy.

Fix

Ollama token counting fixed: prompt_eval_count and eval_count now tracked correctly.

Improvement

Compaction thresholds raised to 0.80/0.92 for Sonnet 4.6; persona continuity preserved across summaries.

Fix

Ollama reasoning trace filter: <think> blocks stripped from user-facing responses.

Fix

Context replay sanitisation: tool_use blocks stripped from replayed turns.

Feature

Identity Interview: 5-archetype onboarding that generates personalised SOUL.md + USER.md + OPERATING.md — no LLM needed.

Feature

agentskills.io Importer: import/export skills from the 69K+ ecosystem via nemoris skill import.

Improvement

2,397 tests passing across unit, reliability, and dogfood suites — zero regressions.

Feature

Organic intelligence v0.2: trust progression system with 5 trust levels and proactive triggers wired into the runtime.

Feature

4-suite reliability harness: 340 deterministic dogfood tests covering durability, self-healing, delivery, transport, and provider failover.

Feature

Dogfood lifecycle harness: 10-phase end-to-end smoke test for full runtime verification.

Feature

Proactive triggers: the runtime initiates agent turns based on time, memory, and trust context without user prompting.

Improvement

Reliability tests are now the required CI gate; unit tests are informational.

Fix

SQLite handle cleanup in tests to prevent EBUSY on Windows.

Fix

Replaced vendored smol-toml tarball with registry dependency — fixes broken npm install for consumers.

Security

Sanitized test paths and hardened .gitignore. Removed internal planning docs from tracking.

Improvement

Windows support: Ollama install detection and cross-platform path handling.

Feature

48h raw context window with point-in-time snapshots and rollback.

Feature

Curiosity Engine: idle-time memory deduplication, session compaction, and skill proposals.

Feature

Frustration detection: agents halt on error loops (3+ same error) and ask for help.

Feature

Interrupt responsiveness: /stop halts mid-flight operations within 2-3 seconds.

Feature

Preference learning: approve once, gate skips next time, persists across restarts.

Feature

Scope escalation simplified: /approve /path grants read/write access persistently.

Feature

Workflow engine: TOML pipelines with approval gates, resume-on-restart, and sandboxed interpolation.

Feature

Flight recorder: TurnTrace SQLite logging with /trace for turn replay and search.

Feature

Patch generation: apply_patch and generate_patch tools for atomic file updates.

Feature

Structure config, swap providers, and patch live settings without restarting the runtime.

Improvement

JIT tool loading: tools load on demand instead of pre-compiling at runtime start for faster startup.

Feature

Local-first doctor diagnoses Full Disk Access permissions, port conflicts, and system health.

Fix

Six wiring fixes across the approval gate, tool context, turn traces, auto-resume, OpenRouter, and inline approvals.

Feature

Telegram inline approval buttons for quick approve and deny without context switching.

Feature

MessageQueue delivery modes: debounce, immediate, and batch cadence per chat.

Feature

SKILL.md open standard: compatible with Claude Code, Cursor, and Codex CLI. Skills in ~/.claude/skills/ work across tools.

Feature

Bundled browser skill: agent-browser CLI for web automation, screenshots, form filling, and content extraction.

Feature

nemoris dogfood: 49-check runtime verification CLI. Zero API calls, zero tokens. JSON mode for CI.

Feature

Learning loop: SelfCritic scores every turn, PatternLedger detects recurring requests, SkillProposer generates skills with operator approval.

Feature

35 built-in tools including rollback, show_changes, request_tool (JIT discovery), create_agent, create_skill, and create_mcp.

Feature

Core architecture: Active Memory, Delivery Guarantees, Task Contracts.

Feature

Providers: Anthropic (direct + prompt caching), OpenRouter (100+ models), Ollama (local).

Feature

Telegram integration: slash commands, reactions, vision, inline keyboards.

Feature

Self-healing Nurse system: health probes, automatic repair, rule promotion.

Feature

Exec approval gate: human-in-the-loop for shell commands.

Feature

MCP consumer: connect external MCP servers as native tools via config/mcp.toml.

Feature

Session search: FTS5 full-text search across conversation history.

Feature

Context compaction: DAG-based session summarisation.

Feature

Active recall: semantic memory with salience scoring and embeddings.

Feature

Multi-agent: task contract triggers and completion pings.

Feature

Scheduled jobs: cron-triggered and ad-hoc in unified queue.

Feature

Cross-platform: macOS (launchd), Linux (systemd), Windows (PM2).

Feature

Interactive setup wizard with provider OAuth, Telegram wiring, and model selection.

Feature

Migration CLI: nemoris migrate imports agents, jobs, and memory from prior runtimes.

Security

SSRF protection on all URL-intake surfaces.

Security

Input sanitisation with injection detection and boundary tagging.

Feature

Per-turn debug snapshots: every turn now stores a redacted snapshot of input, prompt assembly, context hydration, provider call, and output. Inspect with /trace input, /trace prompt, /trace output, or /trace json.

Feature

Terminal trace inspector: nemoris trace --view prompt|output|json --offset N for full turn replay from the CLI.

Feature

Gateway GET /trace endpoint: trace snapshots available over the local HTTP gateway for external tooling and eval pipelines.

Feature

Checklist reply resolution: structured task completion tracking with acceptance criteria for multi-step agent work.

Feature

Progress pings: long-running operations emit periodic status updates instead of going silent.

Feature

Skill audit log: lifecycle tracking for skill load, execution, and failure events.

Feature

Broadcast message tool: agents can message multiple peers in a single operation.

Feature

Job prep and manual job state: deterministic preparation steps before scheduled jobs, with manual trigger support.

Fix

Initiative engine scoring hardened: temporal pattern observation wired correctly, method name mismatch fixed, anticipatory task surfacing more reliable.

Fix

Stream buffer reliability: edge cases around partial tool-use blocks and message boundary detection resolved.

Improvement

Context inspector now falls back to latest stored trace snapshot when live context is unavailable.

Improvement

Public hygiene checker script added for pre-publish safety checks.

Improvement

3,049 tests passing across unit, reliability, and dogfood suites.

Feature

Streaming preview: progressive Telegram message updates during LLM generation. Rate-limited edits, cursor indicator, code fence safety.

Feature

User model extractor and store: adaptive behaviour based on learned user preferences, communication style, and working patterns.

Feature

Context composer: intelligent prompt assembly that optimises token budget across identity, memory, and conversation context.

Feature

Active thread state: tracks the current conversation thread for coherent multi-turn interactions.

Feature

Coaching clarification: when requests are ambiguous, the agent asks targeted questions instead of guessing.

Feature

Trace-based learning loop: mines turn traces to identify improvement opportunities and skill gaps.

Feature

Browse tool: agents can fetch and extract content from web pages directly.

Feature

Skill management tool: agents can list, inspect, and manage installed skills at runtime.

Feature

11 new bundled skills: agent-review, business-advisor, frontend-design, implementation-safety, lemonsqueezy, product-autopilot, reddit-engage, release-handoff, ux-flow-audit, verification-evidence, webapp-testing.

Feature

New operator commands: /think, /focus, /model, /recall for quick mode switches and history search.

Improvement

Removed OpenClaw delivery adapters — fully standalone runtime with no legacy dependencies.

Improvement

2,451 tests passing across unit, reliability, and dogfood suites.

Feature

Official Anthropic SDK: replaced hand-rolled HTTP/SSE with @anthropic-ai/sdk. Automatic client caching, typed error classification, and native streaming.

Feature

Unified auth-profiles: Anthropic API keys, setup tokens, and OAuth tokens all stored in a single auth-profiles.json with file-lock safety.

Fix

Model selector drill-down: tapping a provider group now correctly shows individual models instead of silently dropping the inline keyboard.

Fix

Sticky OAuth mode: #isOAuthMode() no longer reads stale disk profiles — only activates for bearer-style tokens, preventing unnecessary Claude Code identity prefix injection.

Fix

Auxiliary lane auth: sleep-cycle, turn evaluator, and session compactor now use the correct NEMORIS_ANTHROPIC_API_KEY environment variable.

Fix

Persisted model override cleared: a stale chat_sessions.model_override was silently routing all turns to Haiku instead of the configured Sonnet primary.

Improvement

SDK error classification maps 401, 429, 5xx, and connection errors to Nemoris recovery categories for smarter circuit-breaker behavior.

Feature

Tiered Cognitive Memory: Implementation of CoALA-inspired Working, Semantic, Episodic, and Procedural memory layers.

Feature

Autonomous Sleep Cycles: Runtime-led memory consolidation, reflection synthesis, and daily planning during quiet hours.

Feature

Temporal Pattern Learning: Active mining of episodic streams to detect and anticipate recurring user needs.

Feature

Commitment Ledger: First-class tracking of user promises, pending obligations, and proactive follow-up triggers.

Feature

Reflective Learning: Synthetic insight generation that promotes raw observations into durable semantic facts.

Feature

Procedural Store: Verified skill library with lifecycle tracking (generate → test → verify → store).

Security

Cross-agent memory isolation and protected config paths hardened for stable multi-agent environments.

Improvement

Circadian Adaptation: Context-aware behavior modulation based on time-of-day and user communication norms.

Improvement

Idle-Time Maintenance: Curiosity runtime performs non-blocking session compaction and skill discovery when the operator is away.

Feature

Anticipatory Intelligence: CommitmentLedger, TemporalPatternDetector, and InitiativeEngine surface overdue and upcoming tasks proactively.

Feature

Procedural Learning: ProceduralStore, ReflexionMemory, and EvalRubric for verified skill generation and self-critique.

Feature

23 bundled starter skills covering code review, deployment, research, and common operator workflows.

Feature

Context7 MCP server wired as default for all agents — live documentation lookup in every turn.

Feature

New operator commands: /mind, /goals, /commitments, /sleep, /reflections, and /cost.

Feature

35 built-in tools (up from 30) and multi-bot Telegram support.

Fix

Streaming fixes: tool_use block sanitisation and input_json_delta accumulation for partial tool calls.

Fix

Cross-agent error routing: failures now route to the correct agent session instead of leaking across contexts.

Feature

Turn Evaluator: lightweight Haiku quality gate after heavy turns, with /eval command to toggle per-agent.

Feature

Cross-agent context isolation: session IDs scoped per agent, preventing memory bleed in single-bot mode.

Feature

Soft cap with wrap-up nudge: agents summarise at 30 tool calls instead of hard-stopping.

Feature

Sprint contracts: acceptance criteria on commitments for structured task completion.

Improvement

Context reset on task switch: fresh session when switching between agents.

Improvement

Shell exec timeout raised from 5s to 30s default, configurable up to 5 minutes.

Fix

Eval pattern narrowed: Python scripts no longer blocked by security policy.

Fix

Ollama token counting fixed: prompt_eval_count and eval_count now tracked correctly.

Improvement

Compaction thresholds raised to 0.80/0.92 for Sonnet 4.6; persona continuity preserved across summaries.

Fix

Ollama reasoning trace filter: <think> blocks stripped from user-facing responses.

Fix

Context replay sanitisation: tool_use blocks stripped from replayed turns.

Feature

Identity Interview: 5-archetype onboarding that generates personalised SOUL.md + USER.md + OPERATING.md — no LLM needed.

Feature

agentskills.io Importer: import/export skills from the 69K+ ecosystem via nemoris skill import.

Improvement

2,397 tests passing across unit, reliability, and dogfood suites — zero regressions.

Feature

Organic intelligence v0.2: trust progression system with 5 trust levels and proactive triggers wired into the runtime.

Feature

4-suite reliability harness: 340 deterministic dogfood tests covering durability, self-healing, delivery, transport, and provider failover.

Feature

Dogfood lifecycle harness: 10-phase end-to-end smoke test for full runtime verification.

Feature

Proactive triggers: the runtime initiates agent turns based on time, memory, and trust context without user prompting.

Improvement

Reliability tests are now the required CI gate; unit tests are informational.

Fix

SQLite handle cleanup in tests to prevent EBUSY on Windows.

Fix

Replaced vendored smol-toml tarball with registry dependency — fixes broken npm install for consumers.

Security

Sanitised test paths and hardened .gitignore. Removed internal planning docs from tracking.

Improvement

Windows support: Ollama install detection and cross-platform path handling.

Feature

48h raw context window with point-in-time snapshots and rollback.

Feature

Curiosity Engine: idle-time memory deduplication, session compaction, and skill proposals.

Feature

Frustration detection: agents halt on error loops (3+ same error) and ask for help.
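
The error-loop check could be sketched roughly as below. It assumes "same error" means consecutive failures whose messages match after light normalisation (lowercasing, collapsing digits and whitespace); `ErrorLoopDetector` and `normalise` are hypothetical names, and only the threshold of 3 comes from the entry above.

```typescript
// Tracks consecutive identical errors and signals when to halt and ask for help.
class ErrorLoopDetector {
  private lastError: string | null = null;
  private streak = 0;
  private readonly threshold: number;

  constructor(threshold: number = 3) {
    this.threshold = threshold;
  }

  // Pass the error message of a failed step, or null for a successful one.
  // Returns true once the same error has occurred `threshold` times in a row.
  record(error: string | null): boolean {
    if (error === null) {
      this.lastError = null;
      this.streak = 0;
      return false;
    }
    const key = normalise(error);
    this.streak = key === this.lastError ? this.streak + 1 : 1;
    this.lastError = key;
    return this.streak >= this.threshold;
  }
}

// Assumption: messages differing only in case, numbers, or spacing count as the same error.
function normalise(message: string): string {
  return message.toLowerCase().replace(/\d+/g, "#").replace(/\s+/g, " ").trim();
}
```

A successful step resets the streak, so three failures spread across otherwise healthy turns would not trip the detector in this version.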

Feature

Interrupt responsiveness: /stop halts mid-flight operations within 2-3 seconds.

Feature

Preference learning: approve an action once and the gate skips it next time; the preference persists across restarts.

Feature

Scope escalation simplified: /approve /path grants read/write access persistently.

Feature

Workflow engine: TOML pipelines with approval gates, resume-on-restart, and sandboxed interpolation.

Feature

Flight recorder: TurnTrace SQLite logging with /trace for turn replay and search.

Feature

Patch generation: apply_patch and generate_patch tools for atomic file updates.

Feature

Structure config, swap providers, and patch live settings without restarting the runtime.

Improvement

JIT tool loading: tools load on demand rather than being pre-compiled at runtime start, which speeds up startup.

Feature

Local-first doctor diagnoses Full Disk Access permissions, port conflicts, and system health.

Fix

Six wiring fixes across the approval gate, tool context, turn traces, auto-resume, OpenRouter, and inline approvals.

Feature

Telegram inline approval buttons for quick approve and deny without context switching.

Feature

MessageQueue delivery modes: debounce, immediate, and batch cadence per chat.

Feature

SKILL.md open standard: compatible with Claude Code, Cursor, and Codex CLI. Skills in ~/.claude/skills/ work across tools.

Feature

Bundled browser skill: agent-browser CLI for web automation, screenshots, form filling, and content extraction.

Feature

nemoris dogfood: 49-check runtime verification CLI. Zero API calls, zero tokens. JSON mode for CI.

Feature

Learning loop: SelfCritic scores every turn, PatternLedger detects recurring requests, SkillProposer generates skills with operator approval.

Feature

35 built-in tools including rollback, show_changes, request_tool (JIT discovery), create_agent, create_skill, and create_mcp.

Feature

Core architecture: Active Memory, Delivery Guarantees, Task Contracts.

Feature

Providers: Anthropic (direct + prompt caching), OpenRouter (100+ models), Ollama (local).

Feature

Telegram integration: slash commands, reactions, vision, inline keyboards.

Feature

Self-healing Nurse system: health probes, automatic repair, rule promotion.

Feature

Exec approval gate: human-in-the-loop for shell commands.

Feature

MCP consumer: connect external MCP servers as native tools via config/mcp.toml.

Feature

Session search: FTS5 full-text search across conversation history.

Feature

Context compaction: DAG-based session summarisation.

Feature

Active recall: semantic memory with salience scoring and embeddings.

Feature

Multi-agent: task contract triggers and completion pings.

Feature

Scheduled jobs: cron-triggered and ad-hoc jobs in a unified queue.

Feature

Cross-platform: macOS (launchd), Linux (systemd), Windows (PM2).

Feature

Interactive setup wizard with provider OAuth, Telegram wiring, and model selection.

Feature

Migration CLI: nemoris migrate imports agents, jobs, and memory from prior runtimes.

Security

SSRF protection on all URL-intake surfaces.

Security

Input sanitisation with injection detection and boundary tagging.

Build in progress by the Nemoris team