Technical Information

Governed AI for Critical Knowledge

Award-winning enterprise AI with complete accountability. Every answer cited, every action audited, every change traced. Run fully offline with open source models. Enterprise governance built in, not bolted on.

How It Works

From Documents to Intelligent Answers

Upload Your Knowledge

Import 25+ document formats. iKB processes, indexes, and builds knowledge graphs automatically.

AI Understands Context

Advanced RAG pipeline with intent classification, contextual chunking, and entity extraction.

Users Ask Questions

Across 6 channels — or in Freeform mode with personal uploads and web search.

Accurate, Traced Answers

Quality-scored, cited responses with full pipeline provenance tracing.

New — Freeform Chat

Personal AI Workspace

A new conversation mode where users chat with AI without a pre-configured knowledge base. Upload personal documents, search the web, and get answers — all within governed limits.

Chat Modes

Standard KB chat and Freeform mode, switchable via sidebar toolbar

Dual

RAG Sources

Searches both uploaded personal documents and web simultaneously

Unit Tests

Comprehensive coverage for quotas, model validation, conversation limits

REST

API Blueprint

Full CRUD endpoints for conversations, uploads, and quota checking

Core Capabilities

Users upload documents into their own Freeform conversation with web search fallback when local docs are insufficient.

Per-User Document Upload Web Search Integration Document Panel Sidebar UI Mode Switch Dual-RAG

Administration

Admins control Freeform access, quotas, and model availability with full session visibility.

Global Enable/Disable Quota System (Daily/Weekly/Monthly) Token Usage Limits Model Restrictions Admin Session Visibility Auto-Cleanup & Expiry S3 Orphan Cleanup

iKB Memory

Per-User Semantic Memory

AI remembers user preferences and facts across sessions. Powered by Mem0 with pgvector. Hybrid architecture: global memories shared across topics + topic-specific memories scoped per domain.

Automatic Extraction

After each conversation turn, the AI silently learns relevant facts about the user — preferences, role, style, and domain knowledge.

Privacy-First

Incognito mode skips memory entirely. Full GDPR/PDPA DSAR support: search, export, and purge user memories. Every operation audit-logged.

All 6 Channels

Memory works across Native, WebChat, Slack, Teams, OMNI, and API. Anonymous users get session-scoped memory with auto-cleanup.

Mem0 + pgvector Global + Topic Memories Per-Topic Toggle Token Budgets Incognito Mode Fail-Safe Audit Logged

LiteLLM Gateway

Unified AI Model Gateway

Full LiteLLM integration as the unified model gateway, replacing direct API calls. Provider-first model discovery, per-model hybrid routing, unified cost tracking, and dynamic configuration — all managed from the admin UI.

Provider Discovery

Discover and enable models from OpenAI, Anthropic, Google, Cohere, Cerebras, and LocalAI/Custom providers directly from admin UI.

Hybrid Routing

Each model independently routed through LiteLLM or legacy direct API. "iKB LLM Router" / "Legacy" badges in all dropdowns.

Cost Tracking

Real-time cost dashboard with per-topic, per-model, per-channel spend attribution via LiteLLM's extra_spend_tag_headers.

Dynamic config.yaml Gateway Health TTS/STT Routing Cohere Reranking Auto-Restart All 6 Channels Budget Controls

Hierarchical Skills System

33+ Output Skills with Composable Sub-Skills

Expanded skill framework with categories, composable sub-skills via {{sub:slug}}, and file-based skill loading. 18 system skills + 15 enterprise skills covering document co-authoring, newsletters, incident reports, quotations, and 5 pedagogical skills for tutor mode.

33+

Total Skills

18 system + 15 enterprise (co-authoring, 3rd-party reports, newsletters, incident reports, quotation/scope of work) + 5 pedagogical

Detection Tiers

Manual @prefix (free) → regex patterns (free) → intent mapping (free) → LLM classifier (tokens)

Injection Blocks

Forbidden prompt injection patterns validated on save. Skill instructions explicitly lower-priority than system rules.

All 6

Channels

Manual @prefix on all channels. Auto-detection on native chat. Pipeline tracing logs skill detection method.

Custom Skill Builder

Create custom skills with name, slug, prompt template (2000 chars), regex triggers with ReDoS validation, and governance flags.

Prompt Templates Regex Triggers ReDoS Validation Governance Flags Multi-Skill

Document Export

Per-message export to Word, Excel, and PowerPoint via ⋮ menu. AI-powered reformatting with Advanced Processing Model. Auto-export on narrative requests. Diagrams embedded in exports.

DOCX XLSX PPTX AI Reformatting Auto-Download Diagram Embedding

New — Tutor Mode

Guided Learning Journeys

Transform any knowledge base into an interactive learning experience. AI-generated step-by-step learning plans, comprehension questions, and Socratic teaching — all grounded in your documents.

3–12

Step Plans

Progressive learning plans that scale with document volume, tailored to learner's stated goal

Pedagogical Skills

Diagnostic assessment, mastery tracking, targeted drill, error analysis, Socratic teaching

MCQ

+ Free-Text

Auto-generated comprehension questions with real-time LLM evaluation and feedback

Persistent

Sessions persisted to database for long-term analytics, not just Redis

Tutor Experience

Split-panel UI slides in from the right with step navigation and progress tracking. "Tutor Me" on any selected text. Quick-access graduation cap icon. AI inference disclaimer banner.

Split-Panel UI Step Navigation Progress Tracking "Tutor Me" on Selection Quick-Access Button AI Inference Disclaimer

Quiz System

5–10 question quizzes generated from topic documents via RAG. Per-concept mastery tracking. Instant MCQ feedback (zero latency). Free-text evaluated with confusion-type classification.

RAG-Generated Questions Per-Concept Mastery Confusion Classification Persistent Results Admin Toggle Score + Duration + Tokens

Tutor Activity Reporting

Admin dashboard with Tutor Sessions tab (topic, user, step progress, completion status, tokens) and Quiz Results tab (score, duration, questions, token cost). Summary cards with date range filtering (7/14/30/90 days). Scoped admin access.

Session Analytics Quiz Results Completion Rate Average Score Token Tracking Date Range Filter

New — Prompt Guard

Offline Injection Detection

Four guard modes protect against prompt injection attacks. Output scrubber detects system prompt leaks. Security sandwich reinforces safety invariants via recency bias. Integrated across all channels.

4 Guard Modes

Off, Soft (log only), Warn (flag to user), Strict (block message). Configurable per topic in Topic Settings.

Output Scrubber

Detects system prompt leaks in AI responses. Prevents accidental exposure of internal instructions.

Security Sandwich

Reinforces safety invariants at the end of the prompt via recency bias. Tutor/quiz context wrapped in "REFERENCE DATA — NOT INSTRUCTIONS" blocks.

New — AI Workspaces

Private Multi-User Workspaces

Users create private AI workspaces with document collections. Per-workspace model selection, web search, voice, document limits, and token budgets. DLP mode blocks or warns on sensitive content.

Workspace Features

Each workspace is a self-contained AI environment with its own document collection, model configuration, and conversation history.

Private Documents Model Selection Web Search Voice Support Document Limits Token Budgets

DLP & Governance

Data Loss Prevention mode blocks or warns on sensitive content in workspace conversations. Full audit trail for all workspace activity.

DLP Mode Block / Warn Sensitive Content Detection Audit Trail

New — Tiered Context

Smarter Conversation Memory

Two-tier context management for long discussions. Recent messages at full fidelity, older messages auto-summarized. Configurable per topic, supported across all 6 channels.

Two-Tier Architecture

Tier 1: recent messages at full fidelity. Tier 2: older messages with AI-generated rolling summaries for context continuity.

Auto/Manual Modes

Choose automatic context management or manual control. Configurable Tier 1/Tier 2 counts in Topic Settings.

All 6 Channels

Native chat, Slack, Teams, Chatwoot, API, and WebChat all benefit from tiered context management.

New — Contextual Follow-up

"Ask About This" & "Tutor Me"

Highlight any text in an AI response and a floating popup appears. Click "Ask About This" for an instant follow-up question, or "Tutor Me" to start a learning journey on that concept.

Text Selection Popup Instant Follow-Up Tutor Me Integration Works on All Responses

Consent & Terms

Information, Terms & Privacy

Comprehensive consent management with governance-first design. Blocking modal on first visit, admin-facing responsible administration interstitial, and WebChat widget terms — all with immutable audit trails and version tracking.

User & Admin Terms

Blocking acceptance modal for users. "Responsible Administration" interstitial for admins. Decline = redirect/logout. Version bumps force re-acceptance.

Three-Layer Persistence Version Tracking Multi-Language DOMPurify Sanitized

Governance

Immutable TermsAcceptance audit trail. CORS-enabled widget endpoints. Rate-limited (30/min status, 5/min accept). Per-language content via AppSettings.

Immutable Audit Widget CORS Rate-Limited IP Tracking

Evaluation Framework

Automated Quality Testing

Evaluate AI response quality using the promptfoo testing framework. Create test cases, run evaluations, and measure accuracy at scale.

promptfoo Integration

Run evaluations against topics using the promptfoo binary. Start, monitor, cancel, and purge evaluation runs with real-time polling.

Bulk Test Import

Import test cases from CSV/XLSX with header normalization, BOM handling, delimiter detection, and Unicode support.

Results Dashboard

View pass/fail rates, assertion details, and response quality metrics. Automated scoring for empty, error, short, and unhelpful responses.

Test Case Management Create, edit, delete, and reorder evaluation test cases per topic.

Model Selector Choose which AI model to run evaluation against.

Advisory Locks Concurrent run prevention for evaluation safety.

Governance Integration Token tracking and audit logging for all evaluation actions.

Discretionary Access Control

Per-Admin DACL Permissions

18 granular permission columns per admin account with deny-by-default enforcement. Topic-scoped and system-scoped tiers with hierarchy enforcement to prevent privilege escalation.

Permission Columns

Topic-scoped: model selection, web crawl, token costs, documents, analytics. System-scoped: users, channels, settings, models, governance

Permission Tiers

Topic-scoped permissions and System-scoped permissions — deny-by-default on both

100%

Server-Side

Every permission check enforced at API level, not just UI. Fail-closed on missing permissions.

Audit

Logged

All permission changes tracked in governance audit trail with actor and timestamp

Admin Management

Superadmins assign granular permissions per admin. Visual permission matrix with bulk operations. Hierarchy enforcement prevents privilege escalation.

Permission Matrix UI Deny-by-Default Superadmin Bypass Self-Edit Prevention Per-Channel Limits Hierarchy Enforcement

Enforcement

Server-side middleware checks permissions on every API call. UI dynamically hides unauthorized sections. Per-channel-type creation limits (max Slack bots, max Teams channels).

API-Level Enforcement Dynamic UI Fail-Closed Navigation Filtering System Logs (Superadmin)

Rich Rendering & Visualization

7 Visualization Engines

AI responses come alive with interactive charts, mind maps, maps, timelines, inline SVG graphics, dashboard grid layouts, and syntax-highlighted code — all rendered client-side with lazy loading.

ECharts

Interactive charts and data visualizations. Bar, line, pie, scatter, radar — AI generates chart configs from data analysis.

Markmap

Mind map rendering from markdown headings. Interactive zoom, pan, and collapse for complex knowledge structures.

Leaflet

Interactive maps with markers, popups, and tile layers. AI can plot locations, routes, and geographic data.

Prism.js Syntax Highlighting

Beautiful code blocks with language-specific syntax highlighting. 40+ language support with one-click copy.

40+ Languages Dark/Light Themes Copy Button Line Numbers

vis-timeline

Interactive timeline visualizations for historical events, project milestones, and chronological data. Zoom, pan, and grouping.

Interactive Zoom Event Grouping Date Ranges Custom Styling

Inline SVG Graphics

AI generates custom vector graphics directly in responses — architecture diagrams, schematics, infographics, comparisons. DOMPurify-sanitized, hidden until rendered.

Custom Illustrations DOMPurify Sanitized No Visual Flash All Channels

Dashboard Grid Layouts

AI arranges 2–4 charts side by side using layout markers. Responsive — columns stack vertically on mobile.

2–4 Column Grid Responsive Stacking Layout Markers

Diagram Engine — Unified Tooling

Per-diagram toolbar with copy, download, and expand. All diagrams render in light mode regardless of page theme. AI-generated disclaimer on all containers. 70% transparent backgrounds. Native SVG embedding in zoom modal. Diagrams embedded in Office exports (DOCX/XLSX/PPTX).

Per-Diagram Toolbar Copy / Download / Expand Light Mode Rendering AI Disclaimer Office Export All 7 Engines Lazy-Loaded

Unified RAG Pipeline

Single Shared Pipeline — All 6 Channels

All channels share a single retrieval pipeline, eliminating duplicated code. Always-on tracing with governance-grade provenance. 8-stage pipeline with query decomposition and neighbourhood expansion.

8-Stage Retrieval Pipeline

Unified pipeline with per-document search weights, query decomposition for complex multi-part questions, and neighbourhood expansion for chunks that span boundaries. All channels gained these features automatically.

HyDE Query Decomposition Embedding Search Document Weight Re-scoring Reranking Deduplication Agentic Retrieval Neighbourhood Expansion

Technical Details Modal

Per-message pipeline trace visible in session detail admin page. Full LLM generation config stored per message (model, temperature, max_tokens, reasoning_effort).

Intent Classification

AI classifies queries into 7 intent types (factual, comparison, summarization, multi-hop, procedural, clarification, out-of-scope) to dynamically adjust retrieval strategy.

Skip Retrieval (OOS) Skip HyDE (Factual) Force Multi-Query Prefer GraphRAG Token-Efficient (16 tokens)

AI Intelligence

Advanced Retrieval & Quality

Unified 8-stage retrieval pipeline with quality scoring, per-user memory, per-document search weights, and first-person AI voice.

AI Judge & Quality Scoring

QAG faithfulness decomposition, few-shot calibration (65% → 77.5% consistency), position-weighted scoring with relevance tier badges. Abstention-aware — honest refusals score as PASS.

GraphRAG

Cross-document entity relationships, hybrid graph+vector queries. Retry button for failed documents, graceful partial failure, PostgreSQL lock safety.

Neighbourhood Expansion

Adjacent chunks auto-pulled to capture cross-boundary information. Smart dedup, reading-order interleaving, configurable window.

Unified RAG Pipeline HyDE, query decomposition, agentic retrieval, reranking, neighbourhood expansion — shared across all 6 channels.

Knowledge Domain + RAG Strategy Per-topic domain classification (healthcare/medical). Bulk RAG strategy. Pipeline Analyzer for debugging retrieval quality.

Multi-Model via LiteLLM OpenAI, Anthropic, Google, Cohere, Cerebras, LocalAI via unified gateway. Per-model routing with cost tracking.

Source Citations Source docs with page numbers. Page references stored per message for governance.

Multi-Language Any language in, any language out. Reinforced language mirroring. Drift prevention on resend.

Spreadsheet Analytics (Datagram) Structural profiler, smart sheet selection, 75 permitted pandas functions, chain-of-thought code generation, admin-configurable reasoning effort.

Per-Document Search Weight Fine-tune hybrid search balance per document. Auto-detected on upload: legal PDFs favour BM25, FAQs favour semantic.

Rich Visualizations Mermaid, ECharts, Markmap, Leaflet, vis-timeline, Prism.js, inline SVG — 7 rendering engines.

Follow-Up Suggestion Chips AI generates 2–3 clickable follow-up questions below each response. Per-topic toggle.

iKB Memory Per-user semantic memory via Mem0 + pgvector. AI remembers preferences and facts across sessions. Incognito mode available.

Knowledge Organization Topics, category groups, custom AI instructions, starter questions, real-time updates.

Per-Channel Instructions Different master AI instructions per channel. Widget gets concise, Native gets detailed.

33+ Hierarchical Skills Composable sub-skills, skill categories, 15 enterprise skills, 5 pedagogical skills. File-based manifest loading.

Governance & Compliance

Enterprise-Grade AI Governance

Four independent, immutable logging systems. Consent management with version tracking. OpenTelemetry instrumentation. Every action captured. Nothing deleted.

Audit Pillars

Logs, Prompt Versions, Config Changes, Moderation Events

47+

Tracked Fields

Before/after snapshots with IP attribution

Moderation Categories

Fail-closed, per-topic, per-channel

Deletable Records

Append-only. Deletion returns 403.

Content Moderation

13-category moderation using OpenAI's omni-moderation model, free of charge. Configurable per topic, enforced across all channels.

Fail-Closed Default Per-Topic Toggle Per-Channel Coverage Performance Tracking Translated Warnings

DSAR Compliance

Full GDPR/PDPA compliance integrated into the Governance dashboard. Multi-channel user data discovery, export, and erasure.

Multi-Channel Search Export (Art. 15) Purge (Art. 17) Audit-Logged Compliance-Safe Governance Tab

Complete Sovereignty — Run Fully Offline

Deploy on your own GPU infrastructure using open source AI models through vLLM, Ollama, or any OpenAI-compatible endpoint. Every component runs locally. No data ever leaves your network. Zero external dependencies for defence, government, finance, and any environment where data must never cross the perimeter.

vLLM Ollama OpenAI-Compatible Local GPU Inference Air-Gapped Deployment Zero External Dependencies On-Premise Embeddings

May 2026 — Governance Bundle (Plans 1–5)

Plan 1 — Audit Hash Chain

Every audit event carries OpenTelemetry attributes (ikb.audit.id, entry_hash, previous_hash) so SIEMs can chain-verify directly from Loki without DB access. Privacy-safe SHA-256 of metadata only. V1+V2 chain-aware verifier.

SHA-256 OTel SIEM-ready V1+V2 Verifier

Plan 2 — PII Redaction (Output)

Per-topic configurable. Schema + regex detector for email / phone / SSN / Luhn-validated CC / Malaysian NRIC. Native streaming with StreamingRedactor (64-char lookback). Wired across native, widget, Slack, Teams, OMNI, API. Fail-CLOSED on detector exception.

All Channels Streaming-Aware Fail-CLOSED NRIC Luhn-Validated CC

Plan 3 — Chat-Data Retention

Per-topic chat_retention_days. Daily Celery sweep at 02:00 UTC sets hidden_at. Read-path filter applied across chat history, sidebar, admin sessions/export, statistics, compare, shared conversation, dashboard, and admin session detail.

Soft-Hide Only Daily Sweep Read-Path Filter No Deletion

Plan 4 — Compliance Dashboard

Admin tile under Dashboards → Compliance, gated by perm_view_governance. Aggregator caches 7 tiles: audit activity, retention sweeps, encryption coverage, prompt versions, PII redactions, config changes, OTel export status.

7 Tiles Permission-Gated Cached OTel Health

Plan 5 — Document Versioning + Dedup

Content-SHA-256 hash on every document. New / new-version / duplicate-skipped decision tree. Deferred-promotion model — the prior version stays active until the new doc is fully indexed. No retrieval gap. Sibling-aware promote handles out-of-order v3-before-v2. 5-minute reconcile sweep. Admin UI version stack with per-version Revert. Idempotent Celery backfill (1/sec rate-limited) for legacy content_sha256 IS NULL rows.

SHA-256 Dedup Deferred Promotion No Retrieval Gap Sibling-Aware Per-Version Revert Reconcile Sweep Idempotent Backfill

Immutability Append-only records

Fail-Safe Logging never breaks ops

Privacy by Design Auto-redaction

Non-Repudiation IP + actor + timestamp

Exports CSV / ZIP up to 10K records

May 2026 — Retrieval

Structural Anchor Retrieval & `structural_counting` Skill

When a query names a structural unit by label and number (Section 5, Article 7, Perkara 14, §3), the matching anchor chunk is injected ahead of vector ranking. Multilingual lexicon. Corpus-agnostic. Auto-enabled per Knowledge Domain.

Multilingual Anchor Detector

Keys off markdown bold (**14.**), bold-label (**Perkara 14**), abbreviations (**Per. 14**), and line-anchored numerics. EN + Malay (Perkara, Fasal, Bahagian, Bab, Jadual, Perenggan, Klausa). Three-tier gate: kill-switch → per-topic flag → force-enable.

EN + Malay Markdown Bold Numeric Anchors Three-Tier Gate Per-Topic Override

Drop-One-Level Counting

When sub-items have mixed active/repealed status, the skill drops one structural level and cites the repealing acts. Solves "how many fasal still in force in Perkara 14?" with grounded counts and provenance — no hallucinated counts.

4 Trigger Patterns Repealing Acts Pinned by Tests

May 2026 — Reliability

Mobile + Streaming Resilience

A stable AI product survives mobile tab backgrounding, screen sleep, BFCache restore, and pinch-zoom. iKB does — on iOS Safari, Android Chrome, and desktop. No silent failures. No lost responses.

Tab Backgrounding

In-flight responses survive tab backgrounding. Auto-refresh on return. Dead-stream watchdog with scoped recovery polling. Long-idle resume refresh (topic dropdown + history sidebar) after 30+ seconds, with same-origin session-expired toast.

BFCache + iOS Quirks

Markdown re-renders on iOS / Chrome back-forward navigation. No collapse to plain text after sleep/wake. Pinch-zoom no longer drifts the chatbar. Cross-tab session-topic invariant preserved (A/B tabs don't leak).

Streaming TTS on iOS

Chunked playback for long messages with fast TTFB. Audio-element-first strategy on iOS routes to speaker (mute switch detection). Redis cache for repeat phrases (5 MB / 3600s TTL). MediaSource on Chrome/Edge, blob fallback on Firefox/Safari.

Phase Progress & Confidence

Pipeline phase events with caption next to dots and 114 phrasing variants. Anti-hallucination guard with low-confidence UI surface. One-line confidence footer with overall confidence and persisted "Answered in: X.Ys". Multi-question synthesis (Option A+) covers all sub-questions.

May 2026 — Models & Providers

5 to 17 Providers. Capability Auto-Learning.

Expanded default provider registry. Dynamic provider cards in admin UI, "Add Provider" modal with API endpoint, topic model dropdown grouped by provider with capability icons. LiteLLM proxy is now the source of truth for capability handling.

Capability Auto-Learn

Auto-populate and runtime auto-learn for supports_temperature, supports_reasoning, requires_max_completion_tokens. Gateway-as-source-of-truth: LiteLLM delegates per-model param handling (drop_params: true, modify_params: true); app-side capability branches removed.

Auto-Populate Runtime Auto-Learn drop_params default_reasoning='none'

Content Firewall — Presidio

Engine swap to Presidio (ML) with regex_fallback always-works mode. Privacy & Safety grouping in admin UI. Preview mode, retry on transient failures, allow-list, custom rules support. Failure-resilient init: retry sticky failures with backoff. Document re-processing flow removed by design (fresh-upload-only).

Presidio (ML) regex_fallback Allow-List Custom Rules Retry-on-Failure

Prompt Helper

prompt_generator_service with 33 unit tests. Modal UI on every channel: native, webchat, Slack, Teams, OMNI, API. AI-generated draft prompts that are channel-aware, with full i18n and help docs. Reduces blank-page paralysis for new users.

All 6 Channels Channel-Aware Drafts i18n 33 Unit Tests

Security

Enterprise Security

170-finding security audit across every admin page and channel. Encryption at rest for chat messages and pipeline traces. Prompt Guard injection detection. Audit hash chain with SHA-256. Redis rate limiting. Fail-closed deployment.

Dual-Key Encryption

AES-256-GCM with user key + admin key. All secrets Fernet-encrypted. Auto-redacted in logs.

Enterprise SSO

Authentik OIDC, JIT provisioning, group sync. Three modes: SSO-only, hybrid, local-only.

App Hardening

SSRF blocking, ODBC injection prevention, CSP nonces, CSRF, HSTS, XSS encoding.

Account Lockout

5 failed logins triggers 15-minute lockout. Stored in DB, survives restarts. Auto-expiry.

Egress Policy System

Network egress control for AI tools. Per-tool allow/deny rules, DNS-time SSRF validation, nonce-based tool delimiters.

147-Finding Audit

Complete security audit: SQL injection, SSRF, XSS, DACL bypass, credential leaks, privilege escalation — all remediated.

Self-Hosted

Your infrastructure, your network. Air-gapped deployment. No data leaves the perimeter.

No External Training

Documents are NEVER used to train AI models. Complete data sovereignty guaranteed.

Session Security

HTTPOnly cookies, strict SameSite, UUID v4, TOTP 2FA, HMAC webhooks, IP whitelisting.

Fail2Ban Integration

Network-level brute force protection. Structured log format for Fail2Ban parsing. Auto-ban repeat offenders at firewall level.

XSS Remediation

Comprehensive cross-site scripting audit and remediation across all user-facing templates and API responses.

Prompt Guard

4 guard modes (Off/Soft/Warn/Strict). Output scrubber. Security sandwich. CSRF global interceptor. Per-session Redis rate limits.

Audit Hash Chain

SHA-256 hash chain on audit logs. DB trigger prevents UPDATE/DELETE. Encryption at rest for chat messages, pipeline traces, and feedback.

Channels & Integration

One Knowledge Base, Every Channel

Deploy across 6 channels with full cross-channel feature parity. Reasoning level and text verbosity now configurable on all channels. All RAG features and skills available everywhere.

Primary

Native Chat

Web Channel

Secondary — Try the live demos

WhatsApp +60 3-8689 2818

Telegram @omni_test_acc2_bot

Slack @ikbv2026

Integrations

Microsoft Teams

Chatwoot / OMNI

REST API

Full Cross-Channel Feature Parity

All advanced RAG features — pipeline tracing, intent classification, content moderation, HyDE, multi-query, agentic retrieval, self-critique, reranking, GraphRAG, quality scoring, and negative response filtering — now work identically across all 6 channels. Unified governance prompt injection with smart layer truncation.

Pipeline Tracing Intent Classification Content Moderation HyDE Multi-Query Agentic Retrieval Self-Critique Reranking GraphRAG Quality Scoring Webhook Idempotency

Web Chat Widget

Deploy on any website with a single script tag. Frosted glass input, logo in history sidebar.

Voice STT/TTS Starter Forms Human Escalation AI Disclosure Lead Capture 3 Display Modes Domain Whitelist Frosted Glass UI

External API Tools

MCP + REST tool calling with parallel execution, encrypted auth, confirmation gating, and execution audit.

MCP Connections REST Tools Tool Orchestrator Encrypted Auth Block Mode Response Redaction Actionable Confirmation

Human Escalation & Chat Export

Human handover with email transcripts and rate limiting. Export conversations to Word, Excel, or PDF.

Per-Topic Toggle Email + Transcript DOCX Export XLSX Export PDF Export Rate Limited

SQL Functions & Cloud Sources

Schema-level discovery, admin annotations, query playground, and 40+ cloud storage integrations via rclone. All 12 provider configs aligned with actual rclone options.

Auto-Discovery REST API Cloud Sync (40+) 12 Providers Aligned Governance Exports Structured Errors

Admin & UX

Powerful Admin, Delightful Chat

LiteLLM gateway management, skills builder, self-update, AI Firewall, instant tooltips, UI standardization, and consent management — all from the admin panel.

Admin Enhancements

Bulk Operations & Trash Bulk delete documents with soft-delete trash bin. Restore or permanently purge. 30-day auto-expiry.

Unified People Tab Admin Groups with Authentik SSO sync. Account disable/enable toggle. Crawl job naming and per-chunk section editing.

AI Firewall Dashboard Dedicated governance tab for content moderation events with structured metadata logging.

LiteLLM Gateway Admin Provider discovery, model health, rate limits, budget controls, and cost dashboard from System Settings.

Instant Tooltips CSS-only zero-delay tooltips across admin, native chat, and widget. ~70 new tooltips. Smart text wrapping.

UI Standardization Consistent form sizing, button sizing, icon-only action buttons, tightened action bars across all admin pages.

Chat UX Improvements

Follow-Up Suggestion Chips 2–3 clickable follow-up questions below responses. Per-topic toggle.

Code Block Copy & Web Worker SSE One-click copy on code blocks. Streaming survives background tabs on mobile and desktop via Web Worker.

Inline SVG Graphics AI generates custom vector illustrations, schematics, and infographics directly in responses.

7 Rendering Engines Mermaid, ECharts, Markmap, Leaflet, vis-timeline, Prism.js, SVG — all lazy-loaded.

Per-Message Document Export ⋮ menu on each response: export to Word, Excel, or PowerPoint with AI-powered reformatting.

"Ask About This" & Lazy Topics Highlight text for instant follow-up or tutor launch. Lazy topic rendering removes pagination for large lists.

Documents & Crawling

25+ Formats, Smart Crawling, Resumable Processing

Document Processing

PDFWordExcelCSVMarkdownRTFEPUBBibTeXDWGDXFEMLMSGPNGJPGGIFTIFFWEBPODSWeb Crawl

Resumable processing survives worker restarts. Section-boundary chunking. Image-aware HyDE. Auto-merge retrieval. Real-time progress bar. Per-chunk section editing.

Web Crawling Enhancements

Cloudflare Browser Rendering — Alternative crawl engine using Cloudflare's Browser Rendering API. Configurable per job alongside Playwright.
Opt-In Document Download — Choose which document types to download during crawl (PDF, DOCX, etc.)
Conditional Re-Crawl — HTTP conditional headers (ETag, Last-Modified) to skip unchanged pages
Re-Crawl Completed Jobs — Not just failed ones. Force reprocess on retry for stale chunks.

Deploy & Scale

Flexible Deployment, Any Scale

Cloud SaaS

Fully managed, automatic updates, Celery task hardening

Private Cloud

Dedicated instance in your cloud region

On-Premise

Your infrastructure, air-gapped available

Performance

<500msQuery p95

100+Pages/min ingestion

10K+Concurrent users

99.9%Uptime SLA

Test Infrastructure

2,251Unit Tests

28Test Files

3.2×Latest Growth

SQLiteIn-Memory Tests

Covers: access control, account lockout, analytics, app settings, auth flow, celery tasks, circuit breaker, error handlers, eval service, freeform service, quality scoring, security, token counting, web crawl.

See iKB in Action

See how iKB can help make your documents more accessible and searchable.

Get Started Schedule a Call Try It Now

Governed AI for Critical Knowledge

From Documents to Intelligent Answers

Upload Your Knowledge

AI Understands Context

Users Ask Questions

Accurate, Traced Answers

Personal AI Workspace

Core Capabilities

Administration

Per-User Semantic Memory

Automatic Extraction

Privacy-First

All 6 Channels

Unified AI Model Gateway

Provider Discovery

Hybrid Routing

Cost Tracking

33+ Output Skills with Composable Sub-Skills

Custom Skill Builder

Document Export

Guided Learning Journeys

Tutor Experience

Quiz System

Tutor Activity Reporting

Offline Injection Detection

4 Guard Modes

Output Scrubber

Security Sandwich

Private Multi-User Workspaces

Workspace Features

DLP & Governance

Smarter Conversation Memory

Two-Tier Architecture

Auto/Manual Modes

All 6 Channels

"Ask About This" & "Tutor Me"

Information, Terms & Privacy

User & Admin Terms

Governance

Automated Quality Testing

promptfoo Integration

Bulk Test Import

Results Dashboard

Per-Admin DACL Permissions

Admin Management

Enforcement

7 Visualization Engines

ECharts

Markmap

Leaflet

Prism.js Syntax Highlighting

vis-timeline

Inline SVG Graphics

Dashboard Grid Layouts

Diagram Engine — Unified Tooling

Single Shared Pipeline — All 6 Channels

8-Stage Retrieval Pipeline

Technical Details Modal

Intent Classification

Advanced Retrieval & Quality

AI Judge & Quality Scoring

GraphRAG

Neighbourhood Expansion

Enterprise-Grade AI Governance

Content Moderation

DSAR Compliance

Complete Sovereignty — Run Fully Offline

May 2026 — Governance Bundle (Plans 1–5)

Plan 1 — Audit Hash Chain

Plan 2 — PII Redaction (Output)

Plan 3 — Chat-Data Retention

Plan 4 — Compliance Dashboard

Plan 5 — Document Versioning + Dedup

Structural Anchor Retrieval & structural_counting Skill

Multilingual Anchor Detector

Drop-One-Level Counting

Mobile + Streaming Resilience

Tab Backgrounding

BFCache + iOS Quirks

Streaming TTS on iOS

Structural Anchor Retrieval & `structural_counting` Skill