Data Ingestion & Knowledge Sources
Supported formats – PDFs and website crawling only (max 2,000 pages)
Plan limits – Team: 10 files/3 sites, Business: 30 files/10 sites
⚠️ NO DOCX, TXT, CSV, Excel – No audio, video, code files (vs 1,400+ formats)
⚠️ NO cloud storage sync – No Google Drive, Dropbox, OneDrive, Notion, Confluence
Multimodal Indexing – Indexes PDF, Word, Excel, PowerPoint, web pages, images with OCR, audio/video transcription
Programmatic Ingestion – REST API, Python/JS SDKs, CLI for automated data uploads
Sync Agent – Auto-watches cloud drives and sitemaps for continuous nonstop indexing
Language Support – Handles virtually any text-based language (non-pictogram languages only)
1,400+ file formats – PDF, DOCX, Excel, PowerPoint, Markdown, HTML + auto-extraction from ZIP/RAR/7Z archives
Website crawling – Sitemap indexing with configurable depth for help docs, FAQs, and public content
Multimedia transcription – AI Vision, OCR, YouTube/Vimeo/podcast speech-to-text built-in
Cloud integrations – Google Drive, SharePoint, OneDrive, Dropbox, Notion with auto-sync
Knowledge platforms – Zendesk, Freshdesk, HubSpot, Confluence, Shopify connectors
Massive scale – 60M words (Standard) / 300M words (Premium) per bot with no performance degradation
Messaging platforms – Website, Facebook, WhatsApp, Apple Messages, Telegram, SMS, email
200+ marketplace integrations – Zapier (5,000+ apps), HubSpot, Salesforce, Zendesk, Intercom
E-commerce platforms – Shopify, WooCommerce, BigCommerce native plugins
✅ Custom integrations – Webhooks with JSON payloads, 10s response timeout
No-Code Widget – Drop search/Q&A panel onto websites in minutes
Workflow Automation – n8n and Zapier integration for thousands of service connections
API-First Design – Embed into any channel via REST API/SDKs
⚠️ No Native Bots – No one-click Slack/Teams bots (requires custom development)
Website embedding – Lightweight JS widget or iframe with customizable positioning
CMS plugins – WordPress, WIX, Webflow, Framer, SquareSpace native support
5,000+ app ecosystem – Zapier connects CRMs, marketing, e-commerce tools
MCP Server – Integrate with Claude Desktop, Cursor, ChatGPT, Windsurf
OpenAI SDK compatible – Drop-in replacement for OpenAI API endpoints
LiveChat + Slack – Native chat widgets with human handoff capabilities
200+ Integration Ecosystem ( Core Differentiator)
✅ Mature marketplace – 200+ pre-built integrations after 20+ years development
✅ Enterprise CRM depth – Deep Salesforce, HubSpot, Zendesk, Intercom integrations
✅ E-commerce validated – Official Shopify/WooCommerce/BigCommerce plugins
✅ Webhook flexibility – JSON payloads, 10s timeouts for custom workflows (8/10 differentiator)
N/A
N/A
Agent Chat A P I v3.5 ( Differentiator)
Dual transport – REST API and WebSocket (RTM) for real-time communication
Official SDKs – JavaScript/Node.js, iOS (Swift), Android (Kotlin), Customer SDK
OAuth 2.1 PKCE – Modern auth plus Personal Access Tokens
⚠️ Rate limits – 180 requests/min may constrain high-volume apps
⚠️ Chat ops focus – No RAG APIs for semantic search or embedding management
N/A
N/A
AI Reply Suggestions – Context-based response recommendations from knowledge sources
Text Enhancement – Grammar correction and tone polishing before sending
AI Insights – Analyzes 1,000+ queries in 30s for trends
⚠️ NO anti-hallucination – No citation attribution or source tracing
Agentic RAG – Autonomous decision-making for retrieval strategies vs manual configuration
CrewAI Integration – Official integration for orchestrating autonomous AI agent teams
AI Search Copilot – Customizable LLM agents for employee support and customer service
Ingestion Agents (Beta) – Auto-labeling, summarization, graph extraction, Q&A generation, content safety
⚠️ Missing Features – No lead capture, human handoff, proactive alerting workflows
Custom AI Agents – Autonomous GPT-4/Claude agents for business tasks
Multi-Agent Systems – Specialized agents for support, sales, knowledge
Memory & Context – Persistent conversation history across sessions
Tool Integration – Webhooks + 5,000 Zapier apps for automation
Continuous Learning – Auto re-indexing without manual retraining
Chat Bot Automation ( Separate Product)
Visual builder – Drag-and-drop no-code chatbot with NLP
Proprietary engine – Internal NLP, doesn't use OpenAI/Bard/Bing
⚠️ Additional cost – $52/month on top of LiveChat subscription
⚠️ NO LLM selection – Proprietary only, no GPT-4/Claude/Gemini access
N/A
N/A
Widget Customization & White- Labeling
Live editor – Theme presets, custom hex colors, logo uploads, positioning
WCAG 2.1 AA – Accessibility with screen readers, keyboard navigation
Mobile responsive – Device-specific settings and hiding options
⚠️ White-label Enterprise only – Requires custom pricing, minimum 5 seats
N/A
N/A
⚠️ NO model selection – Proprietary AI engine only
⚠️ NO GPT-4, Claude, Gemini – Cannot choose LLM providers
⚠️ NO BYOLLM – No custom models or fine-tuning
⚠️ Opaque processing – Architecture and training data not documented (3/10 flexibility)
Model-Agnostic – OpenAI, Azure OpenAI, Google PaLM 2, Cohere, Anthropic, Hugging Face
100% Private GenAI – Keep all processing on Nuclia infrastructure without third-party exposure
Hugging Face Integration – Drop in open-source or domain-specific models easily
Flexible Switching – Swap models per query to optimize cost vs quality balance
GPT-5.1 models – Latest thinking models (Optimal & Smart variants)
GPT-4 series – GPT-4, GPT-4 Turbo, GPT-4o available
Claude 4.5 – Anthropic's Opus available for Enterprise
Auto model routing – Balances cost/performance automatically
Zero API key management – All models managed behind the scenes
Developer Experience ( A P I & S D Ks)
Agent Chat API v3.5 – REST and WebSocket with OAuth 2.1 PKCE
Mobile SDKs – iOS (Swift, iOS 15.6+), Android (Kotlin via Gradle)
✅ Strong documentation – Postman collections, tutorials, Discord community
⚠️ NO Python SDK – JavaScript/mobile only, limits backend integration
Rich APIs – REST APIs, Python/JS SDKs, CLI for ingestion and querying
Modular Design – Index first, query later workflow fits dev processes
Self-Host Option – On-prem NucliaDB for complete infrastructure control
Open-Source – NucliaDB GitHub (710+ stars, AGPLv3) with transparent code
REST API – Full-featured for agents, projects, data ingestion, chat queries
Python SDK – Open-source customgpt-client with full API coverage
Postman collections – Pre-built requests for rapid prototyping
Webhooks – Real-time event notifications for conversations and leads
OpenAI compatible – Use existing OpenAI SDK code with minimal changes
R A G Implementation & Accuracy
⚠️ NOT RAG-as-a-Service – No vector database, embeddings, retrieval pipeline
⚠️ NO chunking/embedding controls – Size, overlap, model selection not exposed
⚠️ NO hybrid search – No keyword + semantic combination
⚠️ NO anti-hallucination – No citations, source verification (2/10 RAG platform)
N/A
N/A
Real-time delivery – Sub-second message delivery, praised for reliability
Scalability – 37,000+ businesses, enterprise customers (Adobe, PayPal, IKEA)
⚠️ NO accuracy benchmarks – No published AI retrieval metrics
⚠️ Operational focus – Metrics for queue times, agent response, not AI accuracy
Quality-Based RAG – Focused on trusted, source-linked answers with citation attribution
Hybrid Search – Tune semantic vs keyword weighting for domain precision
Entity Enrichment – Summaries and entity extraction improve Q&A quality
Large-Scale Ready – Scales to large datasets; speed depends on LLM choice
Sub-second responses – Optimized RAG with vector search and multi-layer caching
Benchmark-proven – 13% higher accuracy, 34% faster than OpenAI Assistants API
Anti-hallucination tech – Responses grounded only in your provided content
OpenGraph citations – Rich visual cards with titles, descriptions, images
99.9% uptime – Auto-scaling infrastructure handles traffic spikes
✅ Seven certifications – SOC 2, GDPR, HIPAA, ISO 27001, PCI DSS, FedRAMP, CSA Star
Encryption – TLS transit, AES-256 at rest
Data residency – EU (Poland) compliance with account isolation
⚠️ SSO/SAML Enterprise only – Significant gap for mid-market identity management
GDPR Compliant – EU-based with strict data protection; no cross-customer training
Data Isolation – Knowledge Boxes with disk encryption and tenant separation
On-Prem Deployment – Self-host NucliaDB for complete data residency and control
Enterprise SSO – Role-based access with regional data centers (EU available)
SOC 2 Type II + GDPR – Third-party audited compliance
Encryption – 256-bit AES at rest, SSL/TLS in transit
Access controls – RBAC, 2FA, SSO, domain allowlisting
Data isolation – Never trains on your data
Full Compliance Portfolio ( Core Differentiator)
✅ Seven certifications – SOC 2, GDPR, HIPAA, ISO 27001, PCI DSS, FedRAMP, CSA Star
✅ FedRAMP unique – Rare federal authorization enables government contracts
✅ PCI DSS masking – Built-in payment card protection for financial services
✅ HIPAA BAA – Business Associate Agreement for healthcare compliance (9/10 differentiator)
N/A
N/A
Observability & Monitoring
Real-time monitoring – Agent status, queue depth, visitor activity dashboards
Chat metrics – Volume, missed chats, response times, agent performance, CSAT
Staffing predictions – AI scheduling optimization (Business+ plans)
⚠️ NO AI metrics – No retrieval accuracy, semantic search, hallucination monitoring
Usage Dashboard – Track token spend for indexing and query operations
Activity Logs – Audit trails for ingestion and query events
API/CLI Access – Send logs to Splunk, Elastic, or monitoring tools
⚠️ Custom Analytics – Conversation analytics require custom front-end implementation
Real-time dashboard – Query volumes, token usage, response times
Customer Intelligence – User behavior patterns, popular queries, knowledge gaps
Conversation analytics – Full transcripts, resolution rates, common questions
Export capabilities – API export to BI tools and data warehouses
Starter – $20/agent/month, 60-day history, 1 user
Team – $41/agent/month, unlimited history, 10 files/3 sites
Business – $59/agent/month, staffing predictions, 30 files/10 sites
Enterprise – Custom pricing, min 5 seats, SSO/SAML, white-label, HIPAA
⚠️ Cost escalation – 10 agents + ChatBot = $642/month ($7,704/year)
Consumption Model – Base license plus usage costs (indexing, queries, LLM calls)
Granular Control – Light usage stays cheap, scales automatically for heavy use
Free Trial – Hands-on evaluation before committing to paid plans
On-Prem Economics – Self-hosting provides cost control with existing infrastructure
Standard: $99/mo – 60M words, 10 bots
Premium: $449/mo – 300M words, 100 bots
Auto-scaling – Managed cloud scales with demand
Flat rates – No per-query charges
✅ 24/7 support – Live chat and email, consistently praised (4.5/5 G2, 4.6/5 Capterra)
Developer docs – Extensive at developers.livechat.com with Postman, tutorials
Enterprise SLA – Guaranteed response times on custom contracts
Common criticisms – Rising prices, per-agent scaling costs, fragmented ChatBot product
Documentation – Comprehensive docs, API reference, code samples, quick-start guides
Community – Slack community, Stack Overflow, developer forums for peer support
LangChain Integration – Official integration with popular AI frameworks
Enterprise Support – Dedicated assistance for on-prem/hybrid installations
Comprehensive docs – Tutorials, cookbooks, API references
Email + in-app support – Under 24hr response time
Premium support – Dedicated account managers for Premium/Enterprise
Open-source SDK – Python SDK, Postman, GitHub examples
5,000+ Zapier apps – CRMs, e-commerce, marketing integrations
Additional Considerations
⚠️ Platform classification – Human-agent live chat with AI, NOT RAG-as-a-Service
✅ Primary strength – Customer support workflows with 200+ integrations, seven certifications
⚠️ Knowledge gap – PDFs/websites only, NO DOCX/CSV/Excel/audio/video/code files
⚠️ NO RAG infrastructure – No vector DB, embeddings, chunking, similarity thresholds
✅ Beyond Search – AI search, Q&A, classification, multi-language multimodal indexing
✅ Enterprise RAG – Replace legacy search across text, audio, video content
✅ Open-Source Core – Reduces lock-in, enables extension and self-hosting
⚠️ DevOps Effort – Advanced setups need ML/DevOps resources for deployment
Time-to-value – 2-minute deployment vs weeks with DIY
Always current – Auto-updates to latest GPT models
Proven scale – 6,000+ organizations, millions of queries
Multi-LLM – OpenAI + Claude reduces vendor lock-in
Proprietary engine – Internal NLP, doesn't rely on OpenAI/Bard/Bing
⚠️ NO LLM selection – Cannot choose GPT-4, Claude, Gemini
⚠️ Opaque architecture – Model training data, capabilities not documented
⚠️ NO BYOLLM – No model routing, fine-tuning, custom models (3/10 flexibility)
Model-Agnostic – OpenAI, Azure OpenAI, PaLM 2, Cohere, Anthropic, Hugging Face
Private GenAI – 100% Nuclia-hosted infrastructure option for data isolation
Flexible Switching – Swap models per query without architectural changes
Local Models – Self-hosted models supported (requires extra setup)
✅ Developer Freedom – Choose optimal LLM per Knowledge Box or query
OpenAI – GPT-5.1 (Optimal/Smart), GPT-4 series
Anthropic – Claude 4.5 Opus/Sonnet (Enterprise)
Auto-routing – Intelligent model selection for cost/performance
Managed – No API keys or fine-tuning required
⚠️ NOT RAG platform – No vector DB, embedding controls, retrieval pipeline
⚠️ Limited sources – PDFs and websites only (max 2,000 pages, 10-30 files)
⚠️ Human-agent focus – Agent workflows, not autonomous retrieval (2/10 RAG rating)
Quality-Based RAG – Trusted answers with comprehensive source citations for transparency
Hybrid Search – Combine semantic vectors with keyword matching for precision
Customizable Chunking – Adjust chunk sizes, weighting, segmentation for context windows
Knowledge Graph – Auto entity/relationship extraction enriches corpus for Q&A
Multimodal Indexing – OCR for images, speech-to-text creates comprehensive knowledge base
✅ Open Architecture – NucliaDB open-source provides transparency vs black-box competitors
GPT-4 + RAG – Outperforms OpenAI in independent benchmarks
Anti-hallucination – Responses grounded in your content only
Automatic citations – Clickable source links in every response
Sub-second latency – Optimized vector search and caching
Scale to 300M words – No performance degradation at scale
✅ Customer support – Live chat for 37,000+ businesses with AI agent augmentation
✅ E-commerce – Shopify/WooCommerce/BigCommerce native integrations
✅ Multi-channel – Website, Facebook, WhatsApp, Apple Messages, Telegram, SMS
⚠️ NOT suitable for – Autonomous knowledge retrieval, programmatic document search
Enterprise Search – Modernize legacy search with AI across text/audio/video
Customer Support – Internal Q&A for support teams from product documentation
Multimodal Discovery – Unified search across PDFs, videos, audio, presentations
Regulatory Compliance – GDPR-compliant retrieval with data residency guarantees
Developer RAG Backend – API-first infrastructure for custom AI applications
Customer support – 24/7 AI handling common queries with citations
Internal knowledge – HR policies, onboarding, technical docs
Sales enablement – Product info, lead qualification, education
Documentation – Help centers, FAQs with auto-crawling
E-commerce – Product recommendations, order assistance
✅ Seven certifications – SOC 2, GDPR, HIPAA, ISO 27001, PCI DSS, FedRAMP, CSA Star
✅ Data privacy – Customer data never used for training, zero-retention policies
Encryption – TLS transit, AES-256 at rest, regional data residency
⚠️ HIPAA/SSO Enterprise only – Min 5 seats ($6,000/year minimum)
GDPR Compliant – EU-based; customer data never used for model training
Data Isolation – Knowledge Boxes with disk encryption; zero cross-training
On-Prem Options – Self-host NucliaDB and local LLMs for data sovereignty
Enterprise SSO – Identity provider integration with role-based access control
Regional Centers – EU and other regions for local data residency laws
SOC 2 Type II + GDPR – Regular third-party audits, full EU compliance
256-bit AES encryption – Data at rest; SSL/TLS in transit
SSO + 2FA + RBAC – Enterprise access controls with role-based permissions
Data isolation – Never trains on customer data
Domain allowlisting – Restrict chatbot to approved domains
Starter – $20/agent/month, 60-day history, 1 user
Team – $41/agent/month, unlimited history, 400 users, AI features
Business – $59/agent/month, staffing predictions, full AI features
Enterprise – Custom pricing (min 5 seats), SSO/SAML, white-label, HIPAA
⚠️ Cost escalation – Per-agent model criticized for scaling vs project pricing
Pricing Model – License plus consumption (base plus indexing/queries/LLM calls)
Free Trial – Available for hands-on evaluation before commitment
Granular Control – Pay for usage; light stays cheap, scales automatically
Transparent Billing – Storage, indexing, queries, LLM usage clearly itemized
Best Value For – Organizations optimizing costs through usage control vs seat-based pricing
Standard: $99/mo – 10 chatbots, 60M words, 5K items/bot
Premium: $449/mo – 100 chatbots, 300M words, 20K items/bot
Enterprise: Custom – SSO, dedicated support, custom SLAs
7-day free trial – Full Standard access, no charges
Flat-rate pricing – No per-query charges, no hidden costs
✅ 24/7 support – Live chat/email, consistently praised (4.5/5 G2, 4.6/5 Capterra)
Developer docs – Extensive at developers.livechat.com with Postman, tutorials
Enterprise SLA – Guaranteed response times, uptime commitments
SDK support – JavaScript, iOS (Swift), Android (Kotlin), Customer SDK
Comprehensive Docs – docs.nuclia.dev with guides, API reference, code examples
Active Community – Slack community, Stack Overflow, developer forums
Open-Source – NucliaDB GitHub (710+ stars, AGPLv3) with transparent code
LangChain Integration – Official integration for AI framework ecosystem compatibility
Enterprise Support – Dedicated support for on-prem/hybrid installations
Documentation hub – Docs, tutorials, API references
Support channels – Email, in-app chat, dedicated managers (Premium+)
Open-source – Python SDK, Postman, GitHub examples
Community – User community + 5,000 Zapier integrations
Limitations & Considerations
⚠️ NOT RAG platform – Human-agent live chat, not autonomous retrieval (2/10 RAG)
⚠️ Limited formats – PDFs/websites only, NO DOCX/CSV/Excel/audio/video/code
⚠️ NO LLM flexibility – Proprietary engine only, no GPT-4/Claude/Gemini
⚠️ Fragmented product – ChatBot requires separate $52/month purchase
⚠️ Cost scaling – Per-agent pricing criticized, 10 agents = $642/month
⚠️ API-First Complexity – Developer-focused; not plug-and-play for non-technical teams
⚠️ No Turnkey UI – Advanced branding requires building custom front-end
⚠️ No Native Bots – Slack/Teams bots require custom API development
⚠️ Language Limits – Cannot index pictogram-based languages (Japanese, Chinese characters)
⚠️ Learning Curve – Advanced RAG parameters feel technical for beginners
✅ Best For Developers – Capable platform for technical teams with resources
Managed service – Less control over RAG pipeline vs build-your-own
Model selection – OpenAI + Anthropic only; no Cohere, AI21, open-source
Real-time data – Requires re-indexing; not ideal for live inventory/prices
Enterprise features – Custom SSO only on Enterprise plan
N/A
AI Search + Q&A – Semantic search with generative answers from your data
Source Citations – Shows exact sources for answer transparency and trust
Auto-Summarization – Summarizes long docs with entity recognition and AI classification
Multi-Turn Chat – Handles one-shot Q&A and conversational multi-turn interactions
✅ #1 accuracy – Median 5/5 in independent benchmarks, 10% lower hallucination than OpenAI
✅ Source citations – Every response includes clickable links to original documents
✅ 93% resolution rate – Handles queries autonomously, reducing human workload
✅ 92 languages – Native multilingual support without per-language config
✅ Lead capture – Built-in email collection, custom forms, real-time notifications
✅ Human handoff – Escalation with full conversation context preserved
N/A
Widget Styling – No-code widget offers basic styling customization options
Custom Prompts – Set system prompts to adjust tone and response style
Custom UI – Build fully branded front-end on flexible API
⚠️ Advanced Branding – Deep customization requires developer resources and API integration
Full white-labeling included – Colors, logos, CSS, custom domains at no extra cost
2-minute setup – No-code wizard with drag-and-drop interface
Persona customization – Control AI personality, tone, response style via pre-prompts
Visual theme editor – Real-time preview of branding changes
Domain allowlisting – Restrict embedding to approved sites only
Customization & Flexibility ( Behavior & Knowledge) N/A
Retrieval Tuning – Adjust chunk sizes, weighting, metadata filters for precision
Per-Query Prompts – Custom prompts per query for dynamic persona/style
Knowledge Boxes – Multiple isolated data spaces with tags for scope
Structured Output – Return JSON or fine-tune private models for specific needs
Live content updates – Add/remove content with automatic re-indexing
System prompts – Shape agent behavior and voice through instructions
Multi-agent support – Different bots for different teams
Smart defaults – No ML expertise required for custom behavior
No- Code Interface & Usability N/A
No-Code Dashboard – Create Knowledge Box → upload data → tune → embed widget
Defaults Work – Out-of-box settings are production-ready for basic use
⚠️ Technical Sliders – Advanced retrieval/prompt options may confuse non-technical users
⚠️ Custom UI Needed – Full branding requires building on API front-end
2-minute deployment – Fastest time-to-value in the industry
Wizard interface – Step-by-step with visual previews
Drag-and-drop – Upload files, paste URLs, connect cloud storage
In-browser testing – Test before deploying to production
Zero learning curve – Productive on day one
N/A
Market Position – API-first RAG platform with multimodal indexing and model-agnostic architecture
Target Customers – Dev teams needing text/audio/video search with on-prem/hybrid deployment options
Key Competitors – Deepset/Haystack, Vectara.ai, Azure AI Search, custom RAG implementations
Competitive Advantages – Multimodal OCR/speech-to-text, 100% private GenAI, open-source NucliaDB portability
Pricing Advantage – Consumption model controls costs; light usage cheap, enterprise-ready scalability
Market position – Leading RAG platform balancing enterprise accuracy with no-code usability. Trusted by 6,000+ orgs including Adobe, MIT, Dropbox.
Key differentiators – #1 benchmarked accuracy • 1,400+ formats • Full white-labeling included • Flat-rate pricing
vs OpenAI – 10% lower hallucination, 13% higher accuracy, 34% faster
vs Botsonic/Chatbase – More file formats, source citations, no hidden costs
vs LangChain – Production-ready in 2 min vs weeks of development
R A G-as-a- Service Assessment N/A
Platform Type – TRUE RAG-AS-A-SERVICE with managed infrastructure and API-first design
Agentic Focus – Progress Agentic RAG with autonomous decision-making capabilities
Fully Managed – Hosted NucliaDB with auto-scaling, chunking, embedding, no infrastructure management
Model-Agnostic Service – Switch LLM providers without architectural changes (OpenAI, Azure, PaLM, Anthropic)
Agent-Ready – Only RAG platform designed for AI agents (CrewAI, LangChain compatible)
Hybrid Deployment – Cloud-managed with on-prem NucliaDB for data sovereignty
Platform type – TRUE RAG-AS-A-SERVICE with managed infrastructure
API-first – REST API, Python SDK, OpenAI compatibility, MCP Server
No-code option – 2-minute wizard deployment for non-developers
Hybrid positioning – Serves both dev teams (APIs) and business users (no-code)
Enterprise ready – SOC 2 Type II, GDPR, WCAG 2.0, flat-rate pricing
Join the Discussion
Loading comments...