Data Ingestion & Knowledge Sources
Website Crawling – Entire sites by URL/sitemap with thousands of pages
File Uploads – CSV, TXT, PDF, DOCX, PPTX, Markdown up to 10MB
Cloud Storage Connectors – Google Drive, Dropbox, OneDrive, Notion, Confluence, GitBook
Enterprise Scale – Up to 100K pages on Enterprise tier
Manual Retraining – ⚠️ Click-button refresh, automated cycles on roadmap
Document support – PDF, DOCX, HTML automatically indexed (Vectara Platform )
Auto-sync connectors – Cloud storage and enterprise system integrations keep data current
Embedding processing – Background conversion to embeddings enables fast semantic search
1,400+ file formats – PDF, DOCX, Excel, PowerPoint, Markdown, HTML + auto-extraction from ZIP/RAR/7Z archives
Website crawling – Sitemap indexing with configurable depth for help docs, FAQs, and public content
Multimedia transcription – AI Vision, OCR, YouTube/Vimeo/podcast speech-to-text built-in
Cloud integrations – Google Drive, SharePoint, OneDrive, Dropbox, Notion with auto-sync
Knowledge platforms – Zendesk, Freshdesk, HubSpot, Confluence, Shopify connectors
Massive scale – 60M words (Standard) / 300M words (Premium) per bot with no performance degradation
Native Connectors – Slack, Google Chat, Messenger, Crisp, Freshchat, Zendesk, Zoho
Website Embed – Quick script or iframe for any site
Webhook Support – Higher tiers add event-driven integrations
REST APIs & SDKs – Easy integration into custom applications with comprehensive tooling
Embedded experiences – Search/chat widgets for websites, mobile apps, custom portals
Low-code connectors – Azure Logic Apps and PowerApps simplify workflow integration
Website embedding – Lightweight JS widget or iframe with customizable positioning
CMS plugins – WordPress, WIX, Webflow, Framer, SquareSpace native support
5,000+ app ecosystem – Zapier connects CRMs, marketing, e-commerce tools
MCP Server – Integrate with Claude Desktop, Cursor, ChatGPT, Windsurf
OpenAI SDK compatible – Drop-in replacement for OpenAI API endpoints
LiveChat + Slack – Native chat widgets with human handoff capabilities
Multi-Turn Q&A – Conversation history visible in admin dashboard
95+ Languages – Global audience support out-of-box
Lead Capture – Automatic collection during chat sessions
Human Handoff – Built-in escalation to live agents
Sentiment Tracking – Real-time conversation metrics and performance
Vector + LLM search – Smart retrieval with generative answers, context-aware responses
Mockingbird LLM – Proprietary model with source citations (details )
Multi-turn conversations – Conversation history tracking for smooth back-and-forth dialogue
✅ #1 accuracy – Median 5/5 in independent benchmarks, 10% lower hallucination than OpenAI
✅ Source citations – Every response includes clickable links to original documents
✅ 93% resolution rate – Handles queries autonomously, reducing human workload
✅ 92 languages – Native multilingual support without per-language config
✅ Lead capture – Built-in email collection, custom forms, real-time notifications
✅ Human handoff – Escalation with full conversation context preserved
No-Code Dashboard – Swap logos, colors, welcome text in seconds
White-Label Option – Remove SiteGPT branding for clean look
Preset Personas – Set tone and voice per bot
White-label control – Full theming, logos, CSS customization for brand alignment
Domain restrictions – Bot scope and branding configurable per deployment
Search UI styling – Result cards and search interface match company identity
Full white-labeling included – Colors, logos, CSS, custom domains at no extra cost
2-minute setup – No-code wizard with drag-and-drop interface
Persona customization – Control AI personality, tone, response style via pre-prompts
Visual theme editor – Real-time preview of branding changes
Domain allowlisting – Restrict embedding to approved sites only
GPT-4o-mini – Fast mode for speed-optimized responses
GPT-4o – Accurate mode for deeper comprehensive answers
Per-Chatbot Selection – Choose mode balancing speed vs depth
OpenAI Only – ⚠️ No Claude, Gemini, or custom models
Mockingbird default – In-house model with GPT-4/GPT-3.5 via Azure OpenAI available
Flexible selection – Choose model balancing cost versus quality for use case
Custom prompts – Prompt templates configurable for tone, format, citation rules
GPT-5.1 models – Latest thinking models (Optimal & Smart variants)
GPT-4 series – GPT-4, GPT-4 Turbo, GPT-4o available
Claude 4.5 – Anthropic's Opus available for Enterprise
Auto model routing – Balances cost/performance automatically
Zero API key management – All models managed behind the scenes
Developer Experience ( A P I & S D Ks)
REST API – Bot management, content uploads, answer retrieval
Persona Management – Configure Quick Prompts and Personas via API
No Multi-Language SDKs – ⚠️ REST only, straightforward integration
Multi-language SDKs – C#, Python, Java, JavaScript with REST API (FAQs )
Clear documentation – Sample code and guides for integration, indexing operations
Secure authentication – Azure AD or custom auth setup for API access
REST API – Full-featured for agents, projects, data ingestion, chat queries
Python SDK – Open-source customgpt-client with full API coverage
Postman collections – Pre-built requests for rapid prototyping
Webhooks – Real-time event notifications for conversations and leads
OpenAI compatible – Use existing OpenAI SDK code with minimal changes
RAG Foundation – Keeps answers factual and grounded in content
Fast vs Accurate Modes – Choose speed or depth per use case
Fallback Handling – Graceful replies and handoff for edge cases
✅ Enterprise scale – Millisecond responses with heavy traffic (benchmarks )
✅ Hybrid search – Semantic and keyword matching for pinpoint accuracy
✅ Hallucination prevention – Advanced reranking with factual-consistency scoring
Sub-second responses – Optimized RAG with vector search and multi-layer caching
Benchmark-proven – 13% higher accuracy, 34% faster than OpenAI Assistants API
Anti-hallucination tech – Responses grounded only in your provided content
OpenGraph citations – Rich visual cards with titles, descriptions, images
99.9% uptime – Auto-scaling infrastructure handles traffic spikes
Customization & Flexibility ( Behavior & Knowledge)
One-Click Retrain – Upload files or re-crawl site without tech skills
Personas & Prompts – Steer conversation style with templates
Multiple Chatbots – Run separate bots per team or dataset
Indexing control – Configure chunk sizes, metadata tags, retrieval parameters
Search weighting – Tune semantic vs lexical search balance per query
Domain tuning – Adjust prompt templates and relevance thresholds for specialty domains
Live content updates – Add/remove content with automatic re-indexing
System prompts – Shape agent behavior and voice through instructions
Multi-agent support – Different bots for different teams
Smart defaults – No ML expertise required for custom behavior
Growth Plan – ~$79/month with basic features and GPT-3.5
Pro/Scale Plan – ~$259/month with GPT-4 and higher limits
Enterprise Tier – Custom pricing for 100K+ pages and white-label
Scalable Limits – Message counts, bots, pages scale with tier
Usage-based pricing – Free tier available, bundles scale with growth (pricing )
Enterprise tiers – Plans scale with query volume, data size for heavy usage
Dedicated deployment – VPC or on-prem options for data isolation requirements
Standard: $99/mo – 60M words, 10 bots
Premium: $449/mo – 300M words, 100 bots
Auto-scaling – Managed cloud scales with demand
Flat rates – No per-query charges
HTTPS/TLS Encryption – Industry-standard transit security
Encrypted Storage – Data at rest protection
Workspace Isolation – Data stays in your workspace
No Formal Certifications – ⚠️ SOC 2, ISO 27001, HIPAA not disclosed
✅ Data encryption – Transit and rest encryption, no model training on your content
✅ Compliance certifications – SOC 2, ISO, GDPR, HIPAA (details )
✅ Customer-managed keys – BYOK support with private deployments for full control
SOC 2 Type II + GDPR – Third-party audited compliance
Encryption – 256-bit AES at rest, SSL/TLS in transit
Access controls – RBAC, 2FA, SSO, domain allowlisting
Data isolation – Never trains on your data
Observability & Monitoring
Analytics Dashboard – Chat histories, trends, metrics in one place
Daily Email Digests – Team updates without logging in
Azure portal dashboard – Query latency, index health, usage metrics at-a-glance
Azure Monitor integration – Azure Monitor and App Insights for custom alerts
API log exports – Metrics exportable via API for compliance, analysis reports
Real-time dashboard – Query volumes, token usage, response times
Customer Intelligence – User behavior patterns, popular queries, knowledge gaps
Conversation analytics – Full transcripts, resolution rates, common questions
Export capabilities – API export to BI tools and data warehouses
Email Support – Submit requests for technical assistance
Feature Request Form – Dedicated channel for integrations/features
Active Blog – Product updates and best practices
Agency Partner Program – Growing ecosystem for resellers
Microsoft network – Comprehensive docs, forums, technical guides backed by Microsoft
Enterprise support – Dedicated channels and SLA-backed help for enterprise plans
Azure ecosystem – Broad partner network and active developer community access
Comprehensive docs – Tutorials, cookbooks, API references
Email + in-app support – Under 24hr response time
Premium support – Dedicated account managers for Premium/Enterprise
Open-source SDK – Python SDK, Postman, GitHub examples
5,000+ Zapier apps – CRMs, e-commerce, marketing integrations
GPT-4o Full Model – Flagship multimodal for deeper reasoning
GPT-4o-mini – Faster cost-optimized variant for high volume
GPT-3.5-turbo – Default for lower-tier plans
No Custom Models – ⚠️ Limited to OpenAI models only
✅ Mockingbird LLM – 26% better than GPT-4 on BERT F1, 0.9% hallucination rate
✅ Mockingbird 2 – 7 languages (EN/ES/FR/AR/ZH/JA/KO), under 10B parameters
GPT-4/GPT-3.5 fallback – Azure OpenAI integration for OpenAI model preference
HHEM + HCM – Hughes Hallucination Evaluation with Correction Model (Mockingbird-2-Echo)
✅ No training on data – Customer data never used for model training/improvement
Custom prompts – Templates configurable for tone, format, citation rules per domain
OpenAI – GPT-5.1 (Optimal/Smart), GPT-4 series
Anthropic – Claude 4.5 Opus/Sonnet (Enterprise)
Auto-routing – Intelligent model selection for cost/performance
Managed – No API keys or fine-tuning required
Website Crawling – Thousands of pages in single operation
RAG Architecture – Grounds responses in uploaded/crawled content
File Support – CSV, TXT, PDF, DOCX, PPTX, Markdown (10MB limit)
Multi-Turn Context – Conversation history for coherent interactions
Manual Retraining – ⚠️ No auto-sync, click to update
✅ Hybrid search – Semantic vector + BM25 keyword matching for pinpoint accuracy
✅ Advanced reranking – Multi-stage pipeline optimizes results before generation with relevance scoring
✅ Factual scoring – HHEM provides reliability score for every response's grounding quality
✅ Citation precision – Mockingbird outperforms GPT-4 on citation metrics, traceable to sources
Multilingual RAG – Cross-lingual: query/retrieve/generate in different languages (7 supported)
Structured outputs – Extract specific information for autonomous agent integration, deterministic data
GPT-4 + RAG – Outperforms OpenAI in independent benchmarks
Anti-hallucination – Responses grounded in your content only
Automatic citations – Clickable source links in every response
Sub-second latency – Optimized vector search and caching
Scale to 300M words – No performance degradation at scale
Customer Support Automation – 24/7 answers reducing ticket volume
Website Knowledge Conversion – Rapid chatbot from existing content
Multi-Channel Support – Unified bot across Slack, Messenger, Zendesk
Lead Generation – Automatic capture with CRM webhooks
Global Support – 95+ languages for worldwide service
Regulated industries – Health, legal, finance needing accuracy, security, SOC 2 compliance
Enterprise knowledge – Q&A systems with precise answers from large document repositories
Autonomous agents – Structured outputs for deterministic data extraction, decision-making workflows
White-label APIs – Customer-facing search/chat with millisecond responses at enterprise scale
Multilingual support – 7 languages with single knowledge base for multiple locales
High accuracy needs – Citation precision, factual scoring, 0.9% hallucination rate (Mockingbird-2-Echo)
Customer support – 24/7 AI handling common queries with citations
Internal knowledge – HR policies, onboarding, technical docs
Sales enablement – Product info, lead qualification, education
Documentation – Help centers, FAQs with auto-crawling
E-commerce – Product recommendations, order assistance
HTTPS/TLS – Encryption for data in transit
Encrypted Storage – Data at rest protection
Workspace Isolation – Customer data separated
No Compliance Docs – ⚠️ SOC 2, HIPAA not publicly disclosed
✅ SOC 2 Type 2 – Independent audit demonstrating enterprise-grade operational security controls
✅ ISO 27001 + GDPR – Information security management with EU data protection compliance
✅ HIPAA ready – Healthcare compliance with BAAs available for PHI handling
✅ Encryption – TLS 1.3 in transit, AES-256 at rest with BYOK support
✅ Zero data retention – No model training on customer data, content stays private
Private deployments – VPC or on-premise for data sovereignty and network isolation
SOC 2 Type II + GDPR – Regular third-party audits, full EU compliance
256-bit AES encryption – Data at rest; SSL/TLS in transit
SSO + 2FA + RBAC – Enterprise access controls with role-based permissions
Data isolation – Never trains on customer data
Domain allowlisting – Restrict chatbot to approved domains
Growth $79/mo – Website crawling, file uploads, GPT-3.5
Pro/Scale $259/mo – GPT-4, higher limits, webhooks
Enterprise Custom – 100K+ pages, white-label, dedicated support
7-Day Free Trial – Risk-free evaluation without credit card
30-day free trial – Full enterprise feature access for evaluation before commitment
Usage-based pricing – Pay for query volume and data size with scalable tiers
Free tier – Generous free tier for development, prototyping, small production deployments
Enterprise pricing – Custom pricing for VPC/on-prem installations, heavy usage bundles available
✅ Transparent pricing – No per-seat charges, storage surprises, or model switching fees
Funding – $53.5M raised ($25M Series A July 2024, FPV/Race Capital)
Standard: $99/mo – 10 chatbots, 60M words, 5K items/bot
Premium: $449/mo – 100 chatbots, 300M words, 20K items/bot
Enterprise: Custom – SSO, dedicated support, custom SLAs
7-day free trial – Full Standard access, no charges
Flat-rate pricing – No per-query charges, no hidden costs
Email Support – Technical assistance and feature questions
REST API Docs – API reference for developers
Active Blog – Product updates and use cases
Guided Dashboard – Tooltips and onboarding for new users
Enterprise support – Dedicated channels and SLA-backed help for enterprise customers
Microsoft network – Extensive infrastructure, forums, technical guides backed by Microsoft
Comprehensive docs – API references, integration guides, SDKs at docs.vectara.com
Sample code – Pre-built examples, Jupyter notebooks, quick-start guides for rapid integration
Active community – Developer forums for peer support, knowledge sharing, best practices
Documentation hub – Docs, tutorials, API references
Support channels – Email, in-app chat, dedicated managers (Premium+)
Open-source – Python SDK, Postman, GitHub examples
Community – User community + 5,000 Zapier integrations
Limitations & Considerations
OpenAI-Only – ⚠️ No Claude, Gemini, Llama, or custom models
Manual Retraining – ⚠️ No automatic content syncing yet
10MB File Limit – ⚠️ Per-file cap may constrain large docs
No Compliance Certs – ⚠️ May limit enterprise adoption
No Advanced RAG – ⚠️ Missing hybrid search, knowledge graphs
Cloud-Only – ⚠️ No self-hosting for air-gapped environments
⚠️ Azure ecosystem focus – Best with Azure services, less smooth for AWS/GCP organizations
⚠️ Developer expertise needed – Advanced indexing requires technical skills vs turnkey no-code tools
⚠️ No drag-and-drop GUI – Azure portal management but no chatbot builder like Tidio/WonderChat
⚠️ Limited model selection – Mockingbird/GPT-4/GPT-3.5 only, no Claude/Gemini/custom models
⚠️ Sales-driven pricing – Contact sales for enterprise pricing, less transparent than self-serve platforms
⚠️ Overkill for simple bots – Enterprise RAG unnecessary for basic FAQ or customer service
Managed service – Less control over RAG pipeline vs build-your-own
Model selection – OpenAI + Anthropic only; no Cohere, AI21, open-source
Real-time data – Requires re-indexing; not ideal for live inventory/prices
Enterprise features – Custom SSO only on Enterprise plan
Multi-Turn Conversation – History tracking for context-aware interactions
Sentiment Tracking – Real-time analysis and performance metrics
Lead Collection – Automatic capture with industry templates
Human Handoff – Built-in escalation workflows to live agents
Functions Framework – Trigger tickets, CRM updates from chat
Multi-Channel Integrations – Slack, Messenger, Zendesk, Freshchat connectors
Agentic RAG Framework – Python library for autonomous agents: emails, bookings, system integration
Agent APIs (Tech Preview) – Customizable reasoning models, behavioral instructions, tool access controls
LlamaIndex integration – Rapid tool creation connecting Vectara corpora, single-line code generation
Multi-LLM support – OpenAI, Anthropic, Gemini, GROQ, Together.AI, Cohere, AWS Bedrock integration
Step-level audit trails – Source citations, reasoning steps, decision paths for governance compliance
✅ Grounded actions – Document-grounded decisions with citations, 0.9% hallucination rate (Mockingbird-2-Echo)
⚠️ Developer platform – Requires programming expertise, not for non-technical teams
⚠️ No chatbot UI – No polished widgets or turnkey conversational interfaces
⚠️ Tech preview status – Agent APIs subject to change before general availability
Custom AI Agents – Autonomous GPT-4/Claude agents for business tasks
Multi-Agent Systems – Specialized agents for support, sales, knowledge
Memory & Context – Persistent conversation history across sessions
Tool Integration – Webhooks + 5,000 Zapier apps for automation
Continuous Learning – Auto re-indexing without manual retraining
R A G-as-a- Service Assessment
NO-CODE CHATBOT BUILDER – SMB-focused platform emphasizing rapid deployment
Core Mission – Quick website-to-chatbot conversion with multi-channel support
Target Market – SMB support teams, not developer/RAG infrastructure market
RAG Implementation – Grounding responses in content, fallback handling
Managed SaaS – Guided dashboard, pre-built integrations, 7-day trial
Not Pure RAG-as-a-Service – ⚠️ Combines chatbot building with RAG
Platform Type – TRUE ENTERPRISE RAG-AS-A-SERVICE: Agent OS for trusted AI
Core Mission – Deploy AI assistants/agents with grounded answers, safe actions, always-on governance
Target Market – Enterprises needing production RAG, white-label APIs, VPC/on-prem deployments
RAG Implementation – Mockingbird LLM (26% better than GPT-4), hybrid search, multi-stage reranking
API-First Architecture – REST APIs, SDKs (C#/Python/Java/JS), Azure integration (Logic Apps/Power BI)
Security & Compliance – SOC 2 Type 2, ISO 27001, GDPR, HIPAA, BYOK, VPC/on-prem
Agent-Ready Platform – Python library, Agent APIs, structured outputs, audit trails, policy enforcement
Advanced RAG Features – Hybrid search, reranking, HHEM scoring, multilingual retrieval (7 languages)
Funding – $53.5M raised ($25M Series A July 2024, FPV/Race Capital)
⚠️ Enterprise complexity – Requires developer expertise for indexing, tuning, agent configuration
⚠️ No no-code builder – Azure portal management but no drag-and-drop chatbot builder
⚠️ Azure ecosystem focus – Best with Azure, less smooth for AWS/GCP cross-cloud flexibility
Use Case Fit – Mission-critical RAG, regulated industries (SOC 2/HIPAA), white-label APIs, VPC/on-prem
Platform type – TRUE RAG-AS-A-SERVICE with managed infrastructure
API-first – REST API, Python SDK, OpenAI compatibility, MCP Server
No-code option – 2-minute wizard deployment for non-developers
Hybrid positioning – Serves both dev teams (APIs) and business users (no-code)
Enterprise ready – SOC 2 Type II, GDPR, WCAG 2.0, flat-rate pricing
Market Position – No-code RAG chatbot for SMB customer service
Target Customers – Small/mid-size businesses needing quick deployment
Key Competitors – Chatbase.co, Botsonic, Ragie.ai, WonderChat
Advantages – Comprehensive crawling (100K pages), native multi-channel, Functions feature
Best For – Website-based chatbots, multi-channel support, no-code setup
Market position – Enterprise RAG platform between Azure AI Search and chatbot builders
Target customers – Enterprises needing production RAG, white-label APIs, VPC/on-prem deployments
Key competitors – Azure AI Search, Coveo, OpenAI Enterprise, Pinecone Assistant
Competitive advantages – Mockingbird LLM, hallucination detection, SOC 2/HIPAA compliance, millisecond responses
Pricing advantage – Usage-based with free tier, best value for enterprise RAG infrastructure
Use case fit – Mission-critical RAG, white-label APIs, Azure integration, high-accuracy requirements
Market position – Leading RAG platform balancing enterprise accuracy with no-code usability. Trusted by 6,000+ orgs including Adobe, MIT, Dropbox.
Key differentiators – #1 benchmarked accuracy • 1,400+ formats • Full white-labeling included • Flat-rate pricing
vs OpenAI – 10% lower hallucination, 13% higher accuracy, 34% faster
vs Botsonic/Chatbase – More file formats, source citations, no hidden costs
vs LangChain – Production-ready in 2 min vs weeks of development
Additional Considerations
Functions Feature – Trigger actions like tickets directly from chat
SourceSync API – Headless RAG backend for developer control
✅ Factual scoring – Hybrid search with reranking provides unique factual-consistency scores
Flexible deployment – Public cloud, VPC, or on-prem for varied compliance needs
Active development – Regular feature releases and integrations keep platform current
Time-to-value – 2-minute deployment vs weeks with DIY
Always current – Auto-updates to latest GPT models
Proven scale – 6,000+ organizations, millions of queries
Multi-LLM – OpenAI + Claude reduces vendor lock-in
No- Code Interface & Usability
Guided Dashboard – Paste URL or upload files, launch in minutes
Pre-Built Integrations – Copy-paste embed snippet for deployment
7-Day Free Trial – Test risk-free with live demo
Azure portal UI – Straightforward index management and settings configuration interface
Low-code options – PowerApps, Logic Apps connectors enable quick non-dev integration
⚠️ Technical complexity – Advanced indexing tweaks require developer expertise vs turnkey tools
2-minute deployment – Fastest time-to-value in the industry
Wizard interface – Step-by-step with visual previews
Drag-and-drop – Upload files, paste URLs, connect cloud storage
In-browser testing – Test before deploying to production
Zero learning curve – Productive on day one
Join the Discussion
Loading comments...