Understanding Chunks and Products

This guide explains the advanced concepts behind how your AI agent finds and delivers accurate answers: vectorized chunks for semantic understanding and supplement products for precise filtering.

How Your AI Finds Answers

Your AI uses a hybrid search system that combines two powerful approaches:

Semantic Search - Understands meaning and context from your content chunks
Structured Search - Filters and sorts using your product catalog data

This combination delivers results that neither approach could achieve alone.

Vectorized Chunks

What Are Chunks?

When you upload content (web pages, documents, PDFs), the system breaks it into smaller, manageable pieces called chunks. Each chunk typically contains a paragraph or section of related information.

Example: A 10-page PDF about your return policy might become 25-30 chunks, each covering a specific topic like "refund timeframes," "exchange process," or "exceptions."

What Is Vectorization?

Each chunk is converted into a vector embedding - a numerical representation that captures its semantic meaning. Think of it as translating text into coordinates in a multi-dimensional space where similar meanings are located near each other.

How it works:

"affordable engagement rings" → [0.23, -0.45, 0.67, 0.12, ...]
"budget-friendly diamond bands" → [0.25, -0.43, 0.80, 0.14, ...]

These two phrases have similar vectors because they have similar meanings, even though they use different words.

How Semantic Search Works

When a customer asks a question:

Query vectorization - The question is converted to a vector
Similarity search - System finds chunks with similar vectors
Relevance ranking - Most relevant chunks are selected
Answer generation - AI uses these chunks to craft a response

Example:

Customer asks: "What's your policy on returns?"
System finds chunks about refunds, exchanges, return windows
AI synthesizes an accurate answer from your actual content

Why Chunks Matter

Benefit	Description
Contextual answers	Responses based on your actual content, not generic AI knowledge
Accurate information	AI cites your policies, products, and services correctly
Reduced hallucination	Grounded in real data rather than making things up
Up-to-date responses	Reflects your latest uploaded content

Chunk Limits by Plan

Plan	Chunk Limit
Free	500
Professional	1,000
Premium	2,000
Enterprise	Custom

Managing Your Chunks

What counts toward your limit:

Uploaded documents (PDFs, DOCs, TXT files)
Crawled web pages
Manually added content

Tips for optimization:

Remove duplicate or outdated content
Focus on high-value information customers ask about
Consolidate similar pages when possible

Supplement Products

The Limitation of Semantic Search

While semantic search excels at understanding meaning, it struggles with numerical precision.

The problem:

"Affordable rings" and "luxury rings" have different meanings ✓
"$500 rings" and "$5,000 rings" may appear similar if descriptions match ✗

Vector embeddings don't encode numerical relationships well. A budget ring and an expensive ring might look similar in vector space if they're both described as "beautiful diamond engagement rings."

What Are Supplement Products?

Supplement products are your structured product catalog that adds precise, filterable data on top of semantic search. They "supplement" the vector search with exact values.

Product Data Structure

Each product record contains:

Field	Description	Example
Title	Product name	"1.5ct Oval Diamond Ring"
URL	Product page link	"https://yoursite.com/rings/oval-15ct"
Price	Numeric price value	2499.00
Numeric 1-3	Custom numeric fields	Carat: 1.5, Length: 8mm
Categories	Classification tags	"Engagement, Oval, Natural"
Keywords	Searchable attributes	"solitaire, platinum, certified"
Media URL	Product image	"https://yoursite.com/images/ring.jpg"

How Structured Search Works

The product catalog enables:

Capability	Example Query
Price filtering	"under $2,000" or "between $500-$1,000"
Numeric comparisons	"at least 1 carat" or "under 50,000 miles"
Category filtering	"show me oval diamonds" or "exclude SUVs"
Keyword matching	"with heated seats" or "certified pre-owned"
Budget awareness	"within my $3,000 budget"

The Hybrid Search in Action

Customer query: "Show me oval diamond rings under $3,000"

What happens:

Semantic search analyzes chunks
- Finds content about oval diamonds, ring styles, quality factors
- Understands "oval" refers to shape, not just any mention of the word
Structured search filters products
- Filters shape category = "oval"
- Filters price < $3,000
- Returns matching products with exact prices
Combined results
- AI presents relevant products with accurate pricing
- Provides context from your content about oval diamonds
- Respects the exact budget constraint

Result: Customer sees oval rings priced $1,200 to $2,950 - not a $4,500 ring that happens to mention "affordable" in its description.

Industry Examples

Jewelry Store:

Price, carat weight, ring size
Categories: shape, metal, stone type
Keywords: certified, natural, lab-grown

Car Dealership:

Price, mileage, year
Categories: make, model, body style
Keywords: AWD, leather, sunroof

Real Estate:

Price, square footage, bedrooms
Categories: property type, neighborhood
Keywords: pool, garage, renovated

Restaurant:

Price, calories, prep time
Categories: cuisine, meal type, dietary
Keywords: gluten-free, spicy, vegetarian

Product Limits by Plan

Plan	Product Limit
Free	500
Professional	1,000
Premium	2,000
Enterprise	Custom

Chunks vs. Products: When to Use Each

Content Type	Use Chunks	Use Products
Policies and FAQs	✓
Company information	✓
Blog posts and articles	✓
Product catalog	✓ (descriptions)	✓ (structured data)
Service listings	✓	✓
Pricing tables		✓

Best practice: Use both together. Upload product descriptions as documents (creates chunks for semantic understanding) AND import your catalog as products (enables precise filtering).

Practical Tips

Optimizing Chunks

Quality over quantity - Well-written content creates better chunks
Clear structure - Use headings and paragraphs for logical chunking
Avoid duplication - Same content uploaded twice wastes chunk limit
Update regularly - Remove outdated content, add new information

Optimizing Products

Complete data - Fill in all available fields
Consistent categories - Use the same terms across products
Rich keywords - Include features customers search for
Accurate pricing - Keep prices current for budget filtering

Monitoring Usage

Check your usage in Admin > Billing Usage:

Current chunk count vs. limit
Current product count vs. limit
Progress bars show utilization percentage

Technical Deep Dive

Vector Dimensions

Embeddings are high-dimensional vectors (typically 768 or 1536 dimensions) that encode semantic meaning. The AI model generating these embeddings was trained on billions of text examples to understand language nuances.

Similarity Metrics

When searching, the system calculates cosine similarity between the query vector and all chunk vectors. Higher similarity scores indicate more relevant content.

Chunk Size

Chunks are sized to balance:

Context - Large enough to contain meaningful information
Precision - Small enough to match specific queries
Performance - Optimized for fast retrieval

Typical chunk size: 200-500 tokens (roughly 150-400 words).

Retrieval Count

For each query, the system retrieves the top-k most similar chunks (typically 3-10) to provide context for answer generation. This balances accuracy with response speed.

Summary

Concept	Purpose	Strength
Chunks	Semantic understanding	Meaning, context, natural language
Products	Structured filtering	Numbers, categories, exact values
Hybrid Search	Combined approach	Accurate, relevant, precise results

Your AI agent uses both systems together to deliver answers that understand what customers mean AND respect exact constraints like budgets and specifications.

Data Management - Upload content and manage chunks
Uploads - Import products and documents
Billing Usage - Monitor chunk and product usage