How Website Scraping Works
When you add a URL, VozAgent:- Fetches the page at the URL you provide
- Extracts the text content from the page (stripping out navigation, ads, scripts, etc.)
- Indexes the content so your assistant can search it during calls
- Stores a snapshot of the content at the time of scraping
Adding a Website URL
There are two ways to add website content, depending on where you start.From the Knowledge page
- Go to Knowledge in your dashboard sidebar
- Click the Add Knowledge button
- Select Website from the resource type options
-
Enter the website URL in the Website URL field (e.g.,
https://example.com/services) - Click Add Knowledge
https://acmeplumbing.com/services, the title will be set to acmeplumbing.com.
From the Paste URL dialog
If you’re using the legacy document view, you may see the Paste from URL option:- Click Add Document and select Paste from URL
- Enter a Name for the resource (e.g., “Services Page”)
- Enter the URL of the page you want to scrape (e.g.,
https://example.com/services) - Click Upload
What Makes a Good URL to Add?
Not all pages are equally useful. Here are the best pages to add:- Services pages — what you offer, service descriptions, specialties
- FAQ pages — common questions and answers your callers typically ask
- About pages — business background, team info, company story
- Pricing pages — rates, packages, estimates information
- Location/contact pages — areas served, hours, addresses
- Policy pages — cancellation policies, warranties, guarantees
One Page per URL
Each URL you add scrapes a single page. It does not automatically follow links or crawl your entire website. If you want your assistant to know about your services page, FAQ page, and pricing page, you need to add each URL separately. For example, to cover your main website content, you might add:https://yourbusiness.com/serviceshttps://yourbusiness.com/faqhttps://yourbusiness.com/abouthttps://yourbusiness.com/pricing
Auto-Created Website Knowledge
When you first set up VozAgent and provide your business website, the platform automatically scrapes your site and creates a knowledge item. This item is marked with a System badge and cannot be deleted. System website items have a sync button (refresh icon) that lets you re-scrape the website to pull in the latest content. This is useful if you’ve recently updated your website. To re-scrape a system website item:- Find the item in your Knowledge list (look for the System badge)
- Click the refresh icon button
- Wait for the scanning to complete
Viewing Scraped Content
To see what content was extracted from a URL:- Find the website item in your Knowledge list
- Click the eye icon (View content) button
Processing Status
After adding a URL, the item goes through a brief processing period:| Status | What it means |
|---|---|
| Scanning | The page is being fetched and processed. This usually takes a few seconds to a minute. |
| (No status badge) | Processing is complete. The content is ready and searchable by your assistant. |
| Error | Something went wrong. The page may be inaccessible, blocked, or contain no extractable text. |
Important Notes
- URLs must start with
http://orhttps://— the system validates the URL format - The content is a snapshot — if your website changes, you’ll need to re-scrape system items or delete and re-add user-created items to pull in the updated content
- Password-protected pages won’t work — the scraper needs public access to the page
- Website items cannot be edited after creation — you can view or delete them, but you cannot modify the scraped content directly. To update, delete the item and add the URL again.

