Catalog & Sitemaps

How Lumio discovers and imports your products by scanning your sitemap.

Lumio imports your product catalog by scanning your website’s sitemap — the same way AI shopping agents discover your products. This means Lumio sees exactly what the AI sees, giving you an accurate picture of your discoverability.

How catalog import works

When you enter your store URL, Lumio follows this process:

1. Sitemap discovery

Lumio fetches your sitemap.xml file and parses it. If your sitemap contains references to child sitemaps (common for large stores), Lumio follows those too. It recursively discovers all URLs in your sitemap tree.

2. Product page identification

Not every URL in your sitemap is a product page. Lumio uses AI-powered URL classification to identify which URLs are likely product pages based on URL patterns, path structure, and naming conventions.

Common patterns it recognizes:

  • /products/...
  • /p/...
  • /shop/...
  • /item/...
  • Category + product slug patterns

3. JSON-LD extraction

For each identified product page, Lumio fetches the page and extracts JSON-LD structured data. This is the <script type="application/ld+json"> block that contains your product’s machine-readable information — the same data AI shopping agents read.

Lumio stores the complete JSON-LD payload along with normalized fields (title, description, brand, price, images) for display and scoring.

4. Catalog creation

All discovered products are stored in your workspace’s catalog with:

  • Product title and description
  • Brand and pricing information
  • The raw JSON-LD payload
  • Source URL for reference

What if my site doesn’t have a sitemap?

Most ecommerce platforms (Shopify, WooCommerce, BigCommerce) generate sitemaps automatically. Check by visiting yourstore.com/sitemap.xml in your browser.

If you don’t have a sitemap:

  • Shopify — Automatically generates one at /sitemap.xml
  • WooCommerce — WordPress generates one; also check Yoast SEO settings
  • BigCommerce — Available at /xmlsitemap.php
  • Custom platforms — You may need to generate one manually or use a sitemap generator

What if my products don’t have JSON-LD?

Lumio can still import basic product information from your sitemap URLs, but scoring and enrichment work best with structured data. If your products lack JSON-LD markup:

  • Shopify — Most themes include basic Product schema. Check with Google’s Rich Results Test.
  • WooCommerce — Use a schema plugin like Yoast SEO or Schema Pro
  • Custom platforms — Add Product schema markup to your product page templates

Even without perfect JSON-LD, Lumio extracts what it can and scores accordingly — low schema completeness scores will highlight exactly what’s missing.

Rescanning your catalog

Your catalog isn’t static. When you add new products, update descriptions, or improve your structured data, you can rescan:

  • Full rescan — Re-fetches all product pages and updates stored data
  • Rescore — After rescanning, run scoring again to see how your improvements affected AI readiness

Coming soon

  • Shopify app integration — Direct API connection for real-time catalog sync
  • CSV upload — Import product data from spreadsheets
  • Scheduled rescans — Automatic periodic rescanning to keep your catalog fresh