PrestaShop Artificial Intelligence

AI Semantic Audit — Vector clustering for PrestaShop 8 & 9

The semantic SEO audit. Detect content that dilutes your thematic authority.

Vector clustering of your entire PrestaShop catalog. Identify off-topic pages, get restructuring suggestions, strengthen your semantic coherence.

PrestaShop 8 & 9 OpenAI / Mistral compatible Local mode without API Multilingual Multi-store
  • 30-day refund
  • 12 months updates
  • 24h support
www.datafirefly.com/en/
AI Semantic Audit — Vector clustering for PrestaShop 8 & 9
v1.0.5 · updated 2026-05-27
What it does

The short version.

01

3 embedding providers to choose from

OpenAI (text-embedding-3-small, 1,536 dimensions), Mistral (mistral-embed, European alternative) or local TF-IDF with no API and no recurring cost. Switch providers any time, the module recomputes automatically.

02

Off-topic page detection

The module identifies products, categories or CMS pages whose cosine distance to their cluster exceeds the threshold. These contents dilute your store's thematic authority in Google's eyes.

03

Restructuring suggestions

For each off-topic page, the module suggests the second-best cluster and computes the expected coherence gain. You know exactly where to move content to strengthen site structure.

04

Interactive 2D semantic map

Projection of all your content onto a 2D plane via Johnson-Lindenstrauss. Each dot represents a page, each color a cluster. Visually identify misplaced content.

05

Incremental audit

Only modified content is reprocessed. Automatic update hooks on products, categories and CMS pages. Signed cron URL for automated weekly audits.

06

CSV export and history

Download each audit as CSV for external processing. Keep the full history of past audits to measure the evolution of your semantic coherence over time.

The long version

Everything you'd want to know before you install.

A detailed look at how AI Semantic Audit — Vector clustering for PrestaShop 8 & 9 works, why we built it the way we did, and the thinking behind the features above.

§ 01

Why audit your store's semantic coherence

Google no longer just reads keywords. Since BERT, MUM and the Search Generative Experiences, the engine evaluates the overall thematic relevance of your site. A store that mainly sells automotive accessories but contains a few pages about interior decoration sends a contradictory signal. The result: Google struggles to know which topics to rank you for, and your overall authority decreases. The semantic audit identifies these 'thematic leaks' so you can delete, move or regroup them.

§ 02

How vector clustering works

Each piece of content in your store is transformed into a vector of hundreds or thousands of dimensions, where each dimension captures an abstract semantic aspect. Two pieces of content on the same topic will have close vectors, two pieces on different topics will be far apart. The k-means algorithm groups these vectors into thematic clusters without human intervention — you discover the real structure of your catalog as a modern search engine perceives it.

§ 03

Three providers for three budgets

OpenAI offers the best quality with text-embedding-3-small (1,536 dimensions, ~€0.02 per 1,000 products). Mistral provides a European alternative hosted in France with mistral-embed (1,024 dimensions, similar pricing). Local TF-IDF mode runs entirely on your server with no API calls, using classic statistical NLP principles — ideal to get started with no commitment or recurring budget.

§ 04

The audit report in practice

After each audit, the module produces five complementary views: a dashboard with key KPIs (off-topic rate, median distance, cluster count), a 2D semantic map to visualize the catalog structure, a detailed cluster list with their representative terms and cohesion scores, a table of off-topic pages with move suggestions and estimated gain, and a table of 'hopeless' pages as noindex or deletion candidates.

§ 05

Automation and continuous workflow

The module registers hooks on the creation, modification and deletion of products, categories and CMS pages. As soon as content changes, it is flagged for re-embedding at the next run. A signed cron URL triggers the indexing → embeddings → audit pipeline in headless mode, for automated weekly or monthly verification. The full audit history is preserved to measure the evolution of your semantic coherence over time.