AI search

AI Hallucination

An AI hallucination is when a language model states something false as if it were true, presenting fabricated facts, citations, or details with the same confident tone it uses for correct answers, because the model predicts plausible text rather than retrieving verified information.

Hallucination happens because a language model generates the next most likely words, not the most accurate ones. The model has learned the shape of correct-sounding language, so it can produce fluent, well-structured claims even when it holds no real knowledge of the subject. When it has nothing genuine to draw on, it does not pause or flag the gap: it fills the space with an invented product spec, a fabricated quote, a price that was never charged, or a citation pointing to a page that does not exist. Crucially, the confident tone is identical whether the answer is grounded or guessed, which is what makes hallucination hard to catch by reading alone.

Consider a Shopify merchant selling merino base layers. A shopper asks an assistant whether the brand offers a lifetime repair guarantee. The store has never published such a policy, but similar outdoor brands often do, so the model infers one and states it plainly: yes, with free repairs for life. The shopper arrives expecting a promise the merchant never made. Nothing was retrieved; the claim was assembled from the statistical neighbourhood of comparable brands. The same pattern invents wash instructions, fibre percentages, or sizing advice that contradicts the actual product page.

The most reliable defence is grounding the model in retrieved sources at answer time, the approach behind retrieval-augmented generation. When a system pulls real documents and answers only from them, hallucination drops sharply, because the model is summarising evidence rather than inventing it. This is also why what a model says about your brand depends on what it can find. If accurate, corroborated information about your products, policies, and pricing is easy to retrieve, the model has something true to anchor on instead of guessing from the average of your category.

For answer engines such as ChatGPT, Perplexity, and Google AI Overviews, this turns hallucination into a practical reason to publish clearly. These systems lean on what they can read about you across your site, structured data, and third-party sources. Sparse or contradictory information widens the gap the model will fill with invention; consistent, specific, well-marked-up facts narrow it. Customer reviews help here too, because they corroborate real claims about fit, durability, and service in language the model can cite.

Hallucination is never fully eliminated, only reduced, so treat any unsourced AI claim about pricing, availability, or specifications as unverified until checked against a primary source. For operators, the honest takeaway is twofold: you cannot control what a model invents in a vacuum, but you can shrink the vacuum by making real, citable information easy to find, and you should periodically ask the major assistants about your own store to catch confident falsehoods before customers do.