Grupa Insight

AI Search Glossary

Definitions of key terms in AI Search, Entity SEO, and modern web architecture.

HomeGlossary
AI Search

AI Overviews

AI Overviews is a feature in Google Search that generates a summarized AI-powered answer at the top of search results, drawing from multiple web sources. It appears for selected queries and includes links to cited sources. There are no special technical requirements or additional schema markup needed to appear in AI Overviews beyond standard SEO best practices.

READ MORE →
AI Search

AI Readiness

AI Readiness is the state of a website being technically accessible, semantically clear, and content-complete enough to be discovered, interpreted, attributed, and cited by AI-powered search systems. A website that is AI-ready can be crawled by AI search crawlers, parsed for meaning, attributed to a recognized entity, and retrieved as a source for relevant queries.

READ MORE →
AI Search

AI Search Optimization

AI Search Optimization is the practice of making a website more discoverable, attributable, and citable by AI-powered search systems such as ChatGPT Search, Perplexity, and Google AI Overviews. It builds on technical SEO, entity clarity, structured data, content architecture, and trust signals to improve the probability that AI systems retrieve and cite a website as a source.

READ MORE →
AI Search

AI Search Visibility

AI Search Visibility is the degree to which a website is discovered, retrieved, and cited by AI-powered search systems such as ChatGPT Search, Perplexity, and Google AI Overviews. Unlike classic search ranking positions, AI Search Visibility is measured through citation frequency, source attribution, referral traffic patterns, and prompt testing rather than fixed rank tracking.

READ MORE →
Semantic SEO

Content Cluster

A content cluster is a group of interlinked pages on a website that together cover a specific topic comprehensively. It typically consists of a pillar article covering the topic broadly, supported by more focused articles, FAQ pages, case studies, and service pages — all connected through internal links. Content clusters build topical authority and improve retrievability in both search engines and AI systems.

READ MORE →
Performance

Core Web Vitals

Core Web Vitals are a set of standardized performance metrics defined by Google to measure real-world user experience on web pages. The three current metrics are Largest Contentful Paint (LCP), Interaction to Next Paint (INP), and Cumulative Layout Shift (CLS). They are part of Google's Page Experience signals and influence search ranking.

READ MORE →
Performance

Cumulative Layout Shift (CLS)

Cumulative Layout Shift (CLS) is a Core Web Vitals metric that measures visual stability — how much visible page elements shift unexpectedly during loading. A high CLS score indicates that content moves around as the page loads, disrupting the user experience. Google considers CLS under 0.1 as good, 0.1 to 0.25 as needing improvement, and over 0.25 as poor.

READ MORE →
Trust & Authority

E-E-A-T

E-E-A-T stands for Experience, Expertise, Authoritativeness, and Trustworthiness. It is a framework from Google's Search Quality Rater Guidelines used to evaluate whether content is easy to trust, verify, and attribute. It is not a direct ranking factor but describes the properties that make content more credible and useful — for users and AI systems alike.

READ MORE →
Semantic SEO

Entity SEO

Entity SEO is the practice of optimizing a website so that brands, people, products, and services are clearly defined as named entities that search engines and AI systems can identify, understand, and attribute. It shifts focus from keyword matching to entity recognition and semantic relationships.

READ MORE →
Technical SEO

GPTBot

GPTBot is OpenAI's web crawler associated with potential model training data collection. It operates independently from OAI-SearchBot, which is used for ChatGPT Search features. Blocking GPTBot in robots.txt prevents content from being used for model training but does not affect visibility in ChatGPT Search results.

READ MORE →
Technical SEO

Headless CMS

A headless CMS is a content management system that separates the content repository (backend) from the presentation layer (frontend). Content is stored and managed in the CMS and delivered via API to any frontend — a website, mobile app, or other channel. This architecture gives developers full control over how content is rendered and optimized.

READ MORE →
Semantic SEO

Knowledge Graph

A knowledge graph is a structured database of entities and their relationships. Search engines and AI systems use knowledge graphs to understand what things are, how they relate to each other, and whether a source can be trusted and attributed. Google's Knowledge Graph powers entity recognition in Search, AI Overviews, and related features.

READ MORE →
Performance

Largest Contentful Paint (LCP)

Largest Contentful Paint (LCP) is a Core Web Vitals metric that measures how long it takes for the largest visible content element on a page to fully render. It reflects perceived load speed from the user's perspective. Google considers LCP under 2.5 seconds as good, 2.5 to 4.0 seconds as needing improvement, and over 4.0 seconds as poor.

READ MORE →
Technical SEO

OAI-SearchBot

OAI-SearchBot is OpenAI's dedicated web crawler used to surface websites in ChatGPT Search features. It operates independently from GPTBot, which is used for potential model training. Allowing OAI-SearchBot in robots.txt enables a website to appear as a cited source in ChatGPT Search results.

READ MORE →
Technical SEO

PerplexityBot

PerplexityBot is the web crawler used by Perplexity to discover and index web content for surfacing in Perplexity search results. According to Perplexity's documentation, it respects robots.txt and does not use blocked content for pre-training foundation models. It is distinct from crawlers used for model training purposes.

READ MORE →
AI Search

Retrieval-Augmented Generation (RAG)

Retrieval-Augmented Generation (RAG) is an AI architecture in which a language model retrieves relevant information from an external source before generating a response. Instead of relying solely on knowledge encoded during training, the model fetches current or specific content and uses it to produce a more accurate, grounded answer.

READ MORE →
Technical SEO

Schema Markup

Schema markup is structured data added to a web page in a format that search engines and AI systems can parse without interpreting prose. It uses vocabulary from schema.org to describe what a page is, who created it, what it contains, and how it relates to other entities. It makes pages self-describing and machine-readable.

READ MORE →
Semantic SEO

Semantic SEO

Semantic SEO is the practice of optimizing content for meaning and context rather than exact keyword matching. It focuses on covering topics comprehensively, using related concepts and entities, and structuring content so that search engines and AI systems can understand the full scope of a subject — not just surface-level keyword presence.

READ MORE →
Semantic SEO

Topical Authority

Topical authority is the degree to which a website is recognized as a credible, comprehensive source on a specific subject. It is built through consistent publication of interconnected content that covers a topic with sufficient depth and breadth — signaling to search engines and AI systems that the site is a reliable reference in that domain.

READ MORE →
Technical SEO

llms.txt

llms.txt is a proposed standard for a plain-text file placed in the root directory of a website that helps large language models (LLMs) understand the site's content structure, identify authoritative pages, and use the information correctly at inference time. It is not a confirmed universal requirement for AI Search visibility.

READ MORE →