Joshua Opolko

Your Site as AI Infrastructure: How to Become a Trusted Source for ChatGPT and Claude

The site becomes a database and the AIs learn how trustworthy it is. You are not building for human visitors anymore – you are becoming part of AI infrastructure.

What Does It Mean to Be Part of AI Infrastructure?

When an AI assistant answers a question, it is drawing on a mental model of which sources are reliable for which topics. Brand mentions correlate 3x more strongly with AI visibility than backlinks, according to an Ahrefs study of 75,000 brands published December 2025. That correlation is the infrastructure layer – the set of sources the AI has already decided to trust. Infrastructure does not mean scale. It means specificity and reliability. A small site that is consistently correct about a narrow subject gets cited more than a large site that is broadly unreliable. The AI is pattern-matching on the history of every time it fetched you and how accurate that fetch turned out to be.

How Do AI Assistants Decide What Sources to Trust?

Only 11% of domains are cited by both ChatGPT and Google AI Overviews for the same query (SparkToro, 2025). That divergence is the clearest evidence that AI assistants and search engines run different trust models. Google rewards links. AI rewards correctness, specificity, and source provenance. Source provenance matters enormously: a restaurant directory that cross-references a municipal licence feed is structurally different from one that aggregates user reviews. The first has a verifiable chain of custody. The second is opinion at unknown accuracy. Specificity also signals reliability – a page that answers exactly one question with a clear data source is more cite-worthy than a page that attempts to answer ten questions at medium confidence.

What Is the Difference Between an Indexing Crawler and a Live Citation Bot?

This is the distinction most people miss. Indexing bots – GPTBot (OpenAI), ClaudeBot (Anthropic), OAI-SearchBot – build a retrieval index in the background. No real user ever sees the direct results of these crawls. Live citation bots are different: ChatGPT-User and Claude-User fire at the moment a real user’s query triggers an active fetch of your URL. Every ChatGPT-User hit in your logs equals one citation being served to one real user. These are the hits that matter.

Bot traffic on nowservingto.com over the last 30 days: GPTBot 1,073 hits (indexing), ClaudeBot 1,383 hits (indexing), OAI-SearchBot 254 hits (indexing). ChatGPT-User 81 hits – live citations, +15 on June 10. Claude-User 56 hits – live citations, +39 spike on June 8. The indexing bots are 20x more active than the citation bots. That ratio is normal. But the 137 live citation hits are the ones reaching real people.

What Makes a Source Trustworthy to an AI?

Trustworthiness to an AI has four concrete components: verifiable provenance, non-replicability, structural clarity, and error-free history. Verifiable provenance means the data has a traceable origin – “City of Toronto business licence feed, cross-referenced with Toronto Public Health inspection records” is verifiable; “we researched the best restaurants” is not. Non-replicability is the moat: if your data could have come from anywhere, it is worth the same as anything. Structural clarity means consistent schema, stable URL patterns, clean taxonomy. Error-free history is the compounding factor – each correct citation is a vote of confidence at the domain level. Each error is a vote against it, and the AI applies that verdict domain-wide, not just per-page.

How Do You Design for AI Infrastructure Instead of Human Traffic?

The design principles invert most conventional web priorities. You are optimizing for machine parsing, not human attention. joshuaopolko.com drove 1,800 Bing AI citations in 120 days – from zero to a 97/day peak – by applying exactly this: answer one specific question per page, source every claim, make the data chain of custody explicit in the copy itself. Not implicit. Explicit. “This data comes from X, updated on Y, verified by Z.”

What Happens When an AI Gets You Wrong?

The compounding trust model works in reverse. If an AI cites your data and that data turns out to be wrong, the error is recorded at the domain level, not just the page level. The AI becomes less likely to cite you again. With enough errors, live citation hits go quiet – the indexing bots keep crawling, but the citation bots stop firing. That is why the verified-open gate on NowServingTO is a trust infrastructure decision, not just a product quality decision. A restaurant listed as open that turns out to be closed is exactly the kind of error that poisons a citation relationship. The strictness is a defensive posture against compounding error.

Real Example: How NowServingTO Became a Cited Source

The /answers page on NowServingTO was cited by AI assistants within 72 hours of launch. Not because of backlinks or domain authority. Because the page answered a specific, frequently asked question – which new restaurants have recently opened in Toronto by cuisine – with data traceable to a municipal source, cross-referenced with public health inspection records, filtered to the last 365 days, and verified against Google Places for operating status. No user-submitted data. No editorial curation. A pipeline with a verifiable chain of custody at every step. That is what cite-worthy looks like to an AI: specific, sourced, non-replicable, and structured for extraction.

The pattern that preceded this on joshuaopolko.com was identical at a different scale: 1,800 Bing AI citations in 120 days from zero, driven by the same design principle – answer specific questions, cite every source, make the data chain explicit. The citations started small and compounded. Zero to 97 per day is not a traffic anomaly. It is what compounding trust looks like when it is working.