AI engines read text, and your images and videos have none, so they're never found or cited. SeenLayer scans your site, writes the missing description layer for every visual, and installs it: plugin, API, or done for you.
SeenLayer doesn't redesign anything. It writes the machine-readable layer your assets never had. Your visitors see the same page; AI crawlers see a source they can cite.
<img src="IMG_0712.jpg" alt="">
<img src="IMG_0712.jpg"
alt="Fashion editorial for KENZO: platinum-bob
model in emerald tailoring against a
Memphis-style set of coral arches">
<script type="application/ld+json">
{ "@type": "ImageObject",
"caption": "Sculptural KENZO tailoring vs
Memphis Milano set design",
"about": ["KENZO", "Memphis Design"],
"description": "citation-ready snippet…" }
</script>
Perplexity, ChatGPT Search and Google AI Overviews never look at your pixels while crawling. They read alt text, transcripts and structured data. Without that layer, your entire visual catalog is a stack of sealed files: present on the page, absent from every AI answer.
AI tools generate millions of videos a day, and every one of them ships mute: no transcript, no chapters, no schema. One video goes into SeenLayer. The citable layer comes out for every destination it lives on.
{ "@type": "VideoObject",
"transcript": "…",
"hasPart": [
{ "@type": "Clip", "startOffset": 0,
"name": "The breath before the stage" },
{ "@type": "Clip", "startOffset": 13,
"name": "The book reveal" } ] }
Title: optimized for AI search Description: dense, citable Chapters: 00:00 The breath before the stage 00:06 Under the spotlight 00:13 The book reveal
1 00:00:00,000 --> 00:00:04,000 [stage silence, a held breath]
IG / TikTok / X / LinkedIn captions: the text AI engines actually read when they cite social posts.
A GEO agency with 38 visual assets on its site: 17/100, with 97% invisible to AI. A content site with 18 images: 0/100. If it happens to the people selling AI visibility, it is happening to you.
| Domain | Assets | Score |
|---|---|---|
| cipionmarketing.com | 38 · one full video at zero | 17 |
| dalinar.com | 18 · every hero image, empty alt | 0 |
INVISIBLE
Your clients ask what you're doing about AI search. Hand them a number. The Visibility Audit runs under your brand, the fix runs on our engine, the retainer grows on your side.
Generating the language is half the job. Installing it is the other half. Four ways in, from zero-tech to fully automatic:
Your report includes the exact alt text and JSON-LD block for every asset. Paste once into your pages. Done.
Connects to your media library. Everything already uploaded gets tagged in batch; every future upload is tagged on arrival. Set and forget.
POST an asset, get the layer back in JSON. Your team plugs it into any CMS, DAM or publishing flow.
Your AI agent builds or maintains the site? It calls SeenLayer as a tool, and every visual is tagged at the moment of creation.
We're in open beta: one free report, two paid fixes, no subscriptions yet. Beta buyers lock the lowest price we'll ever charge.
Plugin, API and agent integrations (ways in 02–04): on the roadmap, coming after beta. Subscriptions with citation tracking arrive with them.
Paste your URL. The scan runs live, no signup, no email, no waiting. The number appears on this page in seconds.
Scans the exact URL you paste: your homepage or any specific page. Conservative by design; the full-site number in the emailed report can only be equal or worse.
Want the full-domain report? Every page, every asset, scored and itemized. Generated and emailed automatically, usually within 15 minutes.
Generative Engine Optimization for visual content. AI crawlers read text, not pixels: alt text, transcripts, Schema.org. Visual GEO generates that text layer so AI engines can find, understand and cite your images and video with your brand as the source.
Four dimensions, weighted by what AI crawlers actually read: structured data (35 points), alt text and transcript quality (25), citable context (20), entity linking (20). The domain score is the average across every asset found. Scoring is conservative by design.
No, and nobody honestly can. Citation patterns shift 40 to 60 percent month over month. We apply what research validates (structured, factual, entity-linked content improved generative visibility up to 40% in the Princeton GEO study) and we track your actual citations so the layer adapts as engines change.
Yes. VideoObject structured data with per-moment Clips, transcripts, chapters, optimized titles and descriptions. For YouTube we generate the fields you control there, plus the schema for the page where you embed the video.
No. The layer is metadata: JSON-LD in the head, alt attributes, captions. Invisible to your visitors, legible to machines. A few kilobytes of text.