Live
Structured Data Extractor — JSON-LD, OpenGraph, Twitter Cards, Microdata
All the structured data from any URL in one call: every JSON-LD block with its schema.org types, OpenGraph and Twitter Card tags, canonical URL, meta basics, and microdata items. Batch over URL lists, or call it synchronously from AI agents and SEO tools.
What you get per URL
{
  "url": "https://example.com/article",
  "status": 200,
  "title": "Page title",
  "metaDescription": "...",
  "canonical": "https://example.com/article",
  "jsonLd": [ { "@type": "Article", "...": "..." } ],
  "jsonLdTypes": ["Article"],
  "openGraph": { "title": "...", "image": "...", "type": "article" },
  "twitter": { "card": "summary_large_image" },
  "microdataItems": [],
  "errors": []
}
- Malformed JSON-LD is reported in `errors` instead of being silently dropped, which is exactly what an SEO audit needs
- Unreachable URLs produce an error record without killing the batch
- HTTP-first — no browser overhead, so it's fast and cheap at volume
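Because failures arrive as ordinary records rather than aborting the run, a consumer will usually want to separate clean records from flagged ones. The following is a minimal Python sketch of that step. It assumes the records have already been loaded as dictionaries shaped like the sample above and that entries in `errors` are plain strings; the helper name `split_records` and the error text are hypothetical, not part of the API.

```python
from typing import Any


def split_records(
    records: list[dict[str, Any]],
) -> tuple[list[dict[str, Any]], list[dict[str, Any]]]:
    """Partition extractor records into (clean, flagged).

    A record counts as flagged when its `errors` list is non-empty.
    """
    clean: list[dict[str, Any]] = []
    flagged: list[dict[str, Any]] = []
    for record in records:
        if record.get("errors"):
            flagged.append(record)
        else:
            clean.append(record)
    return clean, flagged


# Illustrative data only; the error message format is an assumption.
records = [
    {
        "url": "https://example.com/article",
        "status": 200,
        "jsonLdTypes": ["Article"],
        "errors": [],
    },
    {
        "url": "https://example.com/broken",
        "status": 200,
        "jsonLdTypes": [],
        "errors": ["hypothetical: JSON-LD block could not be parsed"],
    },
]

clean, flagged = split_records(records)
for record in flagged:
    print(record["url"], record["errors"])
```

Keeping the flagged records, rather than discarding them, preserves the audit trail that the `errors` field is meant to provide.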
Use cases
- SEO audits — validate schema.org markup across a whole site in one run
- Product and content data — pull clean Article, Product, Recipe, Event, JobPosting objects from pages that publish them
- Link previews — fetch OG/Twitter metadata for URLs your app needs to render
- Agent tool — "what does this page say it is?" answered in one synchronous call
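As an illustration of the link preview use case, the sketch below builds a preview from one record, preferring OpenGraph values and falling back to Twitter Card values and then to the basic meta fields. The `title`, `image`, `canonical`, and `metaDescription` keys come from the sample record; the `description` key on `openGraph` and the `title`, `description`, and `image` keys on `twitter` are assumptions about what a page might publish. The function name `link_preview` is hypothetical.

```python
from typing import Any, Optional


def link_preview(record: dict[str, Any]) -> dict[str, Optional[str]]:
    """Build a minimal link preview from one extractor record."""
    og = record.get("openGraph") or {}
    tw = record.get("twitter") or {}
    return {
        # Prefer the canonical URL when the page declares one.
        "url": record.get("canonical") or record.get("url"),
        # OpenGraph first, then Twitter Card, then the plain <title>.
        "title": og.get("title") or tw.get("title") or record.get("title"),
        "description": (
            og.get("description")
            or tw.get("description")
            or record.get("metaDescription")
        ),
        # No meta-level fallback exists for images, so this may be None.
        "image": og.get("image") or tw.get("image"),
    }


# Illustrative record shaped like the sample above.
record = {
    "url": "https://example.com/article?ref=feed",
    "title": "Page title",
    "metaDescription": "Meta description",
    "canonical": "https://example.com/article",
    "openGraph": {
        "title": "OG title",
        "image": "https://example.com/cover.png",
        "type": "article",
    },
    "twitter": {"card": "summary_large_image"},
}

preview = link_preview(record)
print(preview)
```

The fallback order is a design choice for this sketch; an application that renders Twitter-style cards might reasonably reverse the first two sources.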
Pricing
| Event | Price (USD) |
|---|---|
| Run start | $0.0001 |
| Per URL extracted | $0.002 |
| API call (standby) | $0.01 |
1,000 URLs ≈ $2. The live price on the Apify page is authoritative.
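The 1,000-URL figure can be reproduced from the table. This sketch assumes a batch run is billed as one run-start event plus one per-URL event for each URL extracted; how the standby API-call event combines with the others is not stated here, so it is left out. The prices are copied from the table above and may differ from the live listing, and the function name `estimate_batch_cost` is hypothetical.

```python
from decimal import Decimal

# Prices in USD, copied from the table above.
RUN_START = Decimal("0.0001")
PER_URL = Decimal("0.002")


def estimate_batch_cost(url_count: int) -> Decimal:
    """Estimate the cost of one batch run over `url_count` URLs.

    Assumes one run-start event plus one per-URL event per URL.
    """
    if url_count < 0:
        raise ValueError("url_count must be non-negative")
    return RUN_START + PER_URL * url_count


print(estimate_batch_cost(1000))
```

`Decimal` is used instead of `float` so that sub-cent prices add up exactly.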
Runs on Apify — free account, pay per event, batch or synchronous standby endpoint.
Run on Apify →
Related: Sitemap Diff · RDAP Domain Intel