New Service
IsMyPageTrained
Check whether your website has been crawled by Common Crawl over the last eight years and receive a detailed forensic report via email.
Enter a domain or URL and we will pull yearly snapshots from Common Crawl, summarize what was captured, and send you the full report by email.
Eight-Year Coverage
Automatically searches for one capture per year across the last eight calendar years to document whether your pages were scraped.
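As a rough illustration of the yearly selection described above, here is a minimal sketch. It assumes captures carry Common Crawl style 14-digit timestamps (YYYYMMDDhhmmss), that the current year counts as one of the eight, and that the earliest capture in each year is the one kept. The function names are hypothetical and the service's actual selection rule may differ.

```python
from datetime import date


def last_n_years(n=8, today=None):
    """Return the last n calendar years, newest first.

    Assumption: the current year is counted as one of the n years.
    """
    today = today or date.today()
    return [today.year - i for i in range(n)]


def one_capture_per_year(captures, years):
    """Map each year to its earliest capture, or None if nothing was found.

    `captures` is a list of dicts with a 14-digit 'timestamp' string.
    Assumption: "one capture per year" means the earliest one.
    """
    chosen = {}
    # 14-digit timestamps sort chronologically as plain strings.
    for capture in sorted(captures, key=lambda c: c["timestamp"]):
        year = int(capture["timestamp"][:4])
        if year in years and year not in chosen:
            chosen[year] = capture
    return {year: chosen.get(year) for year in years}
```

A year that maps to None is still informative: it records that no capture was found for that year in the data searched.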
Source-Level Evidence
Each result links back to the exact Common Crawl index lookup, including timestamps, HTTP status codes, and WARC file references.
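To show what such evidence can look like in practice, the sketch below reads one line of a Common Crawl index response in JSON form and keeps the fields named above. The field names (`url`, `timestamp`, `status`, `filename`, `offset`, `length`) follow the public index output. The function names and the returned layout are hypothetical and are not the report's actual format.

```python
import json


def parse_index_line(line):
    """Extract the evidence fields from one JSON line of an index response."""
    record = json.loads(line)
    return {
        "url": record["url"],
        "timestamp": record["timestamp"],      # 14 digits, YYYYMMDDhhmmss
        "status": record.get("status"),         # HTTP status code, as a string
        "warc_file": record.get("filename"),    # path of the WARC file
        "offset": int(record["offset"]),        # byte offset inside that file
        "length": int(record["length"]),        # length of the record in bytes
    }


def byte_range(entry):
    """Build the HTTP Range header value that covers exactly this record."""
    start = entry["offset"]
    end = start + entry["length"] - 1           # Range end is inclusive
    return f"bytes={start}-{end}"
```

The offset and length pair is what makes the WARC reference checkable: anyone can request that byte range from the named file and inspect the stored response.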
Email Delivery
The full report is emailed instantly so you can forward or retain it as part of your litigation or compliance workflow.
Request Your Report
Provide a domain (for example, example.com) or a full URL. We will automatically broaden the search to cover all paths on that host.
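One way the broadening step could work is sketched below: the input is reduced to its host and turned into a `host/*` wildcard pattern. The function name `host_pattern` is hypothetical, and the exact pattern the service sends to the index is an assumption.

```python
from urllib.parse import urlsplit


def host_pattern(user_input):
    """Turn a bare domain or a full URL into a pattern covering the whole host."""
    text = user_input.strip()
    # urlsplit only recognizes the host when a scheme or a leading '//' is present.
    if "://" not in text:
        text = "//" + text
    host = urlsplit(text).hostname  # lowercased, without port or credentials
    if not host:
        raise ValueError(f"could not find a host in {user_input!r}")
    return f"{host}/*"
```

Under this sketch, `example.com` and `https://example.com/blog/post` both become `example.com/*`, so a deep link still yields a report for the whole host.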
Need deeper analysis?
Our technical experts can extend this baseline crawl evidence into full training data forensics, including model probing, dataset reconstruction, and expert testimony.