Today we launched a CloudPreservation.com feature that lets you find pages or documents that didn’t exist in the previous crawl. This allows you to do things like:
- Find all pages that were added to the archive during a specific crawl.
- Use keywords to search the text or metadata of pages that were added to the archive during a specific crawl.
To use this handy new feature, select a crawl from the “Feeds and Crawls” drop-down under the search text field. A checkbox will appear below it; check that box to limit your search to pages and documents that were found for the first time in that crawl.
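
For the curious, the idea behind "new in this crawl" can be pictured as a set difference between two crawls, followed by an ordinary keyword match. The sketch below is only an illustration under our own assumptions: the function names and the URL-to-text dictionary are hypothetical and do not describe how CloudPreservation.com is actually implemented.

```python
def new_in_crawl(current_urls, previous_urls):
    """Return URLs seen in the current crawl but not in the previous one."""
    return set(current_urls) - set(previous_urls)


def search_new_pages(current_pages, previous_urls, keyword):
    """Keyword-search only the pages that are new in the current crawl.

    current_pages: hypothetical dict mapping URL -> page text.
    previous_urls: iterable of URLs captured by the previous crawl.
    """
    new_urls = new_in_crawl(current_pages, previous_urls)
    needle = keyword.lower()
    # Case-insensitive substring match, sorted for a stable result order.
    return sorted(u for u in new_urls if needle in current_pages[u].lower())


current = {
    "/about": "About our firm",
    "/news/launch": "Product launch announcement",
    "/news/recall": "Product recall notice",
}
previous = ["/about"]
matches = search_new_pages(current, previous, "recall")
```

Here `matches` contains only `/news/recall`, because `/about` already existed in the previous crawl and `/news/launch` does not mention the keyword.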