That's why it's good to crawl twice, so if the site got deleted by the first crawl, you can check and then discard the results. Saves a bit of disk space.
There's a similar problem with antivirus (AV) automatically unsubscribing all of your customers from their spam (and newsletters) the first time you start scanning emails for malicious links, since the scanner fetches every link it finds, unsubscribe links included. In fact, it's not quite completely solvable.
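For what it's worth, here's a rough sketch of the usual partial mitigation: before the scanner fetches anything, drop the URLs the sender declared in the `List-Unsubscribe` header, plus anything that merely looks like an unsubscribe link. The function names, the keyword list, and the URL regex are all made up for illustration, not taken from any real AV product, and the heuristic will miss plenty of links, which is rather the point about it not being fully solvable.

```python
import re
from email import message_from_string
from email.message import Message

# Hypothetical keyword list: substrings that hint a link changes state when fetched.
RISKY_KEYWORDS = ("unsubscribe", "optout", "opt-out")

# Deliberately crude URL matcher, good enough for a sketch.
URL_RE = re.compile(r"https?://[^\s<>\"']+")


def declared_unsubscribe_urls(msg: Message) -> set[str]:
    """Return the http(s) URLs listed in the List-Unsubscribe header, if any."""
    header = msg.get("List-Unsubscribe", "")
    return {u for u in re.findall(r"<([^>]+)>", header)
            if u.lower().startswith("http")}


def body_text(msg: Message) -> str:
    """Concatenate the decoded text parts of the message."""
    chunks = []
    for part in msg.walk():
        if part.get_content_maintype() != "text":
            continue
        payload = part.get_payload(decode=True)
        if payload is None:
            continue
        charset = part.get_content_charset() or "utf-8"
        chunks.append(payload.decode(charset, errors="replace"))
    return "\n".join(chunks)


def links_safe_to_fetch(raw_email: str) -> list[str]:
    """List body links the scanner may fetch, skipping likely unsubscribe links."""
    msg = message_from_string(raw_email)
    skip = declared_unsubscribe_urls(msg)
    safe = []
    for url in URL_RE.findall(body_text(msg)):
        if url in skip:
            continue  # the sender told us this one unsubscribes
        if any(k in url.lower() for k in RISKY_KEYWORDS):
            continue  # heuristic guess only; can be wrong in both directions
        safe.append(url)
    return safe
```

Anything with an opaque tracking URL slips straight past this, so it reduces the damage rather than preventing it. Senders who implement one-click unsubscribe over POST (RFC 8058) are protected from scanners that only issue GET requests, but that depends on the sender, not on you.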