I would create a smart group based on contact-ids instead of names.
The first time you run it, it will take long time anyway. (So probably good idea to split into smaller groups)
But when you have done that, you could limit every new duplicate search to contact-ids over a certain number. In that way you only search the new ones. The old ones you have already deduped.