Configuring Duplicate Contacts Finder batch processing

Duplicate Contacts Finder batch processing compares a set of contacts that have recently changed against the rest of the contacts in the ContactManager database. A recent change is either of the following:

  • Contact changes that occurred after the last time the batch process ran
  • Contact changes that occurred after the date and time set in the configuration parameter DuplicateContactsEarliestModificationDate

For example:

  1. Duplicate Contacts Finder batch processing ran today at 1:00 am. There were 100 contacts in the database at that time.
  2. Duplicate Contacts Finder found 5 possible duplicate contacts.
  3. The contact administrator checked today for duplicate contacts and merged 5 of them, leaving 95 contacts in the database.
  4. After Duplicate Contacts Finder last ran at 1:00 am today, 10 new contacts were created.
  5. Duplicate Contacts Finder runs at 1:00 am tomorrow. It compares each of those 10 new contacts against 104 other contacts: the other 9 new contacts and the 95 contacts that were already in the database.
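
The arithmetic in the example above can be checked with a short Python sketch. This is an illustration only, not ContactManager code. The function name comparisons_per_changed_contact is hypothetical, and the sketch assumes that each changed contact is compared against every other contact in the database except itself.

```python
def comparisons_per_changed_contact(unchanged_contacts: int, changed_contacts: int) -> int:
    """Return how many other contacts one changed contact is compared against.

    Assumption: a changed contact is compared against all unchanged contacts
    plus all of the other changed contacts, but never against itself.
    """
    if changed_contacts < 1:
        return 0
    return unchanged_contacts + (changed_contacts - 1)


# Figures from the example: 100 contacts, 5 merged away, then 10 new contacts.
contacts_after_merge = 100 - 5
new_contacts = 10

print(comparisons_per_changed_contact(contacts_after_merge, new_contacts))  # 104
```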

Duplicate Contacts Finder also checks the value of DuplicateContactsEarliestModificationDate, which you can set in config.xml. It ignores any contact that was created or last modified before the date and time in this configuration parameter. You can use this setting to avoid processing every contact in the database the first time you run the batch process.
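
The effect of the cutoff can be pictured with the following Python sketch. It is a simplified model, not the product's implementation. The Contact class, the contacts_to_check function, and the sample dates are all hypothetical, and the sketch assumes that a contact whose latest creation or modification time falls on or after the cutoff is still processed.

```python
from dataclasses import dataclass
from datetime import datetime


@dataclass
class Contact:
    """Hypothetical stand-in for a contact record; not a ContactManager type."""
    name: str
    last_modified: datetime  # creation time if the contact was never edited


def contacts_to_check(contacts, earliest_modification):
    """Keep only contacts created or modified on or after the cutoff."""
    return [c for c in contacts if c.last_modified >= earliest_modification]


# Assumed cutoff, standing in for DuplicateContactsEarliestModificationDate.
cutoff = datetime(2024, 1, 1, 0, 0)

contacts = [
    Contact("Old, untouched contact", datetime(2019, 6, 30, 9, 15)),
    Contact("Recently edited contact", datetime(2024, 3, 2, 14, 0)),
]

selected = contacts_to_check(contacts, cutoff)
print([c.name for c in selected])
```

In this model, only the recently edited contact is selected, so the first batch run does not have to treat every historical contact as changed.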

See also

Set the DuplicateContactsEarliestModificationDate configuration parameter