Calculation of Match Score

The calculation of Match Score depends on the type of rule (predefined or customized). For predefined rules, it also depends on the type of content the rule is detecting.

Match Score calculation by rule and content type

Rule Type

Content Type

Match Score calculation

Predefined EDD rules

Credit Card Numbers

A Match Score of 1 is generated for each valid, unique credit card number detected in a file. For example, if the same credit card number is detected three times in a file, a Match Score of 1 is generated. If three unique credit card numbers are detected, a Match Score of 3 is generated.

US Social Security Numbers

A Match Score of 1 is generated for each valid, unique SSN detected in a file. For example, if the same SSN is detected three times in a file, a Match Score of 1 is generated. If three unique SSNs are detected, a Match Score of 3 is generated.

Personal Health Information

A Match Score of 1 is generated for each valid, unique Personal Health Identifier detected in a file. Although a file must include at least one health term for a match to be detected, no Match Score is assigned to a matched health term. Therefore, the higher the Match Score the more likely that a file contains Personal Health Information for many individuals rather than just one.

Personal Financial Information

A Match Score of 1 is generated for each valid, unique Personal Financial Identifier detected in a file. Although a file must include at least one financial term for a match to be detected, no Match Score is assigned to a matched financial term. Therefore, the higher the Match Score the more likely that a file contains Personal Financial Information for many individuals rather than just one.

Encrypted or Password Protected

A Match Score of 1 is generated for each encrypted or password-protected file.

GDPR Personal Data

A Match Score of 1 is generated for each valid, unique EEA personal identifier detected in a file. For example, if the same German Taxpayer ID Number is detected three times in a file, a Match Score of 1 is generated. If three unique German Taxpayer ID Numbers are detected, a Match Score of 3 is generated.

Unscannable

A Match Score of 1 is generated for each unscannable file.

Customized EDD rules

All content types

A Match Score of 1 is generated for each unique "word" in an expression that is detected in a file.

For example, you create a rule to detect Loan numbers in the format, "Loan number #######". The rule detects the following file content: "Loan number 5534215" and "Loan number 7645981". A Match Score of 4 is assigned to the file for the words "Loan", "number", "5534215", and "7645981".