Example
The results of applying the five individual heuristics to the sample document presented earlier are as follows:
OML = [(hr, 1), (br, 2), (b, 3)]
RPL = [(hr, 1), (br, 2), (b, 3)]
SDL = [(hr, 1), (b, 2), (br, 3)]
ITL = [(hr, 1), (br, 2), (b, 3)]
HTL = [(b, 1), (br, 2), (hr, 3)]
Combining these five individual heuristics together yields:
ORSIH: [(hr, 99.96%), (b, 64.75%), (br, 56.34%)]
Hence, ‘hr’ is chosen as the record separator.