No question, accuracy must come first.
The reason so many rows are being skipped is only the first occurrence of 'the' is being looked at:
seq 23: Antikythera
seq29: no space between writes and the
seq 108 and 109: Southern Ohio
etc.
etc.
Just so this can be looked at with a totally open (maybe that should be blank) mind, can you state what the original request and requirements were regarding this ?
|