Suggestion: without actual examples we can only speculate. The fact you still get dates implies your regex is not correct. If you skip content because of a pattern are you catering for the adjustment. Perhaps substitute the skipped pattern with 'whitespace'. If you do not then the doc flow is all over the place, you need to retain the doc structure.