I'm working on a similar task and was wondering — did you end up finding a library that can reliably convert HTML to PDF with full PDF/UA compliance?
Reasons:
Blacklisted phrase (2): was wondering
RegEx Blacklisted phrase (3): did you end up finding a