in reply to
Looking for an HTML structure-cleaner
The documents in question are documents, e.g. literary passages, sometimes-elaborate math formatting and so on, all of which are being presented as a part of a vocational test-giving application. So, the formatting (egregious as it sometimes is...) is important. My problem is that, sometimes it is incomplete. Therefore, when my template embeds it into a <div> tag, without the proper closing-tags within the embedded text that <div> tag is not seen as enclosing it anymore.
So, thinking about this requirement a little bit more, I guess that I am really most concerned with “DOM structure” matters ... of making sure that the content, whatever it is, gets wedged into the container. I really don’t want to delve into the guts of that content. I simply want to keep it inside the box. Of course I thought about using the <frame> tag, but the deployment is so bandwidth-constrained that the result looks perfectly dreadful.
You’re right about the Word-generated HTML content, tho’ ... it is hideous. But, it works. And I basically want to keep it working.