I need to extract the content of the <title>, <para> and <trim.para> tags from a moderately sized (1,100 pages) XML document. I've been looking into regex (but I repeatedly see folks say it's not the best tool for this), and I don't know anything about XML DOM parsers or the like (I'm just a simple Arbortext user). Is there an easy way to do this, either within Arbortext or with a simple tool I can find online that parses an XML file and dumps the content of whichever tags I select to text? Request advice, over. (Bob, Bristol RI)
Thanks, all, for the quick replies. Suzanne's write-untagged command was perfect: quick and so simple, even a caveman could.... well, you know.
Happy holidays and thanks again. (Bob, Bristol RI)
Thanks, but note that I'm trying to extract it in order: all sequential instances of title, trim.para, and para, in the order they appear in the book.
Pulling them out separately won't help, because I have to re-purpose a large portion of the Tech Manual content for a training doc that isn't done in XML; it's in MS Word, hence the difficulty.
If I can get it out as text in some manner, I can simply copy and paste it into the other doc.
Thanks for reading and any replies welcome. Happy Holidays (Bob, Bristol RI)
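For anyone who lands on this thread without Arbortext at hand, here is a minimal sketch of the same in-order extraction using Python's standard xml.etree.ElementTree module. It is not the write-untagged approach mentioned in the thread, just one possible alternative. The sample markup, the <emph> and <section> tags, and the extract_text helper are made up for illustration; the sketch assumes the wanted elements are not nested inside one another (nested ones would have their text emitted twice), and a real manual with a DOCTYPE and custom entities may need extra handling before it parses.

```python
import xml.etree.ElementTree as ET

# Tags to pull out, per the original question.
WANTED = {"title", "para", "trim.para"}

def extract_text(root, wanted=WANTED):
    """Return the text of each wanted element, in document order.

    Hypothetical helper for illustration; assumes wanted elements
    are not nested inside each other.
    """
    chunks = []
    for elem in root.iter():  # iter() walks the tree in document order
        if elem.tag in wanted:
            # Join text from the element and any inline children,
            # then collapse runs of whitespace and line breaks.
            text = " ".join("".join(elem.itertext()).split())
            if text:
                chunks.append(text)
    return chunks

# Made-up sample standing in for the real manual.
sample = """<book>
  <title>Pump Maintenance</title>
  <section>
    <para>Shut off <emph>all</emph> power first.</para>
    <trim.para>Then drain the line.</trim.para>
    <title>Reassembly</title>
    <para>Reverse the steps above.</para>
  </section>
</book>"""

root = ET.fromstring(sample)
for line in extract_text(root):
    print(line)
```

To run it against a file rather than the inline sample, replace ET.fromstring(sample) with ET.parse("manual.xml").getroot(), where manual.xml is a placeholder name. The printed lines can then be pasted straight into the Word document.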