Gathering text from xml
Hello!
I use Java (the plugin is made) for modifying the content of the opened XML/SGML files (Arbortext 5.2).
I need to collect the text from text nodes. Additionally, I have to add "\r\n" after parsing some nodes.
"\r\n" are used as sentence terminator.
<heading>Some text</heading><content>Main article<content> will be transformed into "Some text\r\nMain article\r\n". So, we have 2 sentences though they don't have standard dots at the end.
So, in this example "heading", "content" are external tags, "b" is an internal tag.
Can I get the list of such standard internal/external tags? Or should I make a dialog with the settings which allows to add them to ini file? The division is required to control sentence boundaries.

