Community Tip - Need to share some code when posting a question or reply? Make sure to use the "Insert code sample" menu option. Learn more! X
Hi,
We have a business requirement to export documents from Arbortext 5.4 M020 directly into MS Word format. What is the best we can do for this?
From what I've read, export to MS Word from Arbortext is only supported via RTF, taking care to use special "Word fields", "styles", "instructions" and/or stylesheets. And, even then, the Arbortext Styler Guide& Help files suggest there are plenty of limitations and "gotchas" lurking along this path.
One of our "power" users suggests she has used elsewhere a feature / add-on to EPIC / Editor that can be used to directly export to, or compose into, a Microsoft Word (i.e., ".doc") format. Does this sound familiar to anyone? What is she talking about? She can't remember the feature, but believes it may be a separately-licensed add-on.
Any tips or leads will be carefully followed up!
Regards,
-- Marty
In Reply to Marty Ross:
Hi,
We have a business requirement to export documents from Arbortext 5.4 M020 directly into MS Word format. What is the best we can do for this?
From what I've read, export to MS Word from Arbortext is only supported via RTF, taking care to use special "Word fields", "styles", "instructions" and/or stylesheets. And, even then, the Arbortext Styler Guide& Help files suggest there are plenty of limitations and "gotchas" lurking along this path.
One of our "power" users suggests she has used elsewhere a feature / add-on to EPIC / Editor that can be used to directly export to, or compose into, a Microsoft Word (i.e., ".doc") format. Does this sound familiar to anyone? What is she talking about? She can't remember the feature, but believes it may be a separately-licensed add-on.
Any tips or leads will be carefully followed up!
Regards,
-- Marty
Hi Marty: Re. "from Arbortext 5.4 M020 directly into MS Word format" I wouldn't start from there. Can your requirement be re-stated as "from XML into OOXML (docx, Word 2003 and later) or "into RTF" (Word 97 and earlier)? I think couching this in app-to-app terms is problematic, as you're limited/tied to the apps themselves. XML is ideal for converting to other formats, so I would prefer to couch the problem in terms of "format-to-format", rather than "app-to-app". If you state the problem in those terms, I think it becomes more achievable - depending on your resources 🙂
We have for years converted XML to both RTF (just give the documents a .doc extension) and, more recently, to OOXML (docx). You don't mention round-tripping so I assume it's not a requirement. Just as well, as AFIAK no-one has yet managed it without unacceptable loss of data/quality. Personally I don't think it's been achievable to date, though if Word continues to converge with XML, who knows?
We use XSLT to go from XML to both RTF and OOXML. Our XML-to-OOXML process yields fully-styled Word docs with sophisticated functionality, including automated paragraph renumbering (outline numbering linked to styles) and dynamic cross-references. These documents support automated re-styling by end users (easy conversion to House Style), which is another of our requirements. The paragraph numbering (e.g. 3.4.5.6) that the editors see inthe Arbortext screen FOSI matches that in output the Word documents. I don't know of any Word feature that cannot be supported in XML, and which cannot be achieved via XML-to-OOXML conversion, though I haven't "gone looking for trouble": I can only say that we're able to single-source our content in XML and support Web, Hard Copy, Word and PDF.
You don't mention what DTD/schema you're converting from. I'm not very well up on this, but aren't there ready-made XSLT scripts that will convert from, for example, DocBook XML to RTF/OOXML? I know there are free DocBook to HTML and DocBook to PDF tools, so I would hav expected DocBook-to-RTF to be out there too. Maybe it's not, since RTF is proprietary (and PDF isn't???)
Hope this helps.
Fellow Adepter James Sulak inspired me to tackle the XSLT route to the Word 2003 XML format ("WordML"). With knowledge of WordML structure it's doable. Again, a simple doctype helps.
See http://rep.oio.dk/Microsoft.com/officeschemas/wordprocessingml_article.htm
The Word 2003 XML format is virtually the same as the fully-realized Office Open XML introduced with Word 2007, without the compressed container folder structure (so you can transform to a single flat XML file), and is seamlessly converted to MS Word 2007/2010 with the built-in converter.
The main problems with the Arbortext ImportExport RTF export route are disappointing speed and expensive licensing. That said, we absolutely rely on ImportExport for conversion to XML of Word legacy and outside submissions. Only OmniMark is comparable as far as I'm aware.
Again, round-tripping is fraught.
Years ago I wasted a couple days looking at Word as an XML editor. It can do "user XML" as an overlay to the WordML but there's no real-time validation, poor performance, and no route back to Word from user XML.
- Lou Argyres
Continuing Education of the Bar - California
2100 Franklin St, Suite 500
Oakland, CA 94612
Lou.Argyres@ceb.ucla.edu