Word
Microsoft Word is one of the most widely used word processing programs in the world. Beneath its seemingly simple interface lies a complex technology that offers many possibilities for automated checking and conversion of manuscript data.
A central component is the OOXML (Office Open XML) file format. This XML-based format represents all text, formatting, images, and tables in a Word document in a structured way. OOXML itself is technically closely tied to the Word software, which is why we convert it into media-neutral XML using XProc and XSLT. This makes it easier to process automatically, import into other systems, or export for e-books and web content. Additionally, DOCX files can be recreated from other XML-based formats with transpect, for example when authors need to receive the manuscript back with typesetting corrections applied.
Another important feature for scientific or technical publications is the integration of mathematical formulas. In the past, MathType was frequently used for this purpose—a Word plugin for creating mathematical equations. In modern Word versions, there is an integrated formula editor based on OMML (Office Math Markup Language). OMML is an XML-based format that stores formulas directly in OOXML. Whether OMML or MathType, transpect can process both formats and convert them into clean MathML.
Word also offers scripting and automation capabilities that are extremely useful for publishers. The older Visual Basic for Applications (VBA) allows new menus and functions to be added to Word templates and automates recurring tasks. For modern Word versions, there are also Office Add-Ins based on web technologies such as JavaScript, HTML, and CSS. These add-ins extend Word with additional functions directly in the user interface, for example for connecting to content management systems, performing automatic style checks, or exporting to specific publication formats.
The flexibility of Microsoft Word also allows authors to work outside of style guidelines and technical standards. In such cases, it is often useful to normalize manuscripts—a process in which manuscripts are automatically prepared using rule-based and AI-based scripts, along with manual adjustments.
Word is not only installed on almost every computer but also provides structured file formats, mathematical editing capabilities, and flexible automation. Despite the availability of web and XML editors, Microsoft Word remains an indispensable tool for many authors and publishers.