Word Importer Best Practices
Submitted by
bnwest.
on 2007-06-20 17:30.
Word Importer Best Practices
Our import process geneates the following CNXML tags:
- <section>
- <name>
- <para>
- <quote>
- <list>
- <item>
- <note>
- <figure>
- <media>
- <param>
- <emphasis>
- <code>
- <table>, <tgroup>, <tbody>, <row>
- <link>
DOs
- Use the CNX Word Template. This is documented somewhere on cnx.org. Using the Word Templte should facilitate the import of the CNXML tags: <cite>, <code>, <emphasis>, <foreign>, <quote> and <term>.
- Use the default Word equation editor for all math (especially subscripts and supescripts), so that MathML will be imported.
- Use the default Word styles. Creating user defined Word styles may cause import problems.
- Use browser supported image types and not .wmf and .svg files.
- Use only straight forward tables.
DON'Ts
- Do not use MathType 5.0 equation Editor, since all of the MathType 5.0 equations will not be imported. After import, MathType 5.0 equation be copy from MathType 5.0 to the system clipboard (Elizabeth documented this, but I am unsure if that made to cnx.org) and pasted into the CNXML.
- Do not use lists or tables to format the document. These constructs are literally imported.
- Do not add line breaks via cntl-Enter.
- Do not use tab-s.
- Do not have separate lists which use the same numbering, i.e. continue numbering across separate lists.
- Do not have lists or multiple paragraphs as a single table entry.
- Do not bold/italized/underline text.
- Do not use images that are hyperlinked. Use local images.
- Do not use the 'wingding' font.
- Do not highlight multiple lines of code and format them "CNXML Code".

Also, why not import all the bold as emphasis?