From Word to Joomla and keep only some formatting

I have a project where I need to add a lot of text and have all as Word documents.

The text has been formatted nicely with headings, bold text, italics etc. No graphics.

Of course, under the surface, Word adds A LOT of codes that I do not even think about adding in Joomla. But I need to come up with a workflow for this that takes away all Word specific stuff, but keeps the bold and italics as well as centering of some text.

Does anyone know of any tools that do not strip out absolutely everything? I do not mind going through a couple of exports/imports to get it done. Also, I mainly work on Linux, but have both Windows and Mac available if needed.

Views: 191

Tags: formatting, import, remove, word

Comment by Svein Wisnaes on July 16, 2010 at 10:07am
Sounds exactly what I need, Brian! Thanks! Going to try now :-)
Comment by Svein Wisnaes on July 16, 2010 at 9:35pm
So far some positive things and some shocks. Pasted in some text from a Word document opened in OpenOffice. The COOOOOOOOOOOOOOOL things first:

Pasting in the (long) text with correctly generated footnotes moved all footnotes to the bottom of the document and linked the text to the correct footnotes. ABSOLUTELY COOL!!!!!!

Throughout the text, it seems the cleanup was successful. BUT -

The ugly part: A bunch of text at the bottom AFTER the footnotes has a TON of formatting codes and I believe this (and maybe some missing configuration from my side) totally breaks the site when viewing the text. Going to do some more testing. If I can just fix the last part here, this will be a total lifesaver. The site is for publishing academic papers and they have some strict guidelines for how things should be.
Comment by Svein Wisnaes on July 16, 2010 at 9:49pm
Well, deleting all the stuff at the end of the article (not visible in the WYSIWYG mode) brought the site back. A little disappointed that using the template css is the default. Turning it off give an error as a css does not exist for the editor and the fallback is to use the css for the template!! I really do not like using the template css while editing articles in the backend.
Comment by Svein Wisnaes on August 4, 2010 at 6:47pm
So, after using it a bit, here is some more info.

Seems like the first mishap with the garbage at the end was a fluke. Have not seen it since.

I had to fix the quotes (don't know if there is any special code for quotes in Word - could not find anything in OpenOffice...), but most other things looks good.

Hard line breaks gets translated into A LOT of linebreaks/paragraphs. So a little bit of a clean up job. Footnotes gets pushed to the end of the articles with working anchor links. Cool!

Got a bit of a scare today after trying to paste in an article of close to 15.000 words/90.000 char. After cleaning everything up, I saved it and the last part of it disappeared. Heart racing, thinking I was really in deep... It is an academic website with loooong papers to publish.

To the rescue comes Amy! Turns out there is a field in the database that needed a little type change, and now all is good!

Next project will be to try to remove some of the buttons/functions. And maybe try to put "blockquote" on a button.

But this is a solution I really can recommend to be able to paste from Word and OpenOffice Write into Joomla.

Comment

You need to be a member of All Together, As A Whole to add comments!

Join All Together, As A Whole

Badge

Loading…

© 2012   Created by Amy Stephen.

Badges  |  Report an Issue  |  Terms of Service