Converting documents to mediawiki markup

From OpenWetWare
Revision as of 13:24, 29 November 2005 by Barry Canton (talk | contribs)
Jump to navigationJump to search

Introduction

The initial goal of this page is to work out what existing tool are avaiable to automatically convert documents to mediawiki markup. Based on identified shortcomings in available tools, we'd like to propose some tools that should be developed to facilitate the growth of OpenWetWared.

There appears to be few existing tools to automatically convert other document formats (e.g .doc, .xls, .ppt). The simplest approach right now appears to be to convert documents into html and from that to convert them to wiki markup as there are good html->wiki converters. Here are some good links.

Word Documents

Saving a relatively simple Word document (no images or tables) to html and then running that through the converter here produced good mediawiki formatting. A document including images, tables, and centered text did not work as well. The images would need to be added to OWW separately, the table also didn't come out quite right and centered text was no longer centered.

A direct converter can be found here.

Excel Documents

If you can export a data in comma separated variable (CSV) format, then a converter exists.

Powerpoint Documents

Not a lot here, even the .ppt->html converters are unimpressive. Based on the partial interchangeability of the .doc and .ppt formats, it might be possible go .ppt->.doc->.html->wiki.

LaTex Documents

Mediawiki has support for some Tex math formatting so there are ways to do this. Most people who are active users of Tex are probably savvy enough to be able/willing to convert their documents into mediawiki format. Still room for improvement though.

Conclusions and deliverables

This page is in active development but very initial conclusions will begin to appear here
Based on what I've read so far, there are ways of converting document formats and it helps if you are somewhat computer savvy but there is definitely a need for a converter for a range of formats that is well integrated with the mediawiki sofware.