Hi all,
I hope someone of you has got experience with this subject: A department wants to unify its documentations. What's existing are 500+ MS Word documents written in a unified template. What they search: a way to generate different documents from single sources (user, teacher, support, in-line help for programs) and a "database" that enables access via search, index and keywords. Sounds very much like a jack of all trades to me, but maybe some of you know a feasible approach?
Best wishes Michael
Hi Michael,
Libreoffice provides a headless mode in which it can convert between different document formats. Apart from that .odt as well as .docx files are just zip containers which you can unpack to work with the contained xml code.
Converting files to plain text allows you, to index them, for a standard text search. If you don't want to do this with Libreoffice you can also use the tool catdoc, or in case of .odt extract the plaintext representation of the document from the zip container.
Performing XML transitions (XSLT based or other) on the contained XML data allows you to convert between different document templates.
Free Software provides the tools for those tasks, even for MS Word documents ;-)
On 17 luglio 2014 14:11:50 CEST, Michael Kesper mkesper@fsfe.org wrote:
Hi all,
I hope someone of you has got experience with this subject: A department wants to unify its documentations. What's existing are 500+ MS Word documents written in a unified template. What they search: a way to generate different documents from single sources (user, teacher, support, in-line help for programs) and a "database" that enables access via search, index and keywords. Sounds very much like a jack of all trades to me, but maybe some of you
know a feasible approach?
I suggest Sphinx, even if it doesn't have inline help for programs, as far as I know. But it does have a search function and it's very flexible. There's also a tool to convert from odt to restructuredtext: http://sphinx-doc.org/intro.html#conversion-from-other-systems
It's used by python and django project (and many others).
↪ 2014-07-19 Sat 09:50, Federico Bruni bruni@fsfe.org:
I suggest Sphinx, even if it doesn't have inline help for programs, as far as I know. But it does have a search function and it's very flexible. There's also a tool to convert from odt to restructuredtext: http://sphinx-doc.org/intro.html#conversion-from-other-systems
I suggest you look at pandoc too.