Skip to content

etangyushan/doctotext

Repository files navigation

/****************************************************************************
**
** DocToText - Converts DOC, XLS, XLSB, PPT, RTF, ODF (ODT, ODS, ODP),
**             OOXML (DOCX, XLSX, PPTX), iWork (PAGES, NUMBERS, KEYNOTE),
**             ODFXML (FODP, FODS, FODT), PDF, EML and HTML documents to plain text.
**             Extracts metadata and annotations. 
**
** Copyright (c) 2006-2013, SILVERCODERS(R)
** http://silvercoders.com
**
** Project homepage: http://silvercoders.com/en/products/doctotext
**
** This program may be distributed and/or modified under the terms of the
** GNU General Public License version 2 as published by the Free Software
** Foundation and appearing in the file COPYING.GPL included in the
** packaging of this file.
**
** Please remember that any attempt to workaround the GNU General Public
** License using wrappers, pipes, client/server protocols, and so on
** is considered as license violation. If your program, published on license
** other than GNU General Public License version 2, calls some part of this
** code directly or indirectly, you have to buy commercial license.
** If you do not like our point of view, simply do not use the product.
**
** Licensees holding valid commercial license for this product
** may use this file in accordance with the license published by
** SILVERCODERS and appearing in the file COPYING.COM
**
** This program is distributed in the hope that it will be useful,
** but WITHOUT ANY WARRANTY; without even the implied warranty of
** MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.
** 
*****************************************************************************/


说明:
0. 官方最后更新的版本: doctotext-4.0-20140202 

1. 程序不支持解析嵌套文件,只能获取第一层的文件内容(eml除外)

2. 源码编译过程中会有很多的 #warning TODO: XXX , 这些都是程序中还没有做的准备逐步完善的地方

3. 这个代码是在官方最后更新的版本基础上自己维护的版本

   开发环境是: centos6.5 开发版

   开发工具: eclipse-cpp-Mars Release (4.5.0)

测试注意:
doctotext.c 中的解析文件内容的printf 打印最好别修改,因为会造成运行自动化测测试时的错误

使用:
0. 编译: 

   运行 ./run.sh

1. 执行:  
   
   cd build

   ./start.sh

2. 测试 tests 下的示例文件

   ./test.sh
   
   

About

文件解析 doctotext 源码 4.0-20140202 版本

Resources

License

Unknown, GPL-2.0 licenses found

Licenses found

Unknown
COPYING.COM
GPL-2.0
COPYING.GPL

Stars

Watchers

Forks

Packages

 
 
 

Contributors