所属分类:
多国语言处理
开发工具:Visual C++
文件大小:417KB
下载次数:39
上传日期:2008-11-18 17:10:22
说明: XPDF:把pdf文档转化为TEXT文档的库,如需中文支持,请到官方网站下载中文语言包
HTM2TXT:把HTML文件转化为TEXT文件的库
ICTCLAS:对中文字符串进行分词的库
PS2TXT:把Postscript文件转化为TEXT文件的源码
(XPDF: the pdf file into a TEXT document library, for Chinese language support, please visit the official website to download Chinese language pack HTM2TXT: the HTML file into a TEXT file library ICTCLAS: Chinese string Segmentation of library PS2TXT: the Postscript file into a TEXT file source)
文件列表:
IR_Lib
......\Configure.xml
......\htm2txt.dll
......\htm2txt.h
......\htm2txt.lib
......\ICTCLAS.dll
......\ICTCLAS.h
......\ictclas.lib
......\ps2txt.cpp
......\ps2txt.h
......\xpdflib.dll
......\xpdflib.h
......\xpdflib.lib
......\说明.txt