所属分类:
多国语言处理
开发工具:Visual C++
文件大小:1423KB
下载次数:15
上传日期:2008-11-14 16:52:06
说明: 自己写的简单分词程序,能够识别中英文,标点符号,数字等,但是速度不是很理想,其中思想可以供大家参考!
(Himself wrote a simple segmentation procedure can identify in both Chinese and English, punctuation, numbers, etc., but the speed is not very ideal, in which ideas can be for your reference!)
文件列表:
SimpleSplit
...........\Dic.cpp
...........\Dic.h
...........\dict.txt
...........\Hash.cpp
...........\Hash.h
...........\SimpleSplit.cpp
...........\SimpleSplit.dsp
...........\SimpleSplit.dsw
...........\SimpleSplit.ncb
...........\SimpleSplit.opt
...........\SimpleSplit.plg
...........\Split.cpp
...........\Split.h