Category: Multilingual processing
Development tool: C/C++
File size: 80KB
Downloads: 63
Upload date: 2006-05-28 16:02:24
Description: 1. This is a simple corpus management system.
2. It can add and delete corpus files and count the characters in the corpus.
3. It can search the corpus for strings of Chinese characters and for reduplicated (overlapping) forms.
4. Corpus files are stored in the corpus directory; query results are saved in the same directory as the corpus.
5. The corpus directory contains 4 text files for testing (test1 and test2 are two small files).
6. It can only process text files in GB internal code (GB encoding).
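The package's own source is not reproduced in this listing, so the following is only a minimal sketch of how character counting and a search for reduplicated forms might work on GB-encoded text. It assumes GB2312 double-byte encoding, with Hanzi lead bytes in 0xB0-0xF7 and trail bytes in 0xA1-0xFE, and it treats a "reduplicated form" as the simplest AA pattern (the same Hanzi twice in a row). The function names splitGbChars, countHanzi, and findAAForms are hypothetical and do not come from the package.

```cpp
#include <cstddef>
#include <string>
#include <vector>

// Splits GB2312 bytes into characters. Assumption: a byte >= 0xA1 starts a
// two-byte character; anything else (ASCII, or a dangling final byte) is
// taken as a single byte. Malformed input is not validated.
std::vector<std::string> splitGbChars(const std::string& text) {
    std::vector<std::string> chars;
    for (std::size_t i = 0; i < text.size();) {
        unsigned char lead = static_cast<unsigned char>(text[i]);
        std::size_t len = (lead >= 0xA1 && i + 1 < text.size()) ? 2 : 1;
        chars.push_back(text.substr(i, len));
        i += len;
    }
    return chars;
}

// True when c is a two-byte GB2312 Hanzi (lead 0xB0-0xF7, trail 0xA1-0xFE).
bool isHanzi(const std::string& c) {
    if (c.size() != 2) return false;
    unsigned char lead = static_cast<unsigned char>(c[0]);
    unsigned char trail = static_cast<unsigned char>(c[1]);
    return lead >= 0xB0 && lead <= 0xF7 && trail >= 0xA1 && trail <= 0xFE;
}

// Counts the Hanzi in a GB2312 byte string, ignoring ASCII and symbols.
std::size_t countHanzi(const std::string& text) {
    std::size_t count = 0;
    for (const std::string& c : splitGbChars(text))
        if (isHanzi(c)) ++count;
    return count;
}

// Returns the character index (not the byte offset) of every position where
// a Hanzi is immediately followed by the same Hanzi, i.e. an AA form.
std::vector<std::size_t> findAAForms(const std::string& text) {
    std::vector<std::string> chars = splitGbChars(text);
    std::vector<std::size_t> hits;
    for (std::size_t i = 0; i + 1 < chars.size(); ++i)
        if (isHanzi(chars[i]) && chars[i] == chars[i + 1]) hits.push_back(i);
    return hits;
}
```

Working on whole characters rather than raw bytes avoids false matches that straddle two adjacent double-byte characters. Longer patterns such as AABB or ABAB could be checked the same way by comparing more neighbouring entries of the split vector.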
File list:
TestCorpus
..........\ChildFrm.cpp
..........\ChildFrm.h
..........\GetDataDlg.cpp
..........\GetDataDlg.h
..........\HanziInfoDlg.cpp
..........\HanziInfoDlg.h
..........\MainFrm.cpp
..........\MainFrm.h
..........\ReadMe.txt
..........\res
..........\...\TestCorpus.rc2
..........\resource.h
..........\retrieval.cpp
..........\retrieval.h
..........\SaveCorpus.cpp
..........\SaveCorpus.h
..........\SelectData.cpp
..........\SelectData.h
..........\StdAfx.cpp
..........\StdAfx.h
..........\TestCorpus.aps
..........\TestCorpus.clw
..........\TestCorpus.cpp
..........\TestCorpus.dsp
..........\TestCorpus.dsw
..........\TestCorpus.h
..........\TestCorpus.ncb
..........\TestCorpus.opt
..........\TestCorpus.plg
..........\TestCorpus.rc
..........\TestCorpusDoc.cpp
..........\TestCorpusDoc.h
..........\TestCorpusView.cpp
..........\TestCorpusView.h
..........\TitlesDlg.cpp
..........\TitlesDlg.h