Category: Multilingual processing
Development tool: Java
File size: 5229 KB
Downloads: 6
Upload date: 2008-12-31 00:18:55
Description: A simple word segmentation program. It reads in a PDF and outputs a TXT file of the segmented text.
File list:
WordSeg
.......\bin
.......\...\dic.dat
.......\...\WordSegment
.......\...\...........\BMM.class
.......\...\...........\Dictionary.class
.......\...\...........\DicTrainer.class
.......\...\...........\FMM.class
.......\...\...........\SegStrategy.class
.......\...\...........\WordSegment.class
.......\build
.......\.....\classes
.......\.....\.......\WordSegment
.......\.....\.......\...........\BMM.class
.......\.....\.......\...........\composite.class
.......\.....\.......\...........\Dictionary.class
.......\.....\.......\...........\DicTrainer.class
.......\.....\.......\...........\FMM.class
.......\.....\.......\...........\Init.class
.......\.....\.......\...........\Integert.class
.......\.....\.......\...........\PDFReader.class
.......\.....\.......\...........\Run.class
.......\.....\.......\...........\SegStrategy.class
.......\.....\.......\...........\WordReader.class
.......\.....\.......\...........\WordSegment.class
.......\build.xml
.......\nbproject
.......\.........\build-impl.xml
.......\.........\genfiles.properties
.......\.........\private
.......\.........\.......\config.properties
.......\.........\.......\private.properties
.......\.........\.......\private.xml
.......\.........\project.properties
.......\.........\project.xml
.......\news.pdf
.......\news.txt
.......\PDFBox-0.7.3
.......\............\PDFBox-0.7.3
.......\............\............\external
.......\............\............\........\FontBox-0.1.0-dev.jar
.......\............\............\........\PDFBox-0.7.3.jar
.......\rel_dic.dat
.......\result.txt
.......\SogouR.mini.txt
.......\src
.......\...\WordSegment
.......\...\...........\BMM.java
.......\...\...........\composite.java
.......\...\...........\Dictionary.java
.......\...\...........\DicTrainer.java
.......\...\...........\FMM.java
.......\...\...........\Integert.java
.......\...\...........\PDFReader.java
.......\...\...........\Run.java
.......\...\...........\SegStrategy.java
.......\...\...........\WordSegment.java
.......\test
.......\test.txt
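
The class names FMM and BMM in the listing suggest forward and backward maximum matching, two common dictionary-based segmentation strategies, although the listing itself does not confirm this. The following is a minimal sketch of both under that assumption. It is not the archive's actual code: the class name `MaxMatchSketch`, the method names, the four-word dictionary, and the sample sentence are all hypothetical and chosen only for illustration.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Collections;
import java.util.HashSet;
import java.util.List;
import java.util.Set;

// Illustrative sketch only; not taken from the WordSeg archive.
public class MaxMatchSketch {

    // Forward maximum matching: scan left to right and, at each position,
    // take the longest dictionary word (up to maxLen characters).
    // A single character is emitted when nothing in the dictionary matches.
    static List<String> forward(String text, Set<String> dict, int maxLen) {
        List<String> out = new ArrayList<>();
        int pos = 0;
        while (pos < text.length()) {
            int end = Math.min(pos + maxLen, text.length());
            // Shrink the candidate from the right until it is a known word
            // or only one character is left.
            while (end > pos + 1 && !dict.contains(text.substring(pos, end))) {
                end--;
            }
            out.add(text.substring(pos, end));
            pos = end;
        }
        return out;
    }

    // Backward maximum matching: the same idea, scanning right to left.
    static List<String> backward(String text, Set<String> dict, int maxLen) {
        List<String> out = new ArrayList<>();
        int end = text.length();
        while (end > 0) {
            int start = Math.max(end - maxLen, 0);
            // Shrink the candidate from the left until it is a known word
            // or only one character is left.
            while (start < end - 1 && !dict.contains(text.substring(start, end))) {
                start++;
            }
            out.add(text.substring(start, end));
            end = start;
        }
        // Words were collected from the end of the text, so restore order.
        Collections.reverse(out);
        return out;
    }

    public static void main(String[] args) {
        // Hypothetical toy dictionary; the archive's dic.dat format is unknown.
        Set<String> dict = new HashSet<>(Arrays.asList("研究", "研究生", "生命", "起源"));
        String text = "研究生命起源";
        System.out.println(forward(text, dict, 3));   // [研究生, 命, 起源]
        System.out.println(backward(text, dict, 3));  // [研究, 生命, 起源]
    }
}
```

The two directions can disagree on ambiguous input, as the sample sentence shows, which is one plausible reason a project would ship both strategies. The sketch indexes by `char`, so it assumes the text contains no characters outside the Basic Multilingual Plane.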