[1] Huang C N, Zhao H. Chinese word segmentation: A decade review [J]. Journal of Chinese Information Processing,2007,21(3):8-19(in Chinese). 黄昌宁,赵 海. 中文分词十年回顾 [J].中文信息学报,2007,21(3):8-19.
[2] Asahara M,Goh C L, Wang X J,et al.Combining segmenter and chunker for Chinese word segmentation //Proceedings of Second SIGHAN Workshop on Chinese Language Processing.2003:144-147.
[3] Li M, Gao J F, Huang C N,et al.Unsupervised training for overlapping ambiguity resolution in Chinese word segmentation //Proceedings of Second SIGHAN Workshop on Chinese Language Processing.2003:1-7.
[4] Li J B, Zhou Q, Chen Z S. A study on fast algorithm for Chinese dictionary lookup [J]. Journal of Chinese Information Processing,2006,20(5):31-39(in Chinese). 李江波,周 强,陈祖舜.汉语词典快速查询算法研究 [J].中文信息学报,2006,20(5):31-39.
[5] Sun M S, Zuo Z P, Huang C N. An experimental study on dictionary mechanism for Chinese word segmentation [J]. Journal of Chinese Information Processing, 2000,14(1):1-6(in Chinese). 孙茂松,左正平,黄昌宁.汉语自动分词词典机制的实验研究 [J].中文信息学报,2000,14(1):1-6.
[6] Yang W F, Chen G Y, Li X. PATRICIA-tree based dictionary mechanism for Chinese word segmentation [J]. Journal of Chinese Information Processing, 2001,15(3):44-49(in Chinese). 杨文峰,陈光英,李 星.基于PATRICIA tree自动分词词典机制 [J].中文信息学报,2001,15(3):44-49.
[7] Wang S L, Zhang H P, Wang B. Research of optimization on double-array trie and its application [J]. Journal of Chinese Information Processing,2006,20(5):24-30(in Chinese). 王思力,张华平,王 斌.双数组Trie树算法优化及其应用研究 [J].中文信息学报,2006,20(5):24-30.
[8] Li Q H, Chen Y J, Sun J G. A new dictionary mechanism for Chinese word segmentation [J]. Journal of Chinese Information Processing, 2003,17(4):13-18(in Chinese). 李庆虎,陈玉健,孙家广.中文分词词典新机制—双字哈希机制 [J].中文信息学报,2003,17(4):13-18.
[9] Choi A, Cheng C H, Ko Y L. Word extraction from Chinese documents by occurrence counts //International Conference on Computer Processing of Chinese and Oriental Languages.Toronto, Canada,1988:488-491.
[10] Honglan Jin, Kam-Fai.A Chinese dictionary construction algorithm for information retrieval //Proceedings of the ACM Transactions on Asian Language Information Processing(TALIP).2002(4):281-296.
[11] Information Technology Lab, NIST . . http://www.nist.gov/speech/tests/tdt/.
[12] TDT3 Multilanguage Text Corpus, Version 2.0; LDC Catalog Number LDC2001T58, isbn: 158563-193-0.
|