|
|
Paper title |
Research and Application of a Lucene-Based Film Search Engine (一种基于Lucene的影片搜索引擎的研究和应用) |
Paper title (English) |
|
Author |
Kuang Zhenguo (匡振国) |
Year of publication |
2008 |
Volume |
44 |
Issue |
29 |
Pages |
8-10 |
Journal |
Computer Engineering and Applications (计算机工程与应用) |
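Abstract |
Lucene is an excellent open-source search engine framework that is widely used in information retrieval. This paper analyzes the shortcomings of the existing search engines in video-on-demand portals and designs a tokenizer that supports Chinese, based on a double-character hash algorithm. Using this tokenizer and the Lucene toolkit, a fast film search engine for video on demand is designed and implemented; it not only supports Chinese retrieval but is also fast to search and easy to extend. Simulation experiments show that the proposed Lucene-based film search engine performs well. Keywords: Lucene; search engine; double-character hash; Chinese word segmentation; inverted index |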
Abstract (English) |
Lucene is an excellent open-source search engine framework, and it has been widely used in the field of information retrieval. After analyzing the shortcomings of existing search engines in VoD portals, this paper designs a Chinese word segmentation method based on a double-character hash index algorithm. Using this word segmentation method and the Lucene toolkit, a fast VoD video search engine is implemented, which not only supports Chinese search but also offers fast searching and easy extension. The simulation results show that the Lucene-based video search engine designed in this paper performs well. Keywords: Lucene; search engine; double-character hash index; Chinese word segmentation; inverted index |
|
|
|