本文共 1625 字,大约阅读时间需要 5 分钟。
coreseek 号称是sphinx中文版,特色是libmmseg 中文分词支持,目前稳定版是3.2.14,对应的sphinx版本是 0.9.9,(sphinx最新版为2.1.7)。
由于官方没有给CentOS 6打包,网上也几乎全是make/make install之类的教程,于是花了几个小时,将coreseek和mmseg分别打包成srpm,方便新手。
默认配置文件改为/etc/sphinx/csft.conf,在CentOS 6 x64 平台上测试打包。
编译后的rpm二进制软件包内容如下
coreseek
# rpm -qpl coreseek-3.2.14-2.el6.x86_64.rpm/etc/logrotate.d/coreseek/etc/rc.d/init.d/searchd/etc/sphinx/etc/sphinx/csft.conf/usr/bin/indexer/usr/bin/indextool/usr/bin/search/usr/bin/searchd/usr/bin/spelldump/var/lib/sphinx/var/log/sphinx/var/run/sphinx
mmseg
# rpm -qpl mmseg-3.2.14-1.el6.x86_64.rpm/etc/mmseg/etc/mmseg/mmseg.ini/etc/mmseg/uni.lib/etc/mmseg/unigram.txt/usr/bin/mmseg/usr/include/mmseg/usr/include/mmseg/ICorpusReader.h/usr/include/mmseg/SegmentPkg.h/usr/include/mmseg/Segmenter.h/usr/include/mmseg/SegmenterManager.h/usr/include/mmseg/Singleton.h/usr/include/mmseg/StringTokenizer.h/usr/include/mmseg/SynonymsDict.h/usr/include/mmseg/ThesaurusDict.h/usr/include/mmseg/UnigramCorpusReader.h/usr/include/mmseg/UnigramDict.h/usr/include/mmseg/UnigramRecord.h/usr/include/mmseg/Utf8_16.h/usr/include/mmseg/bsd_getopt.h/usr/include/mmseg/bsd_getopt_win.h/usr/include/mmseg/csr.h/usr/include/mmseg/csr_assert.h/usr/include/mmseg/csr_mmap.h/usr/include/mmseg/csr_pool.h/usr/include/mmseg/csr_typedefs.h/usr/include/mmseg/csr_utils.h/usr/include/mmseg/darts.h/usr/include/mmseg/dictionary.h/usr/include/mmseg/freelist.h/usr/include/mmseg/iniparser.h/usr/include/mmseg/mmthunk.h/usr/include/mmseg/os.h/usr/include/mmseg/scoped_ptr.h/usr/include/mmseg/tolowercase.h/usr/lib64/libmmseg.a/usr/lib64/libmmseg.la
附件超过2M,请移步百度网盘下载。
http://pan.baidu.com/s/1c0ovYzm
转载地址:http://pbkya.baihongyu.com/