

Installing Lucene on Linux

Apache + Tomcat integration: http://www.ibm.com/developerworks/cn/opensource/os-lo-apache-tomcat/index.html

About KinoSearch, a Perl search engine library: http://www.rectangular.com/kinosearch/

Plucene: a Perl library based on the Java Lucene project. To install it:
perl -MCPAN -e "install Plucene"
perl -MCPAN -e "install Plucene::Simple"

Zend Framework http://framework.zend.com/download

CLucene: a C++ full-text search engine, a complete port of Lucene written with the STL. A PHP extension is available, but support for Chinese text is not very good. http://sourceforge.net/projects/clucene/

Lucene4c The Lucene4c project is an implementation of the Lucene search engine in C, built on top of the Apache Portable Runtime.   http://incubator.apache.org/lucene4c/

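Nutch: an open-source search engine implemented in Java. It provides all the tools needed to run your own search engine, including full-text search and a web crawler. http://lucene.apache.org/nutch http://nutch.sourceforge.net/docs/en/about.html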

jdk6: http://www.java.net/download/jdk6/6u10/promoted/b24/binaries/jdk-6u10-beta-bin-b24-linux-i586-14_may_2008-rpm.bin
ant: http://apache.mirror.phpchina.com/ant/binaries/apache-ant-1.7.0-bin.tar.gz
lucene: http://apache.mirror.phpchina.com/lucene/java/lucene-2.3.2.tar.gz
javacc: https://javacc.dev.java.net/files/documents/17/26776/javacc-4.0.tar.gz
php-java bridge: http://nchc.dl.sourceforge.net/sourceforge/php-java-bridge/php-java-bridge_5.2.2.tar.gz
tomcat: http://apache.mirror.phpchina.com/tomcat/tomcat-6/v6.0.16/bin/apache-tomcat-6.0.16.tar.gz

If you use Tomcat, you can skip step 6.

1. Install the Java environment

[root@dev ~]# java -version
java version "1.4.2"
gcj (GCC) 3.4.3 20041212 (Red Hat 3.4.3-9.EL4)

[root@dev ~]# rpm -qa | grep java
java-1.4.2-gcj-compat-1.4.2.0-26jpp

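Note: you normally do not need to uninstall the JRE with RPM, because RPM can remove the old JRE automatically when you install a new version. Skip this step unless you intend to remove the JRE permanently.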

[root@dev ~]# rpm   -e   java-1.4.2-gcj-compat-1.4.2.0-26jpp

Download the JDK package from http://download.java.net/jdk6/ (--limit-rate=20000 caps the download at about 20 KB/s):
[root@dev ~]# wget --limit-rate=20000 http://www.java.net/download/jdk6/6u10/promoted/b24/binaries/jdk-6u10-beta-bin-b24-linux-i586-14_may_2008-rpm.bin

[root@dev ~]# chmod 755 jdk-6u10-beta-bin-b24-linux-i586-14_may_2008-rpm.bin\?e\=1212404509\&h\=a151b74ce54cda9cba81a7444944c0ba

[root@dev ~]# ./jdk-6u10-beta-bin-b24-linux-i586-14_may_2008-rpm.bin\?e\=1212404509\&h\=a151b74ce54cda9cba81a7444944c0ba
Press the space bar to page through the license, then type yes.
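The downloaded file above carries a "?e=...&h=..." suffix because the download URL included a query string. A minimal sketch of deriving a clean local file name before downloading follows; the function name clean_name_from_url and the example.com URL are made up for illustration, and only plain POSIX parameter expansion is used.

```shell
#!/bin/sh
# Hypothetical helper: derive a local file name from a download URL.
clean_name_from_url() {
    url=$1
    name=${url##*/}       # drop everything up to and including the last slash
    name=${name%%\?*}     # drop the query string, if there is one
    printf '%s\n' "$name"
}

# Illustrative URL only; substitute the real download link.
clean_name_from_url 'http://example.com/binaries/jdk-rpm.bin?e=1212404509&h=abc'
```

The result can then be passed to wget's -O option, for example wget -O "$(clean_name_from_url "$url")" "$url", so that the later chmod and run steps need no escaping.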

[root@dev ~]# vi /etc/profile
JAVA_HOME=/usr/java/jdk1.6.0_10
export JAVA_HOME
PATH=$PATH:$JAVA_HOME/bin
export PATH
CLASSPATH=.:$JAVA_HOME/lib/dt.jar:$JAVA_HOME/lib/tools.jar
export CLASSPATH

Note: shells in the Bourne family do not use set for assignment. I first wrote these lines as "set JAVA_HOME=..." and spent a frustratingly long time finding out why the variables had no effect.
[root@dev ~]# source /etc/profile
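To see why the set form fails, here is a small sketch. In a Bourne-style shell, set NAME=value replaces the positional parameters instead of assigning a variable. DEMO_HOME is a stand-in name chosen for illustration so that the real JAVA_HOME is not touched.

```shell
#!/bin/sh
# Hypothetical demo: DEMO_HOME stands in for JAVA_HOME.
demo_set_vs_assign() {
    unset DEMO_HOME
    # `set NAME=value` does not assign NAME; the whole string becomes $1.
    set DEMO_HOME=/usr/java/jdk1.6.0_10
    echo "after set:        DEMO_HOME=[${DEMO_HOME:-}] \$1=[$1]"
    # A plain assignment followed by export is what /etc/profile needs.
    DEMO_HOME=/usr/java/jdk1.6.0_10
    export DEMO_HOME
    echo "after assignment: DEMO_HOME=[$DEMO_HOME]"
}

demo_set_vs_assign
```

The first line of output shows an empty DEMO_HOME with the text sitting in $1; the second shows the variable set as intended.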

Create a file named Test.java in a text editor, enter the following code, and save it:

    public class Test {
        public static void main(String[] args) {
            System.out.println("A new jdk test !");
        }
    }

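Compile: in a shell, run javac Test.java

If this fails, check that $JAVA_HOME/bin is on your PATH. javac ships with the JDK; step 4 below installs JavaCC, which is a different tool.

Run: in a shell, run java Test

If the shell prints "A new jdk test !", the JDK is working.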

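2. Install Ant

http://ant.apache.org/bindownload.cgi
Ant is a Java-based build automation engine whose scripts are written in XML. Besides Java compilation tasks, Ant can invoke many other applications through plugins, and its scripts are somewhat easier to maintain than makefiles.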

[root@dev ~]# wget http://apache.mirror.phpchina.com/ant/binaries/apache-ant-1.7.0-bin.tar.gz

[root@dev ~]# tar zxvf apache-ant-1.7.0-bin.tar.gz

[root@dev ~]# mv apache-ant-1.7.0 /usr/local/

[root@dev ~]# vi /etc/profile

Add the following before the JAVA_HOME line:
ANT_HOME=/usr/local/apache-ant-1.7.0
export ANT_HOME
Then change the PATH line to:
PATH=$PATH:$JAVA_HOME/bin:$ANT_HOME/bin

[root@dev ~]# source /etc/profile 

3. Install Lucene

[root@dev ~]# wget http://apache.mirror.phpchina.com/lucene/java/lucene-2.3.2.tar.gz
Download this archive, not lucene-2.3.2-src.tar.gz, which does not include lucene-demos-2.3.2.jar.
[root@dev ~]# tar zxvf lucene-2.3.2.tar.gz

[root@dev ~]# mv lucene-2.3.2 /usr/local

4. Install JavaCC

[root@dev ~]# wget https://javacc.dev.java.net/files/documents/17/26776/javacc-4.0.tar.gz

[root@dev ~]# gunzip javacc-4.0.tar.gz

[root@dev ~]# tar -xvf javacc-4.0.tar

[root@dev ~]# mv javacc-4.0 /usr/local/

[root@dev ~]# cd  /usr/local/lucene-2.3.2

[root@dev ~]# echo javacc.home=/usr/local/javacc-4.0 > ~/build.properties

[root@dev ~]# ant

5. Test Lucene

Edit /etc/profile again. Before the CLASSPATH line, add:
LUCENE_HOME=/usr/local/lucene-2.3.2
Then change the CLASSPATH line to:
CLASSPATH=.:${JAVA_HOME}/lib/dt.jar:${JAVA_HOME}/lib/tools.jar:${LUCENE_HOME}/lucene-core-2.3.2.jar:${LUCENE_HOME}/lucene-demos-2.3.2.jar

[root@dev ~]# source /etc/profile

Build the index:

[root@dev ~]# cd ./src/demo

[root@dev demo]# java org.apache.lucene.demo.IndexFiles /usr/local/lucene-2.3.2/docs
Exception in thread "main" java.lang.NoClassDefFoundError: org/apache/lucene/demo/IndexFiles
Caused by: java.lang.ClassNotFoundException: org.apache.lucene.demo.IndexFiles
        at java.net.URLClassLoader$1.run(URLClassLoader.java:200)
        at java.security.AccessController.doPrivileged(Native Method)
        at java.net.URLClassLoader.findClass(URLClassLoader.java:188)
        at java.lang.ClassLoader.loadClass(ClassLoader.java:307)
        at sun.misc.Launcher$AppClassLoader.loadClass(Launcher.java:301)
        at java.lang.ClassLoader.loadClass(ClassLoader.java:252)
        at java.lang.ClassLoader.loadClassInternal(ClassLoader.java:320)
Could not find the main class: org.apache.lucene.demo.IndexFiles. Program will exit.

If you see the error above, CLASSPATH is probably set incorrectly.
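One common cause of this error is a CLASSPATH entry that points to a file that does not exist, for example a jar name with the wrong version. As a quick check, the sketch below walks a colon-separated class path and reports every entry that is missing. The function name check_classpath is made up for illustration and is not part of the JDK or Lucene.

```shell
#!/bin/sh
# Hypothetical helper: report which entries of a colon-separated
# class path exist on disk. Returns 1 if any entry is missing.
check_classpath() {
    cp_value=$1
    missing=0
    old_ifs=$IFS
    IFS=:
    for entry in $cp_value; do
        IFS=$old_ifs
        if [ -z "$entry" ]; then
            continue            # skip empty entries such as "a::b"
        fi
        if [ -e "$entry" ]; then
            echo "ok       $entry"
        else
            echo "MISSING  $entry"
            missing=1
        fi
    done
    IFS=$old_ifs
    return $missing
}

# Check the current CLASSPATH (falls back to "." if it is unset).
check_classpath "${CLASSPATH:-.}" || echo "some CLASSPATH entries are missing"
```

Any line marked MISSING is a path to correct in /etc/profile before running the demo again.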

To search, run the following command; a search prompt will appear.
[root@dev demo]# java org.apache.lucene.demo.SearchFiles

6. Install the PHP/Java Bridge

What is the PHP/Java Bridge? The php/Java bridge is an optimized, XML-based network protocol, which can be used to connect a native script engine, PHP, with a Java or ECMA 335 virtual machine. It is more than 50 times faster than local RPC via SOAP, requires less resources on the web-server side, and it is faster and more reliable than direct communication via the Java Native Interface. Read more: http://php-java-bridge.sourceforge.net

[root@dev ~]# wget --limit-rate=15000 http://nchc.dl.sourceforge.net/sourceforge/php-java-bridge/php-java-bridge_5.2.2.tar.gz

[root@dev php-java-bridge-5.2.2]# /opt/lampp/bin/phpize && ./configure --disable-servlet --with-java=/usr/java/jdk1.6.0_10 && make CFLAGS="-m32" && make install
./configure: line 2969: php-config: command not found
./configure: line 2970: php-config: command not found
configure: error: Cannot find php-config. Please use --with-php-config=PATH

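The XAMPP development package is missing, and the php-config path has to be given explicitly. Download the development package from: http://sourceforge.net/project/showfiles.php?group_id=61776&package_id=60248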

[root@dev ~]# tar -zxvf xampp-linux-devel-xxx.tar.gz

[root@dev ~]# mv lampp/* /opt/lampp/

mv: cannot overwrite directory `/opt/lampp/lib'
mv: cannot overwrite directory `/opt/lampp/modules'
mv: cannot overwrite directory `/opt/lampp/share'

So I moved the contents over by hand, one directory at a time.
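mv refuses to overwrite a directory that already exists and is not empty, which is why the move above fails. As an alternative to moving things by hand, a recursive copy of "source/." merges one tree into another. The sketch below is a hedged illustration: merge_tree is a made-up name, and the demo runs in a temporary directory, not against /opt/lampp.

```shell
#!/bin/sh
# Hypothetical helper: merge the contents of directory $1 into directory $2,
# keeping files that already exist only in the destination.
merge_tree() {
    src=$1
    dst=$2
    mkdir -p "$dst" || return 1
    # Copying "src/." copies the contents of src, not src itself.
    cp -Rp "$src"/. "$dst"/
}

# Demo in a scratch directory.
demo=$(mktemp -d)
mkdir -p "$demo/src/lib" "$demo/dst/lib"
: > "$demo/src/lib/new-file"
: > "$demo/dst/lib/existing-file"
merge_tree "$demo/src" "$demo/dst"
find "$demo/dst" -type f | sort
rm -rf "$demo"
```

For the case above, the equivalent call would be merge_tree lampp /opt/lampp. Files with the same name in both trees are overwritten by the source copy, so back up first if that matters.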

[root@dev php-java-bridge-5.2.2]# /opt/lampp/bin/phpize && ./configure --disable-servlet --with-php-config=/opt/lampp/bin/php-config --with-java=/usr/java/jdk1.6.0_10 && make CFLAGS="-m32" && make install

make[1]: [php/java/bridge/JavaBridgeIllegalStateException.o] Error 1
make[1]: Leaving directory `/root/php-java-bridge-5.2.2/server'
make: [/root/php-java-bridge-5.2.2/modules/stamp] Error 2
make reports two errors; I ignored them.

[root@dev php-java-bridge-5.2.2]# cp modules/java.so /opt/lampp/modules/

vi /opt/lampp/etc/php.ini
Add the line:
extension="java.so"

[root@dev php-java-bridge-5.2.2]# /opt/lampp/lampp start
Starting XAMPP for Linux 1.6.1...
PHP Warning: PHP Startup: Unable to load dynamic library '/opt/lampp/lib/php/extensions/no-debug-non-zts-20060613/java.so' - /opt/lampp/lib/php/extensions/no-debug-non-zts-20060613/java.so: cannot open shared object file: No such file or directory in Unknown on line 0

[root@dev php-java-bridge-5.2.2]# cp modules/java.so /opt/lampp/lib/php/extensions/no-debug-non-zts-20060613/
[root@dev php-java-bridge-5.2.2]# /opt/lampp/lampp start
Starting XAMPP for Linux 1.6.1...
Exception in thread "main" java.lang.NoClassDefFoundError: php/java/bridge/Standalone
Caused by: java.lang.ClassNotFoundException: php.java.bridge.Standalone
        at java.net.URLClassLoader$1.run(URLClassLoader.java:200)
        at java.security.AccessController.doPrivileged(Native Method)
        at java.net.URLClassLoader.findClass(URLClassLoader.java:188)
        at java.lang.ClassLoader.loadClass(ClassLoader.java:307)
        at sun.misc.Launcher$AppClassLoader.loadClass(Launcher.java:301)
        at java.lang.ClassLoader.loadClass(ClassLoader.java:252)
        at java.lang.ClassLoader.loadClassInternal(ClassLoader.java:320)
Could not find the main class: php.java.bridge.Standalone. Program will exit.

That did not work, so I switched to Tomcat.

7. Install Tomcat

http://tomcat.apache.org/tomcat-6.0-doc/setup.html
[root@dev ~]# wget --limit-rate=20000 http://apache.mirror.phpchina.com/tomcat/tomcat-6/v6.0.16/bin/apache-tomcat-6.0.16.tar.gz
[root@dev ~]# tar -zxvf apache-tomcat-6.0.16.tar.gz
[root@dev ~]# mv apache-tomcat-6.0.16 /usr/local/apache-tomcat

[root@dev ~]# vi /etc/profile
export JDK_HOME=${JAVA_HOME}
export CATALINA_BASE=/usr/local/apache-tomcat
export CATALINA_HOME=/usr/local/apache-tomcat
[root@dev ~]# source /etc/profile
[root@dev ~]# vi /etc/rc.d/rc.local
/usr/local/apache-tomcat/bin/startup.sh

Test: http://192.168.54.96:8080

vi /usr/local/apache-tomcat/conf/server.xml

Add the URIEncoding attribute to the HTTP connector:

<Connector port="8080" protocol="HTTP/1.1" connectionTimeout="20000"
           URIEncoding="UTF-8"
           redirectPort="8443" />

Inside the Host element:

<Host name="localhost" appBase="webapps" unpackWARs="true" autoDeploy="true" xmlValidation="false" xmlNamespaceAware="false">

add the following to make weblucene the root application:

<Context path="" docBase="/usr/local/apache-tomcat/webapps/weblucene" reloadable="true" debug="0" crossContext="true" />

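By default, server.xml contains the following line:

<Server port="8005" shutdown="SHUTDOWN">

This means that anyone who can telnet to port 8005 on the server and type "SHUTDOWN" followed by Enter will shut the server down immediately. For security, change the shutdown command to a string that is hard to guess, for example:

<Server port="8006" shutdown="lizongbo">

With this change, Tomcat can be shut down only by connecting to port 8006 and entering "lizongbo". Note: this change does not affect shutdown.bat; running shutdown.bat still stops the server.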

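See the Tomcat security FAQ (in English): http://jakarta.apache.org/tomcat/faq/security.html#8005
Two further points deserve attention:
1. In Tomcat 3.1, how do you disable automatic directory listings? By default, if you request a directory inside a web application deployed in Tomcat and that directory has no usable welcome file, Tomcat lists every file in it. To turn this default behavior off, edit conf/web.xml and change this block: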

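    <servlet>
        <servlet-name>default</servlet-name>
        <servlet-class>org.apache.catalina.servlets.DefaultServlet</servlet-class>
        <init-param>
            <param-name>debug</param-name>
            <param-value>0</param-value>
        </init-param>
        <init-param>
            <param-name>listings</param-name>
            <param-value>true</param-value>
        </init-param>
        <load-on-startup>1</load-on-startup>
    </servlet>

to:

    <servlet>
        <servlet-name>default</servlet-name>
        <servlet-class>org.apache.catalina.servlets.DefaultServlet</servlet-class>
        <init-param>
            <param-name>debug</param-name>
            <param-value>0</param-value>
        </init-param>
        <init-param>
            <param-name>listings</param-name>
            <param-value>false</param-value>
        </init-param>
        <load-on-startup>1</load-on-startup>
    </servlet>

The default shutdown.sh hung the machine as soon as I ran it, so I replaced it with a script found online: http://noroot.info/node/16153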

cd /usr/local/apache-tomcat/bin

mv shutdown.sh shutdown.sh.old

vi /usr/local/apache-tomcat/bin/shutdown.sh // create the new shutdown.sh script that stops the service

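#!/bin/sh
# Find the PID of the process using port 8080 (netstat prints it as PID/name in column 7).
TOMCAT_PID=`/bin/netstat -anp | /bin/grep :8080 | /bin/gawk '{print $7}' | /bin/gawk -F [/] '{print $1}'`
# Force-kill that process; any error message from kill is discarded.
/bin/kill -9 $TOMCAT_PID 2>/dev/null
if [ $? -ne 0 ]; then
    echo 'Tomcat is not running.'
else
    echo "Succeed to shutdown tomcat."
fi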

chmod a+x shutdown.sh // make the new script executable
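The grep :8080 step in the script matches every netstat line that mentions the port, including established client connections. A narrower variant, sketched below under the assumption that netstat -anp prints the usual seven columns (protocol, queues, local address, remote address, state, PID/program), keeps only sockets in the LISTEN state. The function name pids_listening_on is made up for illustration, and the sample lines are invented input, not output captured from the server.

```shell
#!/bin/sh
# Hypothetical helper: read `netstat -anp` style lines on stdin and print
# the PID of each process listening on the given port.
pids_listening_on() {
    port=$1
    awk -v port="$port" '
        # $4 = local address, $6 = state, $7 = PID/program
        $6 == "LISTEN" && $4 ~ (":" port "$") {
            split($7, parts, "/")
            print parts[1]
        }'
}

# Demo with invented sample lines: only the first one is a listener on 8080.
pids_listening_on 8080 <<'EOF'
tcp        0      0 :::8080           :::*              LISTEN       1234/java
tcp        0      0 10.0.0.1:8080     10.0.0.2:5555     ESTABLISHED  1234/java
tcp        0      0 0.0.0.0:18080     0.0.0.0:*         LISTEN       99/other
EOF
```

On a live system the input would come from netstat -anp 2>/dev/null | pids_listening_on 8080, run as root so that the PID column is filled in.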

8. Apache integration

This lets visitors reach the site without typing port 8080. Edit Apache's httpd.conf:

ServerName devs.c1gstudio.com
ProxyPass / balancer://cluster/
<Proxy balancer://cluster>
BalancerMember http://192.168.54.96:8080/
</Proxy>

Posted in Lucene, Tomcat, Technology.



One Response


  1. dresses says

    Is there a way to install Lucene for C#? My VC2008 does not have it.


