当前位置:首页 > Windows程序 > 正文

教你如何在win7中的cygwin64下安装hadoop

2021-05-24 Windows程序

首先我们要准备如下环境及软件:

win7(64位) cygwin 1.7.9-1 jdk-6u25-windows-x64.zip hadoop-0.20.2.tar.gz

1.在win7系统上正常安装jdk,同时注意设置好java环境的变量:

主要的变量包括:JAVA_HOME,PATH,CLASSPATH

 (不会设置的请自备梯子)

2.接下来是安装Hadoop,我目前安装的版本为0.20.2版本,为了方便,

我暂时是直接放到了cygwin64的/home目录下(正常情况下,请放在/usr目录下),

并使用tar命令进行解压操作。

lenovo@lenovo-PC /home $ tar -zxvf hadoop-0.20.2.tar.gz

3.光安装完Hadoop是不够的,,还需要一些简单的配置工作,主要的配置文件有4个,

它们位于Hadoop的安装目录的conf子目录下,分别是:

hadoop-env.sh core-site.xml hdfs-site.xml mapred-site.xml

下面将是如何修改的详细部分:

(1) 修改hadoop-env.sh文件:

这步比较简单,只需要将JAVA_HOME 修改成JDK 的安装目录即可:

红色标出的是修改后的样子

# Set Hadoop-specific environment variables here. # The only required environment variable is JAVA_HOME. All others are # optional. When running a distributed configuration it is best to # set JAVA_HOME in this file, so that it is correctly defined on # remote nodes. # The java implementation to use. Required. export JAVA_HOME=http://www.mamicode.com/cygdrive/d/android/java/jdk1.7.0_15 # Extra Java CLASSPATH elements. Optional. # export HADOOP_CLASSPATH=http://www.mamicode.com/ # The maximum amount of heap to use, in MB. Default is 1000. # export HADOOP_HEAPSIZE=http://www.mamicode.com/2000 # Extra Java runtime options. Empty by default. # export HADOOP_OPTS=-server # Command specific options appended to HADOOP_OPTS when specified export HADOOP_NAMENODE_OPTS=http://www.mamicode.com/"-Dcom.sun.management.jmxremote $HADOOP_NAMENODE_OPTS" export HADOOP_SECONDARYNAMENODE_OPTS=http://www.mamicode.com/"-Dcom.sun.management.jmxremote $HADOOP_SECONDARYNAMENODE_OPTS" export HADOOP_DATANODE_OPTS=http://www.mamicode.com/"-Dcom.sun.management.jmxremote $HADOOP_DATANODE_OPTS" export HADOOP_BALANCER_OPTS=http://www.mamicode.com/"-Dcom.sun.management.jmxremote $HADOOP_BALANCER_OPTS" export HADOOP_JOBTRACKER_OPTS=http://www.mamicode.com/"-Dcom.sun.management.jmxremote $HADOOP_JOBTRACKER_OPTS" # export HADOOP_TASKTRACKER_OPTS=http://www.mamicode.com/ # The following applies to multiple commands (fs, dfs, fsck, distcp etc) # export HADOOP_CLIENT_OPTS # Extra ssh options. Empty by default. # export HADOOP_SSH_OPTS=http://www.mamicode.com/"-o ConnectTimeout=1 -o SendEnv=HADOOP_CONF_DIR" # Where log files are stored. $HADOOP_HOME/logs by default. # export HADOOP_LOG_DIR=${HADOOP_HOME}/logs # File naming remote slave hosts. $HADOOP_HOME/conf/slaves by default. # export HADOOP_SLAVES=${HADOOP_HOME}/conf/slaves # host:path where hadoop code should be rsyncd from. Unset by default. # export HADOOP_MASTER=master:/home/$USER/src/hadoop # Seconds to sleep between slave commands. Unset by default. This # can be useful in large clusters, where, e.g., slave rsyncs can # otherwise arrive faster than the master can service them. # export HADOOP_SLAVE_SLEEP=http://www.mamicode.com/0.1 # The directory where pid files are stored. /tmp by default. # export HADOOP_PID_DIR=http://www.mamicode.com/var/hadoop/pids # A string representing this instance of hadoop. $USER by default. # export HADOOP_IDENT_STRING=http://www.mamicode.com/$USER # The scheduling priority for daemon processes. See man nice. # export HADOOP_NICENESS=http://www.mamicode.com/10

(注意:这里的路径不能是windows 风格的目录d:\java\jdk1.7.0_15,而是LINUX 风格/cygdrive/d/java/jdk1.7.0_15)

(2) 修改core-site.xml:

红色标出的是增加的代码。

温馨提示: 本文由Jm博客推荐,转载请保留链接: https://www.jmwww.net/file/70032.html