2 Run Hadoop Services - arwankhoiruddin/hadoopLab GitHub Wiki

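First, set up passwordless SSH to localhost. The Hadoop start scripts use SSH to launch the daemons, even on a single machine.

Create an SSH key pair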

ssh-keygen

Append the public key to your authorized keys

cat ~/.ssh/id_rsa.pub >> ~/.ssh/authorized_keys

Test the SSH connection to localhost (it should log you in without asking for a password)

ssh localhost
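Running the `cat ... >>` step more than once appends the same key again each time. As a minimal sketch, the helper below appends the key only when it is missing and tightens the file permissions. The name `authorize_key` is hypothetical and is not part of OpenSSH.

```shell
# Append a public key to an authorized_keys file only if it is not already there.
# authorize_key is a hypothetical helper name, not an OpenSSH command.
authorize_key() {
    pub_key_file="$1"     # for example ~/.ssh/id_rsa.pub
    auth_file="$2"        # for example ~/.ssh/authorized_keys

    mkdir -p "$(dirname "$auth_file")"
    touch "$auth_file"

    # -F: fixed string, -x: match the whole line, -q: no output
    if ! grep -qxF -e "$(cat "$pub_key_file")" "$auth_file"; then
        cat "$pub_key_file" >> "$auth_file"
    fi

    # sshd (with the default StrictModes) refuses keys whose directory
    # or file is writable by other users
    chmod 700 "$(dirname "$auth_file")"
    chmod 600 "$auth_file"
}
```

Assuming the default key location, it would be called as `authorize_key ~/.ssh/id_rsa.pub ~/.ssh/authorized_keys`.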

Start the Hadoop services

hadoop/sbin/start-all.sh

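This starts all the daemons needed to run Hadoop: the NameNode, DataNodes, SecondaryNameNode, ResourceManager, and NodeManagers. The output looks like this: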

WARNING: Attempting to start all Apache Hadoop daemons as arwan_khoiruddin in 10 seconds.
WARNING: This is not a recommended production deployment configuration.
WARNING: Use CTRL-C to abort.
Starting namenodes on [hadoop]
Starting datanodes
Starting secondary namenodes [hadoop]
Starting resourcemanager
resourcemanager is running as process 28858.  Stop it first.
Starting nodemanagers
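The start-up messages do not confirm that every daemon stayed up. One way to check is the JDK's `jps` tool, which lists running Java processes as `<pid> <ClassName>`. The sketch below reads that output and prints any expected daemon that is missing. The helper name `missing_daemons` is hypothetical, and the daemon list assumes the single-node setup described here.

```shell
# Print each expected Hadoop daemon that does not appear in `jps` output.
# Reads jps-style lines ("<pid> <ClassName>") from standard input.
# missing_daemons is a hypothetical helper name for this sketch.
missing_daemons() {
    jps_output="$(cat)"
    for daemon in NameNode DataNode SecondaryNameNode ResourceManager NodeManager; do
        # Compare against the whole second field so that
        # "SecondaryNameNode" does not count as "NameNode"
        if ! printf '%s\n' "$jps_output" |
            awk -v d="$daemon" '$2 == d { found = 1 } END { exit !found }'; then
            echo "$daemon"
        fi
    done
}
```

Run it as `jps | missing_daemons`. No output means all five daemons were found.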

Stop the services (this path and the ones that follow are relative to the hadoop directory)

./sbin/stop-all.sh

Edit core-site.xml

nano ./etc/hadoop/core-site.xml

Add these properties inside the <configuration> element

<property>
    <name>hadoop.tmp.dir</name>
    <value>/home/your_user_name/hadooptmpdata</value>
    <description>Where Hadoop will place all of its working files</description>
</property>
<property>
    <name>fs.defaultFS</name>
    <value>hdfs://localhost:9000</value>
    <description>Where HDFS NameNode can be found on the network</description>
</property>
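After saving the file, it can help to confirm that the values are what you expect. The sketch below prints the value of one property from the file. It is a line-based reader, not a real XML parser, and it assumes each `<name>` and `<value>` tag sits on its own line, as in the properties above. The name `get_property` is hypothetical.

```shell
# Print the <value> of a named property from a Hadoop-style XML config file.
# Line-based sketch: assumes <name> and <value> are each on their own line.
# get_property is a hypothetical helper name.
get_property() {
    config_file="$1"
    property_name="$2"
    awk -v name="$property_name" '
        # Remember when the wanted <name> line has been seen
        index($0, "<name>" name "</name>") { wanted = 1; next }
        # The next <value> line after it holds the setting
        wanted && /<value>/ {
            sub(/.*<value>/, "")
            sub(/<\/value>.*/, "")
            print
            exit
        }
    ' "$config_file"
}
```

For example, `get_property ./etc/hadoop/core-site.xml fs.defaultFS` should print `hdfs://localhost:9000` for the configuration above. If the `hdfs` command is on your PATH, `hdfs getconf -confKey fs.defaultFS` reports the value that Hadoop itself resolves.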