
Hadoop: connection refused on port 9000

I want to set up a Hadoop cluster in pseudo-distributed mode for development. Starting the cluster fails because the connection to port 9000 is refused.

These are my configs (pretty standard):


<?xml version="1.0"?>
<?xml-stylesheet type="text/xsl" href="configuration.xsl"?>


<?xml version="1.0"?>
<?xml-stylesheet type="text/xsl" href="configuration.xsl"?>

hadoop-env.sh - here I switched the configuration to IPv4-only mode (see the last line):

# Set Hadoop-specific environment variables here.

# The only required environment variable is JAVA_HOME.  All others are
# optional.  When running a distributed configuration it is best to
# set JAVA_HOME in this file, so that it is correctly defined on
# remote nodes.

# The Java implementation to use.  Required.
export JAVA_HOME=/usr/lib/jvm/java-7-openjdk-amd64

# Extra Java CLASSPATH elements.  Optional.

# The maximum amount of heap to use, in MB. Default is 1000.
# export HADOOP_HEAPSIZE=2000

# Extra Java runtime options.  Empty by default.
# export HADOOP_OPTS=-server

# Command specific options appended to HADOOP_OPTS when specified
export HADOOP_NAMENODE_OPTS="-Dcom.sun.management.jmxremote $HADOOP_NAMENODE_OPTS"
export HADOOP_DATANODE_OPTS="-Dcom.sun.management.jmxremote $HADOOP_DATANODE_OPTS"
export HADOOP_BALANCER_OPTS="-Dcom.sun.management.jmxremote $HADOOP_BALANCER_OPTS"
export HADOOP_JOBTRACKER_OPTS="-Dcom.sun.management.jmxremote $HADOOP_JOBTRACKER_OPTS"
# The following applies to multiple commands (fs, dfs, fsck, distcp etc)

# Extra ssh options.  Empty by default.
# export HADOOP_SSH_OPTS="-o ConnectTimeout=1 -o SendEnv=HADOOP_CONF_DIR"

# Where log files are stored.  $HADOOP_HOME/logs by default.

# File naming remote slave hosts.  $HADOOP_HOME/conf/slaves by default.
# export HADOOP_SLAVES=${HADOOP_HOME}/conf/slaves

# Host:path where hadoop code should be rsync'd from.  Unset by default.
# export HADOOP_MASTER=master:/home/$USER/src/hadoop

# Seconds to sleep between slave commands.  Unset by default.  This
# can be useful in large clusters, where, e.g., slave rsyncs can
# otherwise arrive faster than the master can service them.

# The directory where pid files are stored. /tmp by default.
# export HADOOP_PID_DIR=/var/hadoop/pids

# A string representing this instance of hadoop. $USER by default.

# The scheduling priority for daemon processes.  See 'man nice'.

# Disabling IPv6 for HADOOP
export HADOOP_OPTS=-Djava.net.preferIPv4Stack=true

/etc/hosts:   localhost   zaphod

# The following lines are desirable for IPv6 capable hosts
::1     ip6-localhost ip6-loopback
fe00::0 ip6-localnet
ff00::0 ip6-mcastprefix
ff02::1 ip6-allnodes
ff02::2 ip6-allrouters

But at startup, after calling ./start-dfs.sh, the following lines show up in the log files:


2013-08-19 21:21:59,430 INFO org.apache.hadoop.hdfs.server.datanode.DataNode: STARTUP_MSG: 
STARTUP_MSG: Starting DataNode
STARTUP_MSG:   Host = zaphod/
STARTUP_MSG:   args = []
STARTUP_MSG:   version =
STARTUP_MSG:   build = git://hrt8n35.cc1.ygridcore.net/ on branch branch-0.20-security-204 -r 65e258bf0813ac2b15bb4c954660eaf9e8fba141; compiled by 'hortonow' on Thu Aug 25 23:25:52 UTC 2011
2013-08-19 21:22:03,950 INFO org.apache.hadoop.metrics2.impl.MetricsConfig: loaded properties from hadoop-metrics2.properties
2013-08-19 21:22:04,052 INFO org.apache.hadoop.metrics2.impl.MetricsSourceAdapter: MBean for source MetricsSystem,sub=Stats registered.
2013-08-19 21:22:04,064 INFO org.apache.hadoop.metrics2.impl.MetricsSystemImpl: Scheduled snapshot period at 10 second(s).
2013-08-19 21:22:04,065 INFO org.apache.hadoop.metrics2.impl.MetricsSystemImpl: DataNode metrics system started
2013-08-19 21:22:07,054 INFO org.apache.hadoop.metrics2.impl.MetricsSourceAdapter: MBean for source ugi registered.
2013-08-19 21:22:07,060 WARN org.apache.hadoop.metrics2.impl.MetricsSystemImpl: Source name ugi already exists!
2013-08-19 21:22:08,709 INFO org.apache.hadoop.ipc.Client: Retrying connect to server: localhost/ Already tried 0 time(s).
2013-08-19 21:22:09,710 INFO org.apache.hadoop.ipc.Client: Retrying connect to server: localhost/ Already tried 1 time(s).
2013-08-19 21:22:10,711 INFO org.apache.hadoop.ipc.Client: Retrying connect to server: localhost/ Already tried 2 time(s).
2013-08-19 21:22:11,712 INFO org.apache.hadoop.ipc.Client: Retrying connect to server: localhost/ Already tried 3 time(s).
2013-08-19 21:22:12,712 INFO org.apache.hadoop.ipc.Client: Retrying connect to server: localhost/ Already tried 4 time(s).
2013-08-19 21:22:13,713 INFO org.apache.hadoop.ipc.Client: Retrying connect to server: localhost/ Already tried 5 time(s).
2013-08-19 21:22:14,714 INFO org.apache.hadoop.ipc.Client: Retrying connect to server: localhost/ Already tried 6 time(s).
2013-08-19 21:22:15,714 INFO org.apache.hadoop.ipc.Client: Retrying connect to server: localhost/ Already tried 7 time(s).
2013-08-19 21:22:16,715 INFO org.apache.hadoop.ipc.Client: Retrying connect to server: localhost/ Already tried 8 time(s).
2013-08-19 21:22:17,716 INFO org.apache.hadoop.ipc.Client: Retrying connect to server: localhost/ Already tried 9 time(s).
2013-08-19 21:22:17,717 INFO org.apache.hadoop.ipc.RPC: Server at localhost/ not available yet, Zzzzz...


2013-08-19 21:21:59,443 INFO org.apache.hadoop.hdfs.server.namenode.NameNode: STARTUP_MSG: 
STARTUP_MSG: Starting NameNode
STARTUP_MSG:   Host = zaphod/
STARTUP_MSG:   args = []
STARTUP_MSG:   version =
STARTUP_MSG:   build = git://hrt8n35.cc1.ygridcore.net/ on branch branch-0.20-security-204 -r 65e258bf0813ac2b15bb4c954660eaf9e8fba141; compiled by 'hortonow' on Thu Aug 25 23:25:52 UTC 2011
2013-08-19 21:22:03,950 INFO org.apache.hadoop.metrics2.impl.MetricsConfig: loaded properties from hadoop-metrics2.properties
2013-08-19 21:22:04,052 INFO org.apache.hadoop.metrics2.impl.MetricsSourceAdapter: MBean for source MetricsSystem,sub=Stats registered.
2013-08-19 21:22:04,064 INFO org.apache.hadoop.metrics2.impl.MetricsSystemImpl: Scheduled snapshot period at 10 second(s).
2013-08-19 21:22:04,064 INFO org.apache.hadoop.metrics2.impl.MetricsSystemImpl: NameNode metrics system started
2013-08-19 21:22:06,050 INFO org.apache.hadoop.metrics2.impl.MetricsSourceAdapter: MBean for source ugi registered.
2013-08-19 21:22:06,056 WARN org.apache.hadoop.metrics2.impl.MetricsSystemImpl: Source name ugi already exists!
2013-08-19 21:22:06,095 INFO org.apache.hadoop.metrics2.impl.MetricsSourceAdapter: MBean for source jvm registered.
2013-08-19 21:22:06,097 INFO org.apache.hadoop.metrics2.impl.MetricsSourceAdapter: MBean for source NameNode registered.
2013-08-19 21:22:06,232 INFO org.apache.hadoop.hdfs.util.GSet: VM type       = 64-bit
2013-08-19 21:22:06,234 INFO org.apache.hadoop.hdfs.util.GSet: 2% max memory = 17.77875 MB
2013-08-19 21:22:06,235 INFO org.apache.hadoop.hdfs.util.GSet: capacity      = 2^21 = 2097152 entries
2013-08-19 21:22:06,235 INFO org.apache.hadoop.hdfs.util.GSet: recommended=2097152, actual=2097152
2013-08-19 21:22:06,748 INFO org.apache.hadoop.hdfs.server.namenode.FSNamesystem: fsOwner=pschmidt
2013-08-19 21:22:06,748 INFO org.apache.hadoop.hdfs.server.namenode.FSNamesystem: supergroup=hadoop
2013-08-19 21:22:06,748 INFO org.apache.hadoop.hdfs.server.namenode.FSNamesystem: isPermissionEnabled=true
2013-08-19 21:22:06,754 INFO org.apache.hadoop.hdfs.server.namenode.FSNamesystem: dfs.block.invalidate.limit=100
2013-08-19 21:22:06,768 INFO org.apache.hadoop.hdfs.server.namenode.FSNamesystem: isAccessTokenEnabled=false accessKeyUpdateInterval=0 min(s), accessTokenLifetime=0 min(s)
2013-08-19 21:22:07,262 INFO org.apache.hadoop.hdfs.server.namenode.FSNamesystem: Registered FSNamesystemStateMBean and NameNodeMXBean
2013-08-19 21:22:07,322 INFO org.apache.hadoop.hdfs.server.namenode.NameNode: Caching file names occuring more than 10 times 
2013-08-19 21:22:07,326 INFO org.apache.hadoop.hdfs.server.common.Storage: Storage directory /home/pschmidt/hacking/hadoop- does not exist.
2013-08-19 21:22:07,329 ERROR org.apache.hadoop.hdfs.server.namenode.FSNamesystem: FSNamesystem initialization failed.
org.apache.hadoop.hdfs.server.common.InconsistentFSStateException: Directory /home/pschmidt/hacking/hadoop- is in an inconsistent state: storage directory does not exist or is not accessible.
    at org.apache.hadoop.hdfs.server.namenode.FSImage.recoverTransitionRead(FSImage.java:291)
    at org.apache.hadoop.hdfs.server.namenode.FSDirectory.loadFSImage(FSDirectory.java:97)
    at org.apache.hadoop.hdfs.server.namenode.FSNamesystem.initialize(FSNamesystem.java:379)
    at org.apache.hadoop.hdfs.server.namenode.FSNamesystem.<init>(FSNamesystem.java:353)
    at org.apache.hadoop.hdfs.server.namenode.NameNode.initialize(NameNode.java:254)
    at org.apache.hadoop.hdfs.server.namenode.NameNode.<init>(NameNode.java:434)
    at org.apache.hadoop.hdfs.server.namenode.NameNode.createNameNode(NameNode.java:1153)
    at org.apache.hadoop.hdfs.server.namenode.NameNode.main(NameNode.java:1162)
2013-08-19 21:22:07,331 ERROR org.apache.hadoop.hdfs.server.namenode.NameNode: org.apache.hadoop.hdfs.server.common.InconsistentFSStateException: Directory /home/pschmidt/hacking/hadoop- is in an inconsistent state: storage directory does not exist or is not accessible.
    at org.apache.hadoop.hdfs.server.namenode.FSImage.recoverTransitionRead(FSImage.java:291)
    at org.apache.hadoop.hdfs.server.namenode.FSDirectory.loadFSImage(FSDirectory.java:97)
    at org.apache.hadoop.hdfs.server.namenode.FSNamesystem.initialize(FSNamesystem.java:379)
    at org.apache.hadoop.hdfs.server.namenode.FSNamesystem.<init>(FSNamesystem.java:353)
    at org.apache.hadoop.hdfs.server.namenode.NameNode.initialize(NameNode.java:254)
    at org.apache.hadoop.hdfs.server.namenode.NameNode.<init>(NameNode.java:434)
    at org.apache.hadoop.hdfs.server.namenode.NameNode.createNameNode(NameNode.java:1153)
    at org.apache.hadoop.hdfs.server.namenode.NameNode.main(NameNode.java:1162)

2013-08-19 21:22:07,332 INFO org.apache.hadoop.hdfs.server.namenode.NameNode: SHUTDOWN_MSG: 
SHUTDOWN_MSG: Shutting down NameNode at zaphod/
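
In a log this long, the first ERROR entry is usually the most useful place to start reading. As a rough sketch (the helper name first_error is made up for illustration, and it assumes nothing beyond the " ERROR " level marker visible in the log above), this prints that entry together with its line number:

```shell
#!/bin/sh
# first_error LOGFILE
# Prints the first line of LOGFILE that carries the " ERROR " level marker,
# prefixed with its line number (grep -n format: "N:line").
first_error() {
    grep -n ' ERROR ' "$1" | head -n 1
}
```

Called on the NameNode log shown above, it would print the "FSNamesystem initialization failed." entry, which is immediately followed by the exception naming the missing storage directory. The log file location depends on the installation; $HADOOP_HOME/logs is the default according to the comment in hadoop-env.sh.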

After reformatting HDFS, the following output is displayed:

13/08/19 21:50:21 INFO namenode.NameNode: STARTUP_MSG: 
STARTUP_MSG: Starting NameNode
STARTUP_MSG:   Host = zaphod/
STARTUP_MSG:   args = [-format]
STARTUP_MSG:   version =
STARTUP_MSG:   build = git://hrt8n35.cc1.ygridcore.net/ on branch branch-0.20-security-204 -r 65e258bf0813ac2b15bb4c954660eaf9e8fba141; compiled by 'hortonow' on Thu Aug 25 23:25:52 UTC 2011
Re-format filesystem in ~/hacking/hd-data/nn ? (Y or N) Y
13/08/19 21:50:26 INFO util.GSet: VM type       = 64-bit
13/08/19 21:50:26 INFO util.GSet: 2% max memory = 17.77875 MB
13/08/19 21:50:26 INFO util.GSet: capacity      = 2^21 = 2097152 entries
13/08/19 21:50:26 INFO util.GSet: recommended=2097152, actual=2097152
13/08/19 21:50:27 INFO namenode.FSNamesystem: fsOwner=pschmidt
13/08/19 21:50:27 INFO namenode.FSNamesystem: supergroup=hadoop
13/08/19 21:50:27 INFO namenode.FSNamesystem: isPermissionEnabled=true
13/08/19 21:50:27 INFO namenode.FSNamesystem: dfs.block.invalidate.limit=100
13/08/19 21:50:27 INFO namenode.FSNamesystem: isAccessTokenEnabled=false accessKeyUpdateInterval=0 min(s), accessTokenLifetime=0 min(s)
13/08/19 21:50:27 INFO namenode.NameNode: Caching file names occuring more than 10 times 
13/08/19 21:50:27 INFO common.Storage: Image file of size 110 saved in 0 seconds.
13/08/19 21:50:28 INFO common.Storage: Storage directory ~/hacking/hd-data/nn has been successfully formatted.
13/08/19 21:50:28 INFO namenode.NameNode: SHUTDOWN_MSG: 
SHUTDOWN_MSG: Shutting down NameNode at zaphod/

Running netstat -lpten | grep java gives:

tcp        0      0 *               LISTEN      1000       50995       9875/java       
tcp        0      0 *               LISTEN      1000       51775       9639/java       
tcp6       0      0 :::2181                 :::*                    LISTEN      1000       20841       2659/java       
tcp6       0      0 :::36743                :::*                    LISTEN      1000       20524       2659/java 

Running netstat -lpten | grep 9000 returns nothing, which suggests that no application is bound to that port.

What else can I check to get HDFS up and running? Feel free to ask for more logs and configuration files.

Thanks in advance.

dyster fyr

Use an absolute path for this, and make sure the hadoop user has the permissions needed to access that directory:


Also make sure to format this path with:

# hadoop namenode -format
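
Before formatting, it can help to verify those two conditions from the shell. The format output in the question reports a literal ~/hacking/hd-data/nn; a Java process does not expand ~ the way a shell does, so such a value is treated as a relative path. The following is only a sketch: check_storage_dir is a hypothetical helper, and the directory to test is whatever your configuration points at, passed as an argument.

```shell
#!/bin/sh
# check_storage_dir PATH
# Succeeds (and prints "ok: PATH") only if PATH is absolute, exists as a
# directory, and is writable and searchable by the user running the script.
check_storage_dir() {
    case $1 in
        /*) ;;  # absolute path, keep going
        *)  echo "not an absolute path: $1" >&2
            return 1 ;;
    esac
    if [ ! -d "$1" ]; then
        echo "missing directory: $1" >&2
        return 1
    fi
    if [ ! -w "$1" ] || [ ! -x "$1" ]; then
        echo "not writable by $(id -un): $1" >&2
        return 1
    fi
    echo "ok: $1"
}
```

Run it as the user that starts the Hadoop daemons, for example check_storage_dir /home/pschmidt/hacking/hd-data/nn (that path is inferred from the logs in the question and may differ on your machine). A quoted '~/hacking/hd-data/nn' is rejected as not absolute.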

The short answer is that your NameNode service is not running. Simply run:


Try this command:

telnet localhost 9000 

If telnet reports that the connection was refused, start Hadoop with:


Then run telnet again; this time you should see the connection being established.
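
If telnet is not installed, the same check can be scripted. The sketch below assumes bash, because it relies on bash's /dev/tcp redirection; wait_for_port is a made-up helper name, and the retry count and delay are arbitrary defaults chosen to resemble the DataNode's own "Already tried N time(s)" loop.

```shell
#!/usr/bin/env bash
# wait_for_port HOST PORT [TRIES] [DELAY_SECONDS]
# Returns 0 as soon as a TCP connection to HOST:PORT succeeds,
# or 1 after TRIES failed attempts. Requires bash (/dev/tcp).
wait_for_port() {
    local host=$1 port=$2 tries=${3:-10} delay=${4:-1}
    local i=0
    while [ "$i" -lt "$tries" ]; do
        # The subshell opens the connection and closes it again on exit.
        if (exec 3<>"/dev/tcp/$host/$port") 2>/dev/null; then
            echo "open: $host:$port"
            return 0
        fi
        i=$((i + 1))
        sleep "$delay"
    done
    echo "refused: $host:$port after $tries tries"
    return 1
}
```

For example, wait_for_port localhost 9000 reports "open" once the NameNode is listening, and "refused" after ten attempts if it never comes up, in which case the NameNode log is the next thing to read.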
