hadoop ha 集群遇到的问题

发布于:2025-06-20 ⋅ 阅读:(18) ⋅ 点赞:(0)

192.168.64.142:8485: Call From hadoop-001/192.168.64.141 to hadoop-002:8485 failed on connection exception: java.net.ConnectException: Connection refused; For more details see: http://wiki.apache.org/hadoop/ConnectionRefused at org.apache.hadoop.hdfs.qjournal.client.QuorumException.create(QuorumException.java:81) at org.apache.hadoop.hdfs.qjournal.client.QuorumCall.rethrowException(QuorumCall.java:305) at org.apache.hadoop.hdfs.qjournal.client.QuorumJournalManager.hasSomeData(QuorumJournalManager.java:282) at org.apache.hadoop.hdfs.server.common.Storage.confirmFormat(Storage.java:1185) at org.apache.hadoop.hdfs.server.namenode.FSImage.confirmFormat(FSImage.java:212) at org.apache.hadoop.hdfs.server.namenode.NameNode.format(NameNode.java:1274) at org.apache.hadoop.hdfs.server.namenode.NameNode.createNameNode(NameNode.java:1726) at org.apache.hadoop.hdfs.server.namenode.NameNode.main(NameNode.java:1834) 2025-06-19 15:18:55,556 INFO util.ExitUtil: Exiting with status 1: org.apache.hadoop.hdfs.qjournal.client.QuorumException: Unable to check if JNs are ready for formatting. 2 successful responses: 192.168.64.141:8485: false 192.168.64.143:8485: false

连接不上一般也有可能是免密登录没有配置好
cat ~/.ssh/id_rsa.pub >> ~/.ssh/authorized_keys
chmod 700 ~/.ssh
chmod 600 ~/.ssh/authorized_keys
ssh-copy-id hadoop-001
ssh-copy-id hadoop-002
ssh-copy-id hadoop-003

1. JournalNode 服务未启动

确保 hadoop-002 上的 JournalNode 已经正确启动。 格式化之前要启动namenode

# 检查 JournalNode 是否运行
jps

# 如果没有 QuorumPeerMain 或 JournalNode,则启动它:
hadoop-daemon.sh start journalnode

2. namenode 启动了,之后还是连接不上,还不是网络防火墙问题 默认namenode  datanode, journal 都在tmp 但是tmp 存在权限问题,配置的和建的目录必须一样,并且目录中必须为空

mkdir /var/lib/hadoop/journal/
mkdir /var/lib/hadoop/datanode
mkdir /var/lib/hadoop/namenode
chown -R hadoop:hadoop  /var/lib/hadoop/journal
chown -R hadoop:hadoop  /var/lib/hadoop/datanode
chown -R hadoop:hadoop  /var/lib/hadoop/namenode
chmod 755 /var/lib/hadoop/datanode
chmod 755 /var/lib/hadoop/namenode
chmod 755 /var/lib/hadoop/journal
<!-- JournalNode 数据目录 -->
    <property>
        <name>dfs.journalnode.edits.dir</name>
        <value>/var/lib/hadoop/journal</value>
    </property>

    <!-- DataNode 数据目录 -->
    <property>
        <name>dfs.datanode.data.dir</name>
        <value>/var/lib/hadoop/datanode</value>
    </property>

    <!-- HA 故障转移配置 -->
    <property>
        <name>dfs.ha.automatic-failover.enabled</name>
        <value>true</value>
    </property>

    <!-- Fencing 方法(生产环境建议用 sshfence) -->

网站公告

今日签到

点亮在社区的每一天
去签到