你卡桑德拉集群无法部署。 副本状态变为PERMANENTLY_FAILING。 副本是不健康的

2019-10-21 08:36发布

我尝试使用谷歌计算引擎,没有成功部署卡桑德拉集群。 我试了好几次,错误总是相同的:

module: DEPLOYMENT_FAILED
Replica module-1234 failed with status PERMANENTLY_FAILING: Replica State
changed to PERMANENTLY_FAILING. Replica was unhealthy 2 consecutive times.

下面这个简短的故障排除指南后: https://cloud.google.com/solutions/cassandra/click-to-deploy#troubleshooting ,日志如下:

antoniogallo88_gmail_com@cassandra-coord-v8ip:/gagent/metaOutput$ tail $(ls -1tr /gagent/metaOutput/stderr.*.txt | 
tail -n 1)
Still waiting for resourceview cassandranode-4da4e to have 3 members ...
Still waiting for resourceview cassandranode-4da4e to have 3 members ...
Still waiting for resourceview cassandranode-4da4e to have 3 members ...
Still waiting for resourceview cassandranode-4da4e to have 3 members ...
Still waiting for resourceview cassandranode-4da4e to have 3 members ...
Still waiting for resourceview cassandranode-4da4e to have 3 members ...
Still waiting for resourceview cassandranode-4da4e to have 3 members ...
Still waiting for resourceview cassandranode-4da4e to have 3 members ...
Still waiting for resourceview cassandranode-4da4e to have 3 members ...
[ERROR] resourceview cassandranode-4da4e does not have 3 members after 60 attempts.

你有任何想法如何解决这一问题?

谢谢。

安东尼奥

Answer 1:

你可以检查实例类型您选择(在内核#)和集群成员的数量超过您使用的项目CPU配额? 还需要检查磁盘容量值与你的整体磁盘配额。

您可以检查最大允许磁盘和CPU配额下的Compute Engine>配额控制台。

这听起来像是即使控制台不堆焊配额错误配额问题。

你可以做的另一件事就是创建另一个部署,然后迅速切换到该实例列表页面查找名为“卡桑德拉 - 坐标-foo”的一个实例是管理磁盘创建一个短命的实例。 如果您在部署过程中ssh到该节点并运行以下命令,您可能会看到一个磁盘或CPU配额警告:

tail -f /gagent/metaOutput/*

克里斯



文章来源: Your Cassandra cluster failed to deploy. Replica State changed to PERMANENTLY_FAILING. Replica was unhealthy 2 consecutive times