本文介绍了为什么我的Google云计算实例总是意外重启?的处理方法,对大家解决问题具有一定的参考价值,需要的朋友们下面随着小编来一起学习吧!

问题描述

帮助!帮助!救命!

这真的很烦人,我几乎受不了了!我使用的是Google云计算引擎实例,但是它们经常意外地重启,而不会事先发出任何通知.实例的重启似乎是随机发生的,我不知道那里出了什么问题!我很确定在重启时实例已被占用(CPU的使用率> 50%,并且所有GPU都在使用中).谁能告诉我如何解决这个问题?预先感谢!

It is really annoying and I almost cannot bear it anymore! I'm using google cloud compute engine instances but they often unexpectedly restart without any notification in advance. The restart of instances seems to happen randomly and I have no idea what's going wrong there! I'm pretty sure that the instances are been occupied (usage of CPUs > 50% and all GPUs are in use) when restart happens. Could anyone please tell me how to solve this problem? Thanks in advance!

推荐答案

问题就在这里:

如果您查看有关GPU的官方文档:

If you check the official documentation about GPU:

这是因为附加了GPU的实例无法迁移到其他主机上进行维护,因为其余虚拟机都会发生这种情况.为了使物理GPU与实例连接并具有裸机性能,您使用的是GPU passthrough,这令人遗憾地意味着,如果主机必须进行维护,则VM将会随之崩溃.

This is because an instance that has a GPU attached cannot be migrated to another host for maintenance as it happens for the rest of the virtual machines. To get a physical GPU attached to the instance and bare metal performance you are using GPU passthrough , which sadly means if the host has to go through maintenance the VM is going down with it.

这篇关于为什么我的Google云计算实例总是意外重启?的文章就介绍到这了,希望我们推荐的答案对大家有所帮助,也希望大家多多支持!

09-21 08:04