
Question

I am new to cloud services, and navigating Google's Cloud Platform is quite intimidating. For Google Dataproc, the components they advertise are Hadoop, Spark, and Hive.

My question is: is Impala available at all?

I would like to run some benchmarking projects using all four of these tools, so I need Apache Impala alongside Spark and Hive.

Recommended answer

You can also try creating a new Dataproc instance (cluster) with a custom setup, instead of using the default one.

For example, you can create a Dataproc cluster with HUE (Hadoop User Experience), a web interface for working with Hadoop clusters that was built by Cloudera. The advantage is that HUE includes Apache Impala among its default apps, along with Pig, Hive, etc., so it's a pretty good starting point for using Impala. Keep in mind that HUE is only the front end: its Impala app needs an Impala service running on the cluster to connect to.
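As a rough sketch of the HUE approach, the snippet below builds the `gcloud dataproc clusters create` command that attaches the `hue.sh` initialization action from the repository linked in this answer. The cluster name, the region, and the `gs://goog-dataproc-initialization-actions-<region>` bucket path are assumptions for illustration, so check the repository README for the current location. The helper `hue_cluster_command` is hypothetical and only assembles the command; it does not run it.

```python
import shlex
from typing import List


def hue_cluster_command(cluster_name: str, region: str) -> List[str]:
    """Build (but do not run) a gcloud command that creates a Dataproc cluster with HUE."""
    # Assumption: the public copy of the hue.sh initialization action lives in a
    # regional bucket named after the region. Verify this path before use.
    init_action = f"gs://goog-dataproc-initialization-actions-{region}/hue/hue.sh"
    return [
        "gcloud", "dataproc", "clusters", "create", cluster_name,
        f"--region={region}",
        f"--initialization-actions={init_action}",
    ]


if __name__ == "__main__":
    # Print the command so it can be reviewed and pasted into a shell.
    # "impala-benchmark" and "us-central1" are placeholder values.
    print(shlex.join(hue_cluster_command("impala-benchmark", "us-central1")))
```

This only sets up HUE on the cluster. Installing Impala itself would still need a separate step, for example a custom initialization script of your own.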

Another solution is to create your own cluster from scratch, but that is not a good idea unless you want to customize everything. That way, you can install Impala yourself.

Here is a link with more information:

https://github.com/GoogleCloudPlatform/dataproc-initialization-actions/tree/master/hue

