Quick Start

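Kubedoop is a Kubernetes-based data platform that provides a complete set of data tools covering ingestion, storage, processing, and analysis. Kubedoop is deployed to a Kubernetes cluster through Operators, so users can deploy and manage data processing workloads with simple configuration files.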

This quick start uses Kubedoop's hive-operator as an example to show how to deploy a Hive Metastore on a Kubernetes cluster.

Prerequisites

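To start using Kubedoop, the following requirement must be met:

Kubedoop relies on Operator Lifecycle Manager (OLM) to manage Operators. If OLM is not installed in your cluster, you can install it with the following command: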

curl -sL https://github.com/operator-framework/operator-lifecycle-manager/releases/download/v0.25.0/install.sh | bash -s v0.25.0
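
To re-check later that OLM is still healthy, a helper along these lines may be useful. It is a minimal sketch, assuming OLM lives in the `olm` namespace with deployments named `olm-operator` and `catalog-operator` (the upstream defaults, worth verifying on your cluster); the function name `wait_for_olm` is illustrative only.

```shell
# Sketch: block until the core OLM deployments report a completed rollout.
# Assumes the default `olm` namespace; pass another namespace as $1 if yours differs.
wait_for_olm() {
  ns="${1:-olm}"
  for deploy in olm-operator catalog-operator; do
    # `kubectl rollout status` returns non-zero if the rollout does not finish in time.
    kubectl rollout status "deployment/${deploy}" -n "${ns}" --timeout=120s || return 1
  done
}
```

Calling `wait_for_olm` with no arguments checks the `olm` namespace; a non-zero exit status suggests that one of the deployments did not become available within the timeout.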

OLM manages Operator installation through Subscriptions. We can install hive-operator by creating a Subscription:

kubectl apply -f https://raw.githubusercontent.com/kubedoop.dev/kubedatastack/main/examples/hive/olm-subscriptions.yaml
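
For readers unfamiliar with OLM, the general shape of a Subscription can be sketched as follows. This is not the content of olm-subscriptions.yaml: the package name, channel, catalog source, and namespaces below are hypothetical placeholders, and only the field names (`channel`, `name`, `source`, `sourceNamespace`) come from the OLM Subscription API. The function name `print_subscription` is illustrative.

```shell
# Sketch: print an OLM Subscription manifest for inspection.
# Every value below is a hypothetical placeholder, not taken from the real example file.
print_subscription() {
  cat <<EOF
apiVersion: operators.coreos.com/v1alpha1
kind: Subscription
metadata:
  name: ${1:-hive-operator}          # hypothetical Subscription name
  namespace: ${2:-operator}          # assumed namespace for the operator
spec:
  channel: stable                    # hypothetical update channel
  name: ${1:-hive-operator}          # package name in the catalog (assumed)
  source: example-catalog            # hypothetical CatalogSource name
  sourceNamespace: olm               # namespace of that CatalogSource (assumed)
EOF
}
```

Running `print_subscription` writes the manifest to standard output, which makes it easy to compare against the Subscription objects actually present in the cluster.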

Verify that the operator pod is running:

kubectl get pods -n operator
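
Instead of polling the pod list by hand, the check can be scripted with `kubectl wait`. The helper below is a minimal sketch; the function name `wait_for_pods` and the default timeout of 300s are assumptions chosen for illustration.

```shell
# Sketch: wait until every pod in a namespace reports the Ready condition.
# $1 = namespace (defaults to `operator`), $2 = timeout (defaults to 300s).
wait_for_pods() {
  ns="${1:-operator}"
  kubectl wait --for=condition=Ready pods --all -n "${ns}" --timeout="${2:-300s}"
}
```

`wait_for_pods` checks the `operator` namespace, and `wait_for_pods hive 600s` could be reused once the Hive cluster has been created. Note that `kubectl wait` fails immediately when the namespace contains no pods yet, and pods belonging to finished Jobs never become Ready, so the command may time out if such pods are present.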

Create a hive namespace in which to deploy the Hive cluster:

kubectl create ns hive

The Hive cluster is managed by hive-operator. We can deploy a Hive Metastore by creating a HiveCluster object:

kubectl apply -f https://raw.githubusercontent.com/kubedoop.dev/kubedatastack/main/examples/hive/hive-metastore.yaml

Once the Hive cluster is deployed, we can open a shell in the Hive Metastore pod with the following command:

kubectl exec -it hive-metastore-0 -n hive -- bash
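
To reach the Metastore from a client on your workstation rather than from inside the pod, port forwarding is one option. The sketch below assumes the Metastore listens on 9083, the default Hive Metastore Thrift port; whether this deployment keeps that default is not confirmed here. The function name `forward_metastore` is illustrative.

```shell
# Sketch: forward a local port to the Metastore pod in the `hive` namespace.
# $1 = pod name (defaults to hive-metastore-0), $2 = local port (defaults to 9083).
# Assumes the container listens on 9083, the default Hive Metastore Thrift port.
forward_metastore() {
  kubectl port-forward "pod/${1:-hive-metastore-0}" "${2:-9083}:9083" -n hive
}
```

While `forward_metastore` is running, a Thrift client could connect to `localhost:9083`; stopping the command with Ctrl+C ends the forwarding.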

Run the following command to clean up the Hive cluster:

kubectl delete -f https://raw.githubusercontent.com/kubedoop.dev/kubedatastack/main/examples/hive/hive-metastore.yaml

Run the following command to remove the operator:

kubectl delete -f https://raw.githubusercontent.com/kubedoop.dev/kubedatastack/main/examples/hive/olm-subscriptions.yaml
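
The hive namespace created earlier is not removed by the commands above. A minimal sketch for deleting it follows; the function name `delete_hive_namespace` is illustrative, and deleting a namespace removes everything still inside it, so use it only when nothing else in the namespace is needed.

```shell
# Sketch: delete the namespace used for the Hive cluster.
# $1 = namespace (defaults to `hive`). --ignore-not-found makes the call
# succeed even if the namespace has already been removed.
delete_hive_namespace() {
  kubectl delete namespace "${1:-hive}" --ignore-not-found
}
```

Calling `delete_hive_namespace` with no arguments targets the `hive` namespace.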