K8s User Guide


1. Pull hyper-zoo Docker Image

You may pull the prebuilt Analytics Zoo hyper-zoo Image from Docker Hub as follows:

sudo docker pull intelanalytics/hyper-zoo:latest

Speed up pulling image by adding mirrors

To speed up pulling the image from DockerHub, you may add the registry-mirrors key and value by editing daemon.json (located in /etc/docker/ folder on Linux):

{
  "registry-mirrors": ["https://<my-docker-mirror-host>"]
}

For instance, users in China may add the USTC mirror as follows:

{
  "registry-mirrors": ["https://docker.mirrors.ustc.edu.cn"]
}

After that, flush changes and restart docker:

sudo systemctl daemon-reload
sudo systemctl restart docker

2. Launch a K8s Client Container

3. Run Analytics Zoo Examples on k8s

Note: Please make sure kubectl has appropriate permission to create, list and delete pod.

3.1 Use init_orca_context

We recommend using init_orca_context in your code to run on standard K8s clusters. <TODO: add detailed descriptions>

3.2 Use spark_submit

Alternatively, you may use spark_submit to run your program on K8s clusters.

Run Python programs

Run Jupyter Notebooks

Run Scala programs

3.3 Access logs and clear pods