Starting an interactive Spark Cluster
- This example demonstrates how to create, and connect to a Practicus AI Spark cluster, and execute simple Spark operations.
Note on shared drives
- Practicus AI distributed clusters require a shared drive accessible by multiple workers, such as Practicus AI
~/my
or~/shared
folders. - If you do not have access to ~/my or ~/shared folders, please check the auto-scaled examples which does not need such drives, but are limited in functionality.
import practicuscore as prt
# Let's define the distributed features
distributed_config = prt.distributed.JobConfig(
job_type = prt.distributed.JobType.spark,
worker_count = 2,
)
# Let's define worker features of the cluster
worker_config = prt.WorkerConfig(
worker_size="X-Small",
distributed_config=distributed_config,
)
# Creating the coordinator (master) worker
# will also create the cluster.
coordinator_worker = prt.create_worker(
worker_config=worker_config,
)
# Since this is an interactive Spark cluster,
# let's login to execute some code.
notebook_url = coordinator_worker.open_notebook()
print("Page did not open? You can open this url manually:", notebook_url)
Please continue experimenting on the new browser tab
by opening the next notebook in this directory
# Done experimenting? Let's terminate the coordinator
# which will also terminate the cluster.
coordinator_worker.terminate()
Previous: Introduction | Next: Use Cluster