Nautilus Job Launcher

Nautilus Job Launcher is a Python library that automates launching jobs on the NRP Nautilus HyperCluster.

Installation

To install the Nautilus Job Launcher, you can use pip:

pip3 install git+https://github.com/MUAMLL/nautilus-job-launcher.git

Alternatively, you can clone this repository and use pip to install it:

git clone https://github.com/MUAMLL/nautilus-job-launcher.git
pip3 install ./nautilus-job-launcher

Usage

Note: You must have your Kubernetes config file in ~/.kube/config to use this library!
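Before launching anything, it can help to confirm that the config file mentioned in the note is actually in place. The following is a minimal sketch using only the Python standard library; the helper names `kube_config_path` and `has_kube_config` are hypothetical and are not part of this library.

```python
from pathlib import Path


def kube_config_path() -> Path:
    """Return the path the note above refers to: ~/.kube/config."""
    return Path.home() / ".kube" / "config"


def has_kube_config() -> bool:
    """Return True if a regular file exists at ~/.kube/config."""
    return kube_config_path().is_file()


if not has_kube_config():
    print(f"No Kubernetes config found at {kube_config_path()}")
```

This only checks that the file exists. It does not verify that the credentials inside are valid for the cluster.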

The launcher can be run from the command line to kick off jobs defined in a YAML config file, or it can be imported as a library by other Python applications.

Both use cases are described below.

Command Line Usage

The job launcher is invoked as a Python module and takes a YAML configuration file:

python3 -m nautiluslauncher -c cfg.yaml

You can perform a dry run by passing the --dryrun flag:

python3 -m nautiluslauncher -c cfg.yaml --dryrun

Running a Subset of Jobs

You can also pass the -j/--jobs flag with a list of the job names you'd like to run. This lets you keep a large number of jobs in a single YAML file, pass that same file to the launcher every time, and run only one job or a subset of jobs at a time:

python3 -m nautiluslauncher -c cfg.yaml -j my-job-1
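Conceptually, selecting a subset amounts to filtering the jobs list by job_name. The sketch below illustrates that idea on plain dictionaries; `select_jobs` is a hypothetical helper, not the launcher's own code, and raising an error on an unknown name is an assumption made for this example.

```python
def select_jobs(jobs, names=None):
    """Keep only the jobs whose job_name appears in names.

    Passing None (or an empty list) keeps every job.
    """
    if not names:
        return list(jobs)
    wanted = set(names)
    known = {job["job_name"] for job in jobs}
    unknown = wanted - known
    if unknown:
        # Assumption: fail loudly rather than silently skip a misspelled name.
        raise KeyError(f"unknown job names: {sorted(unknown)}")
    return [job for job in jobs if job["job_name"] in wanted]


all_jobs = [
    {"job_name": "my-job-1", "image": "python:3.8"},
    {"job_name": "my-job-2", "image": "python:3.8"},
]
selected = select_jobs(all_jobs, ["my-job-1"])
print([job["job_name"] for job in selected])  # prints ['my-job-1']
```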

Using a Base Config

You may also choose to place all of your defaults into a base configuration file, and then place your jobs into a separate YAML file.

In this setup, all keys except for the jobs key are placed in the base configuration file, while all jobs go into their own file under the jobs key. This makes it easier to template jobs:

python3 -m nautiluslauncher -c cfg.yaml -b base.yaml

This can also be combined with the -j flag for more flexibility:

python3 -m nautiluslauncher -c cfg.yaml -b base.yaml -j my-job-1
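One way to picture the split between the two files is as two dictionaries that are combined before launch. The sketch below is an illustration only: `combine_configs` is a hypothetical function, and the launcher's actual behavior when the jobs file contains keys other than jobs is not described here.

```python
def combine_configs(base, job_cfg):
    """Take every key except 'jobs' from base, and 'jobs' from job_cfg."""
    combined = {key: value for key, value in base.items() if key != "jobs"}
    combined["jobs"] = list(job_cfg.get("jobs", []))
    return combined


# Contents of a base file: everything except the jobs key.
base = {"namespace": "mynamespace", "defaults": {"image": "python:3.8"}}
# Contents of a jobs file: only the jobs key.
job_cfg = {"jobs": [{"job_name": "my-job-1", "command": "python -V"}]}

cfg = combine_configs(base, job_cfg)
print(sorted(cfg))  # prints ['defaults', 'jobs', 'namespace']
```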

Configuration

Configuration is done via a YAML file. There are sample YAML configs in the configs directory of this repository.

Required Keys

There are two required keys and one optional key. The two required keys are:

namespace
jobs

The namespace key is the namespace on the Nautilus cluster you'd like to use. The jobs key is a list of dictionaries, each of which defines the parameters for one job. Repeating every parameter for every job gets tedious, so the optional third key, defaults, lets you set shared values once.
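A quick check for the two required keys can catch a malformed config before anything is submitted. This is a minimal sketch on a plain dictionary; `missing_required_keys` is a hypothetical helper and does not reflect how the library itself validates configs.

```python
REQUIRED_KEYS = ("namespace", "jobs")


def missing_required_keys(cfg):
    """Return the required top-level keys that are absent from cfg."""
    return [key for key in REQUIRED_KEYS if key not in cfg]


good = {"namespace": "mynamespace", "jobs": [], "defaults": {}}
bad = {"defaults": {"image": "python:3.8"}}

print(missing_required_keys(good))  # prints []
print(missing_required_keys(bad))   # prints ['namespace', 'jobs']
```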

Describing a Job

Key	Description	Default	Type
job_name	The name of the job	required	str
image	The container image to use	required	str
command	The command to run when the job starts	required	str/list[str]
workingDir	Working directory when the job starts	None	str
env	The environment variables	None	dict[str, str]
volumes	The volumes to mount	None	dict[str, str]
ports	The container ports to expose	None	list[int]
tolerations	The tolerations of the pod	None	list[str]
region	The region the job should run in	None	str
hostname	The hostname of the node the job should run on	None	str
min_cpu	Minimum number of CPU cores	2	int
max_cpu	Maximum number of CPU cores	4	int
min_ram	Minimum RAM in GB	4	int
max_ram	Maximum RAM in GB	8	int
gpu	Number of GPUs	0	int
gpu_type	The type of GPUs required	None	str
shm	When true, add shared memory mount	false	bool
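The table can also be read as a record type with three required fields and a set of defaults. The dataclass below simply mirrors the table for illustration; `JobSpec` is a hypothetical name and is not the library's Job class, whose constructor may differ.

```python
from dataclasses import dataclass
from typing import Dict, List, Optional, Union


@dataclass
class JobSpec:
    """Illustrative mirror of the job keys table (not the library's Job class)."""

    # Required keys
    job_name: str
    image: str
    command: Union[str, List[str]]
    # Optional keys with the defaults listed in the table
    workingDir: Optional[str] = None
    env: Optional[Dict[str, str]] = None
    volumes: Optional[Dict[str, str]] = None
    ports: Optional[List[int]] = None
    tolerations: Optional[List[str]] = None
    region: Optional[str] = None
    hostname: Optional[str] = None
    min_cpu: int = 2
    max_cpu: int = 4
    min_ram: int = 4  # GB
    max_ram: int = 8  # GB
    gpu: int = 0
    gpu_type: Optional[str] = None
    shm: bool = False


spec = JobSpec(job_name="myjob1", image="python:3.8", command=["python", "-V"])
print(spec.min_cpu, spec.max_cpu, spec.min_ram, spec.max_ram, spec.gpu, spec.shm)
# prints 2 4 4 8 0 False
```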

Using Defaults

The defaults key is the starting point for every job in your config. Each job begins with the values in defaults, and whatever is placed in the job itself is then added to or overrides those values. When a key is present in both defaults and the job, the job takes precedence.

Here is a simple example:

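defaults:
    image: python:3.8
    workingDir: /mydir

jobs:
-
    # Overrides image; inherits workingDir from defaults
    image: python:3.7
-
    # Overrides workingDir; inherits image from defaults
    workingDir: /mydir2
-
    # Overrides both keys
    image: python:3.7
    workingDir: /mydir2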
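The precedence rule can be sketched as a dictionary merge in which the job's keys are applied on top of the defaults. The example below uses the image key from the table in "Describing a Job". It assumes a shallow merge; whether the launcher merges nested values such as env key by key is not stated here, and `apply_defaults` is a hypothetical helper.

```python
def apply_defaults(defaults, job):
    """Start from defaults, then let the job's own keys add to or override them."""
    return {**defaults, **job}


defaults = {"image": "python:3.8", "workingDir": "/mydir"}
jobs = [
    {"image": "python:3.7"},
    {"workingDir": "/mydir2"},
    {"image": "python:3.7", "workingDir": "/mydir2"},
]

resolved = [apply_defaults(defaults, job) for job in jobs]
for job in resolved:
    print(job)
# {'image': 'python:3.7', 'workingDir': '/mydir'}
# {'image': 'python:3.8', 'workingDir': '/mydir2'}
# {'image': 'python:3.7', 'workingDir': '/mydir2'}
```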

Library Usage

To launch jobs from your own application or library, import the Job and NautilusAutomationClient classes into your scripts:

from nautiluslauncher import Job, NautilusAutomationClient

# Create a client for your namespace on the cluster
client = NautilusAutomationClient("mynamespace")
images = ["python:3.6", "python:3.7", "python:3.8"]
for i, img in enumerate(images):
    # Launch one job per image; pass the image string (img), not the index (i)
    j = Job(job_name=f"test_python_{i}", image=img, command=["python", "-c", "print('hello world')"])
    client.create_job(j)

If you would rather configure your jobs with a dictionary:

from nautiluslauncher import NautilusJobLauncher

my_jobs = {
    "namespace": "mynamespace",
    "jobs": [
        {"image": "python:3.6", "command": ["python", "-c", "print('hello world')"], "job_name": "myjob1"},
        {"image": "python:3.7", "command": ["python", "-c", "print('hello world')"], "job_name": "myjob2"},
        {"image": "python:3.8", "command": ["python", "-c", "print('hello world')"], "job_name": "myjob3"},
    ]
}

launcher = NautilusJobLauncher(my_jobs)
launcher.run()

Or from a YAML file:

from nautiluslauncher import NautilusJobLauncher

my_file = "myCfg.yaml"

launcher = NautilusJobLauncher.from_config(my_file)
launcher.run()
