Enhance documentation on configuration flexibility

- Describe .settings.yml usage for global override customization
- Explain nodes.rb for individual node override configuration
- Provide example files demonstrating override usage
- Include guidance for minimal resource configuration
This commit is contained in:
Kris Lamoureux 2024-08-16 23:53:11 -04:00
parent f1bd37b0d8
commit b1aee01586
Signed by: kris
GPG Key ID: 3EDA9C3441EDA925
3 changed files with 144 additions and 0 deletions

View File

@ -86,3 +86,90 @@ To clean up files placed on the host through Vagrant file sharing:
This command is useful when you want to remove all generated files and return
to a clean state. The Makefile is quite simple, so you can refer to it directly
to see exactly what's being cleaned up.
If you have included override settings that you want to remove as well, run:
git clean -fdx
This command will remove all untracked files and directories, including those
ignored by .gitignore. Be cautious when using this command as it will delete
files that are not tracked by Git. Use the `-n` flag to dry-run first.
## Global Overrides
If you wish to override the default settings on a global level,
you can do so by creating a `.settings.yml` file based on the provided
`example-.settings.yml` file:
cp example-.settings.yml .settings.yml
Once you have copied the `example-.settings.yml` to `.settings.yml`, you can
edit it to override the default settings. Below are the available settings:
### Vagrant Settings Overrides
- `VAGRANT_BOX`
- Default: `debian/bookworm64`
- Tested most around Debian Stable x86_64 (currently Bookworm)
- `VAGRANT_CPU`
- Default: `2`
- Two threads or cores per node, depending on CPU architecture
- `VAGRANT_MEM`
- Default: `2048`
- Two GB of RAM per node
- `SSH_FORWARD`
- Default: `false`
- Enable this if you need to forward SSH agents to the Vagrant machines
### Minimal Resource Setup
Resource-conscious users can copy and use the provided `example-.settings.yml`
file without modifications. This results in a cluster configuration using only
1 vCPU and 1 GB RAM per node (totaling 4 threads/cores and 4 GB RAM), allowing
basic operation on modest hardware.
### Slurm Settings Overrides
- `SLURM_NODES`
- Default: `4`
- The _total_ number of nodes in your Slurm cluster
- `JOIN_TIMEOUT`
- Default: `120`
- Timeout in seconds for nodes to obtain the shared munge.key
## Per-Node Overrides
The naming convention for nodes follows a specific pattern: `nodeX`, where `X`
is a number corresponding to the node's position within the cluster. This
convention is strictly adhered to due to the iteration logic within the
`Vagrantfile`, which utilizes a loop iterating over an array range defined by
the number of slurm nodes (`Array(1..SLURM_NODES)`). Each iteration of the loop
corresponds to a node, and the loop counter is in the node name (`nodeX`).
The overrides, if specified in `nodes.rb`, take the highest precedence,
followed by the overrides in `.settings.yml`, and lastly, the defaults hard
coded in the `Vagrantfile` itself. This hierarchy allows for a flexible
configuration where global overrides can be specified in `.settings.yml`, and
more granular, per-node overrides can be defined in `nodes.rb`. If a particular
setting is not overridden in either `.settings.yml` or `nodes.rb`, the default
value from the `Vagrantfile` is used.
If you wish to override the default settings on a per-node level, you can do so
by creating a `nodes.rb` file based on the provided `example-nodes.rb` file:
cp example-nodes.rb nodes.rb
Once you have copied the `example-nodes.rb` to `nodes.rb`, you can edit it to
override the default settings. Below are the available settings available
per-node:
- `BOX`
- Default: `debian/bookworm64` (or as overridden in `.settings.yml`)
- Vagrant box or image to be used for the node.
- `CPU`
- Default: `2` (or as overridden in `.settings.yml`)
- Defines the number of CPU cores or threads (depending on architecture).
- `MEM`
- Default: `2048` (2 GB) (or as overridden in `.settings.yml`)
- Specifies the amount of memory (in MB) allocated to the node.
- `SSH`
- Default: `false` (or as overridden in `.settings.yml`)
- Enable this if you need to forward SSH agents to the Vagrant machine
All settings are optional, and as many or as few options can be overridden on
any arbitrary node.

16
example-.settings.yml Normal file
View File

@ -0,0 +1,16 @@
########################
### Example settings ###
########################
# This configuration as-is will take 4 threads/cores and 4 GB of RAM total.
# Set per-node overrides in nodes.rb if your setup requires it
# Vagrant default global overrides
#VAGRANT_BOX: debian/bookworm64
VAGRANT_CPU: 1
VAGRANT_MEM: 1024
#SSH_FORWARD: true
# Slurm default overrides
#SLURM_NODES: 4
#JOIN_TIMEOUT: 120

41
example-nodes.rb Normal file
View File

@ -0,0 +1,41 @@
#########################
### Example overrides ###
#########################
# This configuration as-is will take 10 threads/cores and 10 GB of RAM assuming
# that .settings.yml isn't overriding the defaults. Make sure you have enough
# resources before running something like this.
# Set SLURM_NODES in .settings and update the slurm.conf if you run more/less than 4 nodes
# NOTE: The primes.sh script was only designed to run an array across two nodes.
NODES = {
# Head node
'node1' => {
#'BOX' => 'debian/bookworm64',
'CPU' => 1,
'MEM' => 1024,
#'SSH' => true
},
# Submit node
'node2' => {
#'BOX' => 'debian/bookworm64',
'CPU' => 1,
'MEM' => 1024,
#'SSH' => true
},
# Compute node3
'node3' => {
#'BOX' => 'debian/bookworm64',
'CPU' => 4,
'MEM' => 4096,
#'SSH' => true
}
# Compute node4
'node4' => {
#'BOX' => 'debian/bookworm64',
'CPU' => 4,
'MEM' => 4096,
#'SSH' => true
}
}