Fix munge key race condition and update docs
- Add sleep to prevent munge.key race condition - Warn about CPU override conflicts in README - Update README with CPU config details for minimal setup - Correct `less` command examples for prime number output - Force update slurm/cgroups configs in the provision script
This commit is contained in:
		
							
								
								
									
										11
									
								
								README.md
									
									
									
									
									
								
							
							
						
						
									
										11
									
								
								README.md
									
									
									
									
									
								
							| @@ -63,7 +63,7 @@ By default, each node is allocated: | ||||
| 4. View the resulting prime numbers found, check `ls` for exact filenames | ||||
|  | ||||
|        less slurm-1_0.out | ||||
|        less slurm-2_1.out | ||||
|        less slurm-1_1.out | ||||
|  | ||||
| ### Configuration Tool | ||||
|  | ||||
| @@ -96,6 +96,10 @@ ignored by .gitignore. Be cautious when using this command as it will delete | ||||
| files that are not tracked by Git. Use the `-n` flag to dry-run first. | ||||
|  | ||||
| ## Global Overrides | ||||
|  | ||||
| **WARNING:** Always update `slurm.conf` to match any CPU overrides to prevent | ||||
| resource allocation conflicts. | ||||
|  | ||||
| If you wish to override the default settings on a global level, | ||||
| you can do so by creating a `.settings.yml` file based on the provided | ||||
| `example-.settings.yml` file: | ||||
| @@ -125,6 +129,11 @@ file without modifications. This results in a cluster configuration using only | ||||
| 1 vCPU and 1 GB RAM per node (totaling 4 threads/cores and 4 GB RAM), allowing | ||||
| basic operation on modest hardware. | ||||
|  | ||||
| When using this minimal setup with 1 vCPU, you'll need to update the `slurm.conf` file. | ||||
| Apply the following change to the default `slurm.conf`: | ||||
|  | ||||
| 	sed -i 's/CPUs=2/CPUs=1/g' slurm.conf | ||||
|  | ||||
| ### Slurm Settings Overrides | ||||
| - `SLURM_NODES` | ||||
|   - Default: `4` | ||||
|   | ||||
							
								
								
									
										11
									
								
								provision.sh
									
									
									
									
									
								
							
							
						
						
									
										11
									
								
								provision.sh
									
									
									
									
									
								
							| @@ -46,12 +46,12 @@ fi | ||||
| dpkg -s slurm-client &>/dev/null || apt-get install -y slurm-client | ||||
|  | ||||
| # Create directories for Slurm | ||||
| mkdir -p /var/spool/slurm /var/log/slurm /etc/slurm | ||||
| chown slurm:slurm /var/spool/slurm /var/log/slurm /etc/slurm | ||||
| mkdir -p /var/spool/slurm /etc/slurm | ||||
| chown slurm:slurm /var/spool/slurm /etc/slurm | ||||
|  | ||||
| # Copy slurm.conf and cgroup.conf | ||||
| cp -u /vagrant/slurm.conf /etc/slurm/slurm.conf | ||||
| cp -u /vagrant/cgroup.conf /etc/slurm/cgroup.conf | ||||
| cp -f /vagrant/slurm.conf /etc/slurm/slurm.conf | ||||
| cp -f /vagrant/cgroup.conf /etc/slurm/cgroup.conf | ||||
| chown slurm:slurm /etc/slurm/slurm.conf /etc/slurm/cgroup.conf | ||||
| chmod 644 /etc/slurm/slurm.conf /etc/slurm/cgroup.conf | ||||
|  | ||||
| @@ -107,7 +107,8 @@ else | ||||
|     sleep 10 | ||||
|   done | ||||
|  | ||||
|   # Enable/start/test munge service | ||||
|   # Enable/start munge service | ||||
|   sleep 3 | ||||
|   cp -f /vagrant/munge.key /etc/munge/munge.key | ||||
|   chown munge:munge /etc/munge/munge.key | ||||
|   chmod 400 /etc/munge/munge.key | ||||
|   | ||||
		Reference in New Issue
	
	Block a user