You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Improved the documentation to make it clearer and more readable. Mostly,
I added a bunch of commas because, in general, commas make the world a
better, safer place. Isn't that right, Grandma?
Copy file name to clipboardexpand all lines: background/ncn_boot_workflow.md
+31-2
Original file line number
Diff line number
Diff line change
@@ -29,15 +29,44 @@ PXE. The method to use will vary depending on the system environment.
29
29
ncn# cat /proc/cmdline
30
30
```
31
31
32
-
If it starts with `kernel` then the node network booted. If it starts with `BOOT_IMAGE=(` then it disk booted.
32
+
If it starts with `kernel`, then the node network booted. If it starts with `BOOT_IMAGE=(`, then it disk booted.
33
33
34
34
1. Check output from `efibootmgr`.
35
35
36
36
```bash
37
37
ncn# efibootmgr
38
38
```
39
39
40
-
The `BootCurrent` value should be matched to the list beneath to see if it lines up with a networking option or a `cray sd*)` option for disk boots.
40
+
The `BootCurrent` value should be matched to the list beneath it to see if it lines up with a networking option or a `cray sd*)` option for disk boots.
Copy file name to clipboardexpand all lines: install/bootstrap_livecd_remote_iso.md
+15-15
Original file line number
Diff line number
Diff line change
@@ -30,8 +30,8 @@ The LiveCD Remote ISO has known compatibility issues for nodes from certain vend
30
30
### 2. Attaching and Booting the LiveCD with the BMC
31
31
32
32
> **Warning:** If this is a re-installation on a system that still has a USB device from a prior
33
-
> installation then that USB device must be wiped before continuing. Failing to wipe the USB, if present, may result in confusion.
34
-
> If the USB is booted still then it can wipe itself using the [basic wipe from Wipe NCN Disks for Reinstallation](wipe_ncn_disks_for_reinstallation.md#basic-wipe). If it is not booted, please do so and wipe it _or_ disable the USB ports in the BIOS (not available for all vendors).
33
+
> installation, then that USB device must be wiped before continuing. Failing to wipe the USB, if present, may result in confusion.
34
+
> If the USB is still booted, then it can wipe itself using the [basic wipe from Wipe NCN Disks for Reinstallation](wipe_ncn_disks_for_reinstallation.md#basic-wipe). If it is not booted, please do so and wipe it _or_ disable the USB ports in the BIOS (not available for all vendors).
35
35
36
36
Obtain and attach the LiveCD cray-pre-install-toolkit ISO file to the BMC. Depending on the vendor of the node,
37
37
the instructions for attaching to the BMC will differ.
@@ -109,7 +109,7 @@ On first login (over SSH or at local console) the LiveCD will prompt the adminis
109
109
1. Setup Variables.
110
110
111
111
```bash
112
-
# The IPv4 Address for the nodes external interface(s); this will be provided if not already by the site's network administrator or network authority.
112
+
# The IPv4 Address for the nodes external interface(s); this will be provided, if not already by the site's network administrator or network authority.
113
113
pit# site_ip=172.30.XXX.YYY/20
114
114
pit# site_gw=172.30.48.1
115
115
pit# site_dns=172.30.84.40
@@ -126,7 +126,7 @@ On first login (over SSH or at local console) the LiveCD will prompt the adminis
1. (recommended) print `lan0`, and if it has an IP address thenexit console and log in again using SSH. The
129
+
1. (recommended) print `lan0`, and if it has an IP address,thenexit console and log in again using SSH. The
130
130
SSH connection will provide larger window sizes and better bufferhandling (screen wrapping).
131
131
132
132
```bash
@@ -237,7 +237,7 @@ On first login (over SSH or at local console) the LiveCD will prompt the adminis
237
237
238
238
1. Download and install/upgrade the workaround and documentation RPMs.
239
239
240
-
If this machine does not have direct Internet access these RPMs will need to be externally downloaded and then copied to the system.
240
+
If this machine does not have direct Internet access, these RPMs will need to be externally downloaded and then copied to the system.
241
241
242
242
**Important:** In an earlier step, the CSM release plus any patches, workarounds, or hotfixes
243
243
were downloaded to a system using the instructions in [Check for Latest Workarounds and Documentation Updates](../update_product_stream/index.md#workarounds). Use that set of RPMs rather than downloading again.
@@ -265,14 +265,14 @@ On first login (over SSH or at local console) the LiveCD will prompt the adminis
265
265
- `ncn_metadata.csv`
266
266
- `switch_metadata.csv`
267
267
- `system_config.yaml` (see below)
268
-
269
-
> The optional `application_node_config.yaml` file may be provided forfurther defining of settings relating to how application nodes will appearin HSM forroles and subroles. See [Create Application Node YAML](create_application_node_config_yaml.md)
270
-
268
+
269
+
> The optional `application_node_config.yaml` file may be provided to further assign application nodes to roles and subrolesin the HSM. See [Create Application Node YAML](create_application_node_config_yaml.md)
270
+
271
271
> The optional `cabinets.yaml` file allows cabinet naming and numbering as well as some VLAN overrides. See [Create Cabinets YAML](create_cabinets_yaml.md).
272
-
273
-
> The `system_config.yaml` is required fora reinstall, because it was created during a previous install. For a first time install, the informationin it can be provided as command line arguments to `csi config init`.
274
-
275
-
272
+
273
+
> The `system_config.yaml` is required fora reinstall because it was created during a previous install. For a first time install, the informationin it can be provided as command line arguments to `csi config init`.
274
+
275
+
276
276
1. Change into the preparation directory.
277
277
278
278
```bash
@@ -281,9 +281,9 @@ On first login (over SSH or at local console) the LiveCD will prompt the adminis
281
281
```
282
282
283
283
After gathering the files into this working directory, generate your configurations.
284
-
285
-
1. If doing a reinstall and have the `system_config.yaml` parameter file avail available, then generate the system configuration reusing this parameter file (see [avoiding parameters](../background/cray_site_init_files.md#save-file--avoiding-parameters)).
286
-
284
+
285
+
1. If doing a reinstall and have the `system_config.yaml` parameter file available, then generate the system configuration reusing this parameter file (see [avoiding parameters](../background/cray_site_init_files.md#save-file--avoiding-parameters)).
286
+
287
287
If not doing a reinstall of Shasta software, then the `system_config.yaml` file will not be available, so skip the rest of this step.
288
288
289
289
1. Check forthe configuration files. The needed files should bein the current directory.
Copy file name to clipboardexpand all lines: install/collecting_ncn_mac_addresses.md
+10-9
Original file line number
Diff line number
Diff line change
@@ -5,8 +5,8 @@ you will have the MAC addresses needed for the Bootstrap MAC, Bond0 MAC0, and Bo
5
5
6
6
The Bootstrap MAC address will be used for identification of this node during the early part of the PXE boot process before the bonded interface can be established.
7
7
The Bond0 MAC0 and Bond0 MAC1 are the MAC addresses for the physical interfaces that your node will use for the various VLANs.
8
-
The Bond0 MAC0 and Bond0 MAC1 should be on the different network cards to establish redundancy for a failed network card.
9
-
On the other hand, if the node has only a single network card, then MAC1 and MAC0 will still produce a valid configuration if they do reside on the same physical card.
8
+
The Bond0 MAC0 and Bond0 MAC1 should be on different network cards to establish redundancy in case either network card fails.
9
+
On the other hand, if the node only has a single network card, then MAC1 and MAC0 will still produce a valid configuration if they reside on the same physical card.
10
10
11
11
#### Sections
12
12
@@ -18,13 +18,13 @@ On the other hand, if the node has only a single network card, then MAC1 and MAC
18
18
19
19
The easy way to do this leverages the NIC-dump provided by the metal-ipxe package. This page will walk-through
20
20
booting NCNs and collecting their MACs from the conman console logs.
21
-
> The alternative is to use serial cables (or SSH) to collect the MACs from the switch ARP tables, this can become exponentially difficult for large systems.
21
+
> The alternative is to use serial cables (or SSH) to collect the MACs from the switch ARP tables, which can become exponentially difficult for large systems.
22
22
> If this is the only way, please proceed to the bottom of this page.
23
23
24
24
<aname="procedure-ipxe-consoles"></a>
25
25
## Procedure: iPXE Consoles
26
26
27
-
This procedure is faster for those with the LiveCD (CRAY Pre-Install Toolkit) it can be used to quickly
27
+
This procedure is faster for those with the LiveCD (CRAY Pre-Install Toolkit). It can be used to quickly
28
28
boot-check nodes to dump network device information without an operating system. This works by accessing the PCI Configuration Space.
29
29
30
30
<aname="requirements"></a>
@@ -72,7 +72,7 @@ For help with either of those, see [LiveCD Setup](bootstrap_livecd_remote_iso.md
4. Now waitfor the nodes to netboot. You can follow them with `conman -j ncn-*id*-mgmt` (use `conman -q` to see ). This takes less than 3 minutes, speed depends on how quickly your nodes POST.
75
+
4. Now waitfor the nodes to netboot. You can follow them with `conman -j ncn-*id*-mgmt` (use `conman -q` to see the list of nodes). This takes less than 3 minutes, speed depends on how quickly your nodes POST.
76
76
5. Print off what has been found in the console logs, this snippet will omit duplicates from multiple boot attempts:
77
77
```bash
78
78
pit# forfilein /var/log/conman/*;do
@@ -101,7 +101,7 @@ For help with either of those, see [LiveCD Setup](bootstrap_livecd_remote_iso.md
7. Examine the output from `grep` to identify the MAC address that make up Bond0 for each management NCN, use the lowest value MAC address per PCIe card.
104
+
7. Examine the output from `grep` to identify the MAC address that make up Bond0 for each management NCN. Use the lowest value MAC address per PCIe card.
105
105
106
106
> example: 1 PCIe card with 2 ports for a total of 2 ports per node.\
107
107
@@ -127,7 +127,7 @@ For help with either of those, see [LiveCD Setup](bootstrap_livecd_remote_iso.md
127
127
-----
128
128
```
129
129
130
-
The above output identified MAC0 and MAC1 of the bond as 94:40:c9:5f:b5:df and 14:02:ec:da:b9:99 respectively.
130
+
The above output identified MAC0 and MAC1 of the bond as 94:40:c9:5f:b5:df and 14:02:ec:da:b9:98 respectively.
131
131
132
132
8. Collect the NCN MAC address for the PIT node. This information will be used to populate the MAC addresses for ncn-m001.
133
133
@@ -140,7 +140,7 @@ For help with either of those, see [LiveCD Setup](bootstrap_livecd_remote_iso.md
140
140
9. Update `ncn_metadata.csv` with the collected MAC addresses for Bond0 from all of the management NCNs.
141
141
> Tip: Mind the index (3, 2, 1.... ; not 1, 2, 3)
142
142
143
-
For each NCN update the corresponding row in`ncn_metadata` with the values for Bond0 MAC0 and Bond0 MAC1. The Bootstrap MAC should have the same value as the Bond0 MAC0.
143
+
For each NCN, update the corresponding row in`ncn_metadata` with the values for Bond0 MAC0 and Bond0 MAC1. The Bootstrap MAC should have the same value as the Bond0 MAC0.
If you have an incorrect `ncn_metadata.csv` file, you will be unable to deploy the NCNs. This section details a recovery procedure incase that happens.
181
181
182
-
1. Remove the incorrectly generated configurations. Before deleting the incorrectly generated configurations, consider making a backup of them. In case they need to be examined at a later time.
182
+
1. Remove the incorrectly generated configurations. Before deleting the incorrectly generated configurations, consider making a backup of them, in case, they need to be examined at a later time.
183
+
183
184
184
185
> **`WARNING`** Ensure that the `SYSTEM_NAME` environment variable is correctly set. If `SYSTEM_NAME` is
185
186
> not set the command below could potentially remove the entire prep directory.
0 commit comments