To support long running patch-able systems that don't require a
reinstall, the entire root disk will be allocated to the cgts-vg volume
group as part of installation.
This update simply removes the use of MINIMUM_PLATFORM_PV_SIZE and
ensures that the 'platform_pv' uses all available space.
NOTE: A followup commit will be provided to clean up the large, small,
tiny disk references and provide an accurate log checking for and
displaying minimal disk size based on default logical volume
sizes.
Test Plan:
PASS - Install AIO-SX, bootstrap, unlock
PASS - Install 2+2+2, bootstrap, unlock
Change-Id: I3a50f2305b781de1cf9b80c5aed62b03bebc4790
Story: 2010444
Task: 46981
Signed-off-by: Robert Church <robert.church@windriver.com>
From the e2fsck man pages, the exit codes of 0, 1, 2 should
not be treated as failures. We should never see exit code 2 though,
since it only occurs when e2fsck is run against a mounted filesystem.
The solution is to extend the check to only fail on exit code > 1.
Test Plan
PASS: Verify e2fsck exit code handling during subcloud install
with resized partition
Closes-Bug: 1998611
Change-Id: Ie22fd77e3d2e2d631ba467b818bdc77c77f0d8b8
Signed-off-by: Kyle MacLeod <kyle.macleod@windriver.com>
Now that the root filesystem is based on an LVM logical volume, discover
the root disk by searching for the boot partition.
Changes include:
- remove detection of rootfs_part/rootfs and adjust rootfs related
references with boot_disk.
- run bashate on the script and resolve indentation and syntax related
errors. Leave long-line errors alone for improved readability.
Test Plan:
PASS - run 'wipedisk', answer prompts, and ensure all partitions are
cleaned up except for the platform backup partition
PASS - run 'wipedisk --include-backup', answer prompts, and ensure all
partitions are cleaned up
PASS - run 'wipedisk --include-backup --force' and ensure all partitions
are cleaned up
Change-Id: I036ce745353b6a26bc2615ffc6e3b8955b4dd1ec
Closes-Bug: #1998204
Signed-off-by: Robert Church <robert.church@windriver.com>
In cases when wipedisk isn't run or isn't working correctly,
pre-existing volume groups, physical volumes, and logical volumes will
be present on the root disk. Depending on the sizes and layout of the
previous install along with partial or aborted cleanup activities, this
may lead [unknown] PVs with duplicate volume group names.
Adjust the cleanup logic to:
- Discover existing volume groups by UUID so that duplicate volume
groups (i.e two occurrences of cgts-vg) can be handled individually.
- Ignore [unknown] physical volumes in a volume group as they cannnot be
removed. Cleaning up existing physical volumes across all volume
groups will resolve any [unknown] physical volumes.
In addition, unify if/then for/do syntax in the %pre-part hook
Test Plan:
PASS - create a scenario with multiple partitions along with a
nova-local and cgts-vg volume group that result in an [unknown]
physical volume and a duplicate cgts-vg. Do not wipe the disks
and install an ISO with the above changes. Observe proper cleanup
and install.
PASS - Perform consecutive installs without wipedisk and observe proper
cleanup and install
Change-Id: Idf845cf00ca3c009d72dedef0805a77d94fa3d97
Partial-Bug: #1998204
Signed-off-by: Robert Church <robert.church@windriver.com>
In the case when the root disk partition table is wiped but individual
partitions are not wiped correctly, this will leave previous physical
volume metadata intact on the disk.
When a new LVM partition is created and assigned as a newly created
physical volume the old LVM metadata on the disk partition will prevent
the cgts-vg volume group from being created.
This update will wipe all the magic strings present in the new physical
volume partition established by the kickstart by executing 'wipefs -a'
prior to creating the cgts-vg.
Test Plan:
PASS - Successfully install an ISO with this change on a system that did
not cleanup the LVM metatadata from a previous install. Log in to
the installed system and confirm that the cgts-vg is properly
configured.
Change-Id: I63f4235a27cb40a4283f0f4c34f63564a4f18cdd
Partial-Bug: #1998204
Signed-off-by: Robert Church <robert.church@windriver.com>
The sw_version was uninitialized for workers.
This led to a 404 error when doing ostree pull
during a patch installation on worker nodes.
The problem was introduced by
https://review.opendev.org/c/starlingx/metal/+/864930
Test Plan:
Build / Install /Deploy Duplex env with a worker
Successfully apply a patch on the worker
Closes-Bug: 1997130
Signed-off-by: Al Bailey <al.bailey@windriver.com>
Change-Id: If40466b0ac9ffe0ce1ae068e948682eafa3703e5
When we create the prestage iso with an external script,
ks-setup.cfg, we may not provide the rootfs_device or
boot_device parameter. This is a valid scenario where these
parameters are defined in ks-setup.cfg. An installation failure
is observed in this case.
The cause of the failure is that the prestage code is handled in
a pre-part hook. This commit moves it to a ks-early hook.
In this commit, a provision for the execution of a custom script
named ks-addon.cfg is also made. This script is a bash script
that must execute in the last post hook.
Test Plan:
PASS: Verify that the installation succeeds when the rootfs
and boot device parameters are only specified via
the ks-setup.cfg.
PASS: Verify that the external script, ks-addon.cfg, is executed
after the install and configurations are done.
PASS: Verify that the logs from the execution of ks-addon.cfg
are present in kickstart.log.
Closes-bug: 1997305
Signed-off-by: Shrikumar Sharma <shrikumar.sharma@windriver.com>
Change-Id: Ica1735aef3ab457cf0609ebee6aac45671e97987
Move the /var and /root partition based filesystems into the cgts-vg so
that they can be resized as required at runtime in the future.
This change includes:
- Update pxeboot network personality files to add installer command line
parameters inst_ostree_root andinst_ostree_var to allow specifying the
root and var devices to be created and populated by the installer.
- Update the StarlingX grub.cfg file to add a new single option booting
that drops the rollback boot option (not working) and adds grub
options ostree_root, rd.lvm.lv, and ostree_var to enable mounting the
root and var filesystems at boot time.
- Update the kickstart/miniboot config files to:
- Remove support for lat/lat-disk partition size variables and
refactor the hooks to use specific PART_SZ_* and LV_SZ_* variables.
- Increase /boot partition size to 2GB from 500M to provide some
additional space for future patching scenarios that may require
staging multiple ostree deployments prior to reboot and cleanup.
- Create logical volumes for root and var set to the current 20GB
values.
- Adjust the minimum physical volume size used on AIO and worker
personalities to include the new root and var logical volumes.
- Adjust normal install disk thresholds to 219GB for AIOs and 120GB
for workers.
- Fix mkfs hook to ensure that the aio vs. std sizes are correctly
reflected on hook execution.
Test Plan:
- PASS: BIOS AIO-SX
- PASS: UEFI AIO-SX
- PASS: BIOS 2+2+2
- SKIP: secure boot, not ready for Stx8.0
- PASS: AIO-SX upgrade
- PASS: AIO-DX upgrade
- PASS: DC subcloud install (virtual test)
Change-Id: I5f77266336b53d178eaae0e6fbb556bbea6400e8
Depends-On: https://review.opendev.org/c/starlingx/integ/+/865076
Story: 2010444
Task: 46881
Signed-off-by: Robert Church <robert.church@windriver.com>
Modify the stx grub template file to remove the
normal / rollback image switching/toggle algorithm.
Also remove the temporary sed based method in the
kickstart code.
Effectively, this moved the previous change introduced by
https://review.opendev.org/c/starlingx/metal/+/861461
... to a grub.cfg 'code block remove' rather
than 'on the fly sed modification' by the kickstart.
Test Plan:
PASS: Verify build and install
PASS: Verify on target code removed from /boot/efi/EFI/BOOT/grub.cfg
PASS: Verify normal image is selected after 10 back to back reboots
Story: 2009968
Task: 46886
Signed-off-by: Eric MacDonald <eric.macdonald@windriver.com>
Change-Id: Id8799dff6eef7ef8aa6f66180d6ed971c005618d
This update introduces a new script that can be called
by patching to refresh the kernel, initrd and other
system node install feed staged files in support of
kernel patching.
This update also introduces and enables new service file
that triggers the creation of the pxeboot feeds or refreshes
the pxeboot feeds if what they contain does not match the
content in /boot.
Both new script and service files are added to the
pxe-network-installer package so they get installed
into the filesystem properly.
Lastly, there are 2 kickstart changes implemented.
1. The kickstart code that copied the kickstart files from
/var/www/pages/feed/rel-xx.xx/
to
/var/www/pages/feed/rel-xx.xx/kickstart
is removed in favor of the pxe-network-installer package
doing that automatically.
2. The kickstart is modified to remove the previous pxeboot
feed fetch and creation function.
One exception to this is the efi.img file, its fetch remains.
Note the efi image is currenly not included in the /boot dir.
Test Plan:
PASS: Verify Debian build and AIO DX install (cd and pxe installs)
PASS: Verify Debian Standard 2+1 DX system install
PASS: In above cases verify end-to-end handling of the following
test case staging.
PASS: Verify pxeboot feed staging on subcloud controller-0 install
PASS: Verify pxeboot feed file positioning in
- /var/pxeboot/rel-xx.xx (kernel and initrd images)
- /var/www/pages/feed/rel-xx.xx/pxeboot (kernel/initrd images)
- /var/www/pages/feed/rel-xx.xx/pxeboot/EFI/BOOT (other files)
- /var/pxeboot and /var/www/pages/feed/rel-xx.xx (efi.img)
PASS: Verify rsync bypass for the above cases when the files match
- complete and partial cases
PASS: Verify staging when the stage dirs are missing
- complete and partial cases
PASS: Verify staging when stage files mismatch
- complete and partial cases
PASS: Verify service enable on controllers for AIO and STD configs
PASS: Verify kickstart file position change
PASS: Verify shellcheck static analysis
PASS: Verify pxeboot_feed.sh script error handling
PASS: Verify pxeboot_feed.sh script logging
Story: 2009968
Task: 46789
Signed-off-by: Eric MacDonald <eric.macdonald@windriver.com>
Change-Id: Ic98b2686c417103749cb777adb28ac73ac1d397c
The entry for the ostree remote in computes is using
pxecontroller. This is an ipv4 address, and therefore
will not be accessible on an unlocked IPv6 Worker
(or storage).
The fix is to use 'controller' instead of 'pxecontroller'
That address exists in both ipv4 and ipv6.
Test Plan:
Debian: Successfully apply a patch to a worker node
Closes-Bug: 1997130
Signed-off-by: Al Bailey <al.bailey@windriver.com>
Change-Id: Idbc1e2728582ab3cd5c73761790cdd9fbc6d951a
To create a duplex subcloud with multiple nodes with different
personalites, pxeboot must be supported by controller-0 of the
subcloud.
Here, pxeboot is enabled on controller-0 by copying efi.img
from the mounted miniboot iso image to /var/pxeboot. This will
allow the installation of controller-1 and the computes of the
subcloud via pxeboot.
Test Plan:
PASSED: Verify that all nodes in the subcloud install, come online
and are unlocked, enabled and available by the end of the
installation process.
PASSED: Verify that multinode install completes successfully with
prestaged ostree_repo.
Depends-On: https://review.opendev.org/c/starlingx/metal/+/862619/
Story: 2010118
Task: 46754
Signed-off-by: Shrikumar Sharma <shrikumar.sharma@windriver.com>
Change-Id: I0a6789b5a86f89da5e86581ab7b3eed950361ce7
Move the packages of "metal" from stx-std.lst to debian_iso_image.inc
Test Plan:
Pass: build-pkgs -c -a
Pass: build-image
Pass: boot
Story: 2008862
Task: 46844
Signed-off-by: Yue Tao <yue.tao@windriver.com>
Change-Id: Ib284ae6f1762b0f3ca2fea242b49c1b75846286d
Removing setup for pmon files. The service sysinv-fpga-agent
doesn't exist anymore. So this change is only a cleanup.
Test plan (AIO-SX):
PASS: Build, boot, bootstrap and unlock.
Story: 2010087
Task: 45628
Depends-on: https://review.opendev.org/c/starlingx/integ/+/864133
Signed-off-by: Davi Frossard <dbarrosf@windriver.com>
Change-Id: I0e56483f49be3a64bcb8047934df5bbb13fe1490
This reverts commit d09313ff0b527efdcfd2c03bdfb950eb1432be10.
While the code here itself is functionally correct and tested,
a download in the code is dependent on a location on the
active System Controller that is overridden by a drbd2 mount
on /var/www/pages/iso.
This drbd2 mount masks the pxeboot related files which were
placed there during System Controller installation.
Reverting this change until a resolution to the drbd mount on
/var/www/pages/iso on the active System Controller is resolved.
Signed-off-by: Shrikumar Sharma <shrikumar.sharma@windriver.com>
Change-Id: Ie91fde9a09f693d133fa484782a7df28ffd29faf
Initial delivery of UEFI system node installs did not
use the signed boot loader. As a result Secure Boot
of system nodes was not supported. This update changes
that by swapping in the signed bootx64.efi boot loader
in a puppet update ; see depends on.
This update modifies to the pxe-network-installer
and kickstart to support a robust UEFI system node
install that supports Secure Boot.
The first change creates and uses an stx template
file from LAT grub file. This is done to avoid ongoing
and difficult to implement LAT grub file hack changes
from the kickstart.
This new grub.cg.stx file is packaged in the
pxe-network-installer.
The kickstarts are modified to replace the LAT grub.cfg
file with the new stx template file grub.cfg.stx. As far
as this update goes, this template file is a null change
from the LAT grub file and represents what the LAT grub
file looked like at the time the template was created.
Moving forward, further changes to the system node
install grub file will be made to this new grub.cfg.stx
template file.
The second change is to modify existing stx unprovisioned
default pxe-grub.cfg files to look for the new mac based
config file with the '.cfg' extention.
The system node install mac-based grub files are dynamically
created with no signature file. To work around that, this
update exports the LAT environment variable 'skip_check_cfg'
which instructs LAT to 'skip' the grub menu signature 'check'
for these dynamically created grub files.
An additional change is made to handle timer reload on menu
refresh if the new node remains unprovisioned after timeout.
Test Plan:
PASS: Verify the default LAT file is renamed and the new
template file positioned in its place.
PASS: Verify Debian pxe-network-installer package update
PASS: Verify Debian AIO DX UEFI Install
PASS: Verify CentOS kickstarts do not require the kickstart change
PASS: Verify build and UEFI install
- Debian
- CentOS
PASS: Verify unprovisioned grub menu reload handling with
re-occuring timeout until node is provisioned.
Regression:
PASS: Verify host-delete and host-update install and unlock
PASS: Verify host-reinstall and host-unlock
PASS: Verify lock/unlock controller-1 and controller-0
PASS: Verify lock/delete/reinstall/unlock controller-1
PASS: Verify swact to controller-1
PASS: Verify lock/delete/reinstall/unlock controller-0
Depends-On: https://review.opendev.org/c/starlingx/stx-puppet/+/863776
Story: 2009968
Task: 46701
Signed-off-by: Eric MacDonald <eric.macdonald@windriver.com>
Change-Id: Id073842ac1b29acf54c999022a9e37d4c2366031
The persistent_size variable is passed in via the dcmanager
install-values file, allowing the customer to specify
a non-default size for the platform-backup partition.
Default size is 30000 MiB.
When deviating from the default size we must handle the
following installation cases:
1. New Partition
- persistent_size must be >= default (30000 MiB)
2. Existing Partition
- persistent_size must be >= default (30000 MiB)
- persistent_size must be >= existing partition size
- if persistent_size > existing, then we must also
extend the filesystem to match the new persistent_size
Story: 2010118
Task: 46698
Test Plan:
PASS:
- Fail installation if persistent_size is < default
- New Partition:
- Installation with unspecified persistent_size
- Installation with specified persistent_size == default value
- Installation with specified persistent_size > default
- Existing Partition:
- Installation with unspecified persistent_size
- Installation with specified persistent_size == default value
- Fail installation if persistent_size is < existing partition size
- Installation with specified persistent_size > default
- Verify that existing filesystem is extended to match
new partition size
Signed-off-by: Kyle MacLeod <kyle.macleod@windriver.com>
Change-Id: I8d06ee585ad96acf1076d4b140d7c516a17f15ea
To create a duplex subcloud with multiple nodes with different
personalites, pxeboot must be supported by controller-0 of the
subcloud.
Here, pxeboot is enabled on controller-0 by downloading files
required for pxeboot from the System Controller, and copying
them to the relevant locations under /var/www/pages/feed/rel-id
and /var/pxeboot.
Test Plan:
PASS: Verify that all nodes in the subcloud install, come online
and are unlocked, enabled and available by the end of the
installation process.
PASS: Verify that multinode install completes successfully with
prestaged ostree_repo.
Depends-On: https://review.opendev.org/c/starlingx/metal/+/862619/
Story: 2010118
Task: 46754
Change-Id: I8cfda9688d41d1f6f5997ac81f9b6e21d7f3ebe6
Signed-off-by: Shrikumar Sharma <shrikumar.sharma@windriver.com>
There is a lighttpd rule that prevents a subcloud install from
accessing the system controller's feed directory for the purpose
of setting up its own feed dir in prep for its own system node
installs.
This update enhances the kickstart.cfg file to stage the
feed content for a subcloud controller-0 install.
The new subcloud feed dir is /var/www/pages/iso/rel-xx.xx
Test Plan:
PASS: Verify /var/www/pages/iso file content for
- usb install
- pxe install
- controller system node install
PASS: Verify subcloud can install controller-1 from iso feed
Regression:
PASS: Verify /var/pxeboot content for
- usb install
- pxe install
- controller system node install
PASS: Verify /var/www/pages/feed file content for
- usb install
- pxe install
- controller system node install
PASS: Verify Debian DX System Install
- usb install then controller-1 as system node install
- pxe install then controller-1 as system node install
PASS: Verify UEFI and Legacy BIOS Install
Story: 2009968
Task: 46648
Signed-off-by: Eric MacDonald <eric.macdonald@windriver.com>
Change-Id: Ia1511813c5673762ad386122cf9a1666e6392b30
This commit avoids install/bootstrap issues when the hwclock
is very far out of date with the system controller.
If the hwclock is more than approximiately 20m different than the
system controller then we set hwclock based on the system date
LAT initializes the system date based on the 'instdate' boot parameter.
The instdate boot parameter is the timestamp applied when the miniboot
bootimage.iso file is created on the system controller. It is close
enough to avoid any major out-of-sync system clock on the subcloud.
Secondary change: Optimize the interface assignment prior to ostree pull
This was mentioned during review
https://review.opendev.org/c/starlingx/metal/+/861017
Rather than employ sleeps, the /sys/class/net/${mgmt_dev}/operstate
file is used to determine when the interface has settled.
This is much more efficient than sleeps. We timeout after 60s.
Test Plan:
PASS:
- Install with subcloud in relative sync with system controller
(within seconds). No hwclock change is applied to subcloud.
- Install when subcloud hwclock is more than 20m out of sync with
system controller.
- Verify that the hwclock is updated on the subcloud before
ostree pull is initiated.
- Verify that system date is proper on the first post-miniboot
boot into the ostree installation.
- Test interface assignment wait/timeout functionality before
ostree pulls. This is done on hardware subclouds (success path)
and in sushy subclouds where failure mode testing was done
by simulating stuck inteface operstate values.
Closes-Bug: 1995643
Signed-off-by: Kyle MacLeod <kyle.macleod@windriver.com>
Change-Id: Ieddc774f962878f3c7f5886148310b87d4ffddfe
Adding retries to handle the following types of failure:
1. Create communication session failed - Failed to create session.
2. Unable to establish Redfish client connections to BMC at <ip address>
(Server not reachable, return code: 503).
3. Fail to set System Power State to On/Off.
Test Plan:
PASS: Retries work properly when session creation fails.
PASS: Retries work properly when Unable to establish Redfish client
connection to BMC.
PASS: Retries work properly when returning 500 error in the "Power Off
Host" stage.
PASS: rvmc script executed successfully without above errors.
Story: 2010144
Task: 46761
Signed-off-by: Li Zhu <li.zhu@windriver.com>
Change-Id: I6bb2e0822a51770b181181b49a86fb51d6dca18b
To save on network resources, it is required to prestage the
ostree_repo and the container images. When installation of a
subcloud is done, it is preferred that the prestaged ostree
repo and the prestaged container images are used, instead of
downloading them.
The prestaged container images are copied over to
/opt/platform-backup from the mounted media containing the
prestage iso.
Test Plan:
PASS: Verify that the prestaged container images are copied to
/opt/platform-backup/<release version>.
PASS: Verify that the ostree_repo is not pulled from the system
controller but from /opt/platform-backup for installation.
Story: 2010120
Task: 46709
Signed-off-by: Shrikumar Sharma <shrikumar.sharma@windriver.com>
Change-Id: Ie2276d65e44f51a36dfcff922afaf5a9bdd0cf89
The Debian installations are generating the same machine-id if using
the same BUILD_ID. This ID is used to generate the value of random
MACs for SRIOV's VF interfaces, since it is the same across the same
BUILD_ID the network cards are generating the exact same MAC if the
NIC is on the same pci-slot across multiple nodes
This change removes the existing files so each installation's systemd
can generate an exclusive value
Test Plan (Debian)
[PASS] install multiple nodes and verify that each one contains an
exclusive /etc/machine-id content
[PASS] reboot node to validate that machine-id does not change on
subsequent boots
Closes-Bug: 1995505
Signed-off-by: Andre Kantek <andrefernandozanella.kantek@windriver.com>
Change-Id: I702d1cc0353d0d19149fdd1ac1ec4bd16e674119
Incorporate the following commits into miniboot.cfg:
- https://review.opendev.org/c/starlingx/metal/+/857894
- https://review.opendev.org/c/starlingx/metal/+/862669
The current redfish installs are broken because the sysadmin user
is no longer properly setup for password-change required on first login.
This commit pulls in the required change from kickstart.cfg.
Test Plan:
PASS:
- Verify that the sysadmin/sysadmin password is required to change
on first login via sushy boot
- Verify unsuccessful installation in sushy emulator without this fix
- Verify successful installation in sushy emulator with this fix
- Verify the FD deletion error is no longer present
Story: 2010118
Task: 46714
Signed-off-by: Kyle MacLeod <kyle.macleod@windriver.com>
Change-Id: I1c39184c05442946b55ec375e643e94e6dc89fd6
/opt/platform-backup
To make installation more efficient, we need to prestage the
ostree_repo in /opt/platform-backup.
When we add a subcloud, the ostree_repo will not be fetched from
the System Controller, but from the prestaged ostree_repo, if it
exists.
Test Plan:
PASS: Verify that the ostree_repo is not pulled from the system
controller but from /opt/platform-backup for installation.
Story: 2010120
Task: 46691
Change-Id: I6a00b28abe96f5d254a77331fc231cd9edac7bea
Signed-off-by: Shrikumar Sharma <shrikumar.sharma@windriver.com>
Some pipe files are generated when assigning variables
and getting the process's fd, and are removed when done.
Since the pipe's fd is not the dev's fd, filter out the pipe fd.
Test Plan:
PASS: qemu iso installed successful
PASS: lab iso installed successful
Closes-Bug: 1991816
Signed-off-by: Wentao Zhang<Wentao.Zhang@windriver.com>
Change-Id: Icfdd4922dcc7e3c2e5003296c90519f8b7624d88
The interface setup post phase of the kickstart issues a
dhclient (dhcp) request for IP address. Normally this executes
fine and an ip address (lease) is acquired.
However, in a failure mode case in ipv6 mode that dhclient
request will hang there waiting until the dhcp server responds
So, if there is a network configuration error that precludes
dhclient from getting a response the kickstart and therefore
the entire installation process hangs.
This is a known issue/behavior in dhclient that is typically
worked around with the -1 option.
- https://bugzilla.redhat.com/show_bug.cgi?id=585047
- https://linux.die.net/man/8/dhclient
Rather than using the -1 option which changes the behavior
with fixed 30 second timeout, this update uses the linux 'timeout'
command with a chosen 60 second upper bound on the vlan dhclient
(dhcp) request. If the request does not complete in that time
then it is terminated in error, allowing the kickstart to proceed.
Test Plan: Change does not affect CentOS in any way.
PASS: Verify Debian build and iso install for ipv6 and ipv4.
PASS: Verify success path with and without vlan in ipv4 and ipv6
PASS: verify failure path handling in ipv6 vlan case
PASS: Verify logging
Closes-Bug: 1993342
Signed-off-by: Eric MacDonald <eric.macdonald@windriver.com>
Change-Id: I2853f52b79e0f82c0a2e645fdeb9e7b7aa4f0a9e
Add Debian support for the bootstrap_vlan install parameter
in the subcloud install values file. For use in redfish installs.
The vlan interface is configured during the initial boot,
prior to the initial ostree pull. We also configure the
interfaces in /etc/network/interfaces.d for the post-ostree
reboot.
Story: 2010118
Task: 46538
Test Plan:
PASS:
- Install and provision subcloud with bootstrap_vlan value
configured via subcloud install values - IPv6 hardware
- Verify that subcloud installs, bootstraps, and is
properly unlocked
- Tested in libvirt only: IPv4 configuration, including
bootstrap_vlan value in IPv4 format. This test
ensured that configuration is properly applied.
- Install and provision subcloud without bootstrap_vlan
(regression case)
- Failure mode: verify that bootstrap fails if invalid
vlan is configured
Signed-off-by: Kyle MacLeod <kyle.macleod@windriver.com>
Change-Id: If7e400359ad36cfb6835a8aff0f2ebd7d8e1817d
The current offline handler assumes the node is offline after
'offline_search_count' reaches 'offline_threshold' count
regardless of whether mtcAlive messages were received during
the search window.
The offline algorithm requires that no mtcAlive messages
be seen for the full offline_threshold count.
During a slow shutdown the mtcClient runs for longer than
it should and as a result can lead to maintenance seeing
the node as recovered before it should.
This update manages the offline search counter to ensure that
it only reached the count threshold after seeing no mtcAlive
messages for the full search count. Any mtcAlive message seen
during the count triggers a count reset.
This update also
1. Adjusts the reset retry cadence from 7 to 12 secs
to prevent unnecessary reboot thrash during
the current shutdown.
2. Clears the hbsClient ready event at the start of the
subfunction handler so the heartbeat soak is only
started after seeing heartbeat client ready events
that follow the main config.
Test Plan:
PASS: Debian and CentOS Build and DX install
PASS: Verify search count management
PASS: Verify issue does not occur over lock/unlock soak (100+)
- where the same test without update did show issue.
PASS: Monitor alive logs for behavioral correctness
PASS: Verify recovery reset occurs after expected extended time.
Closes-Bug: 1993656
Signed-off-by: Eric MacDonald <eric.macdonald@windriver.com>
Change-Id: If10bb75a1fb01d0ecd3f88524d74c232658ca29e
This update packages the pxeboot utilities into the
pxe-network-installer and modifies the kickstart to
no longer fetch them.
Once packaged they no longer need to be staged in feed.
Once packaged then they can be patched.
Test Plan:
PASS: Build and install AIO DX
PASS: Compare /var/pxeboot dir before and after update
PASS: Verify system host-reinstall controller-0 from controller-1
PASS: Verify unlock of reinstalled controller-0
PASS: Verify kickstart logs now exclude pxeboot utility staging
Story: 2009968
Task: 46619
Signed-off-by: Eric MacDonald <eric.macdonald@windriver.com>
Change-Id: I75bcfe06724cbfc7203b187cb7c131694de69920
Remove the user and group workarounds from the LAT
build and place them in the kickstart.cfg when the
ISO is initialized.
Depends-On: https://review.opendev.org/c/starlingx/tools/+/857893
Test Plan
Build platform-kickstarts package
Build ISO
Boot ISO
Login as sysadmin user
Story: 2009964
Task: 44285
Signed-off-by: Charles Short <charles.short@windriver.com>
Change-Id: I5267f228bd114a79d9acadf7ffb74a04eeb87df1