Ros os upgrade not starting new version

hi there,

i just wanted to upgrade from a running server with rancher/os:v0.8.0-rc5 to rancher/os:v0.8.0-rc6 but it always starts again with rc5. any clue what i do wrong?

ros os upgrade -f -i rancher/os:v0.8.0-rc6
Upgrading to rancher/os:v0.8.0-rc6
os-upgrade_1 | > time="2017-01-25T21:45:32Z" level=warning msg="Cloud-config not provided: you might need to provide cloud-config on bootDir with ssh_authorized_keys"
os-upgrade_1 | Installing from :v0.8.0-rc6
> INFO[0000] Rebooting

thanks for help

looks an awful lot like upgrade is broken.

[root@test docker]# ros os upgrade -f -i rancher/os:v0.8.0-rc6
Upgrading to rancher/os:v0.8.0-rc6
Pulling os-upgrade (rancher/os:v0.8.0-rc6)...
v0.8.0-rc6: Pulling from rancher/os
0a8490d0dfd3: Pull complete
0690e52389de: Pull complete
9d941214f486: Pull complete
2643593d92dc: Pull complete
5ebef577beda: Pull complete
39e6e88df41d: Pull complete
79902cb25aea: Pull complete
a38f53a2be24: Pull complete
0061afe50916: Pull complete
Digest: sha256:70ad3b9cec1594691118364f8896a8ab53fb311b58d06bcb6b39e05ee15751b2
Status: Downloaded newer image for rancher/os:v0.8.0-rc6
os-upgrade_1 | > time="2017-01-26T10:48:31Z" level=warning msg="Cloud-config not provided: you might need to provide cloud-config on bootDir with ssh_authorized_keys"
os-upgrade_1 | Installing from :v0.8.0-rc6
os-upgrade_1 | lsblk: 1: not a block device
os-upgrade_1 | > time="2017-01-26T10:48:31Z" level=error msg="error layDownOS exit status 32"
os-upgrade_1 | > time="2017-01-26T10:48:31Z" level=fatal msg="Failed to run install" err="exit status 32"
> INFO[0086] Rebooting
Connection to 127.0.0.1 closed by remote host.
exit status 255
sven@i7:~/src/rancher$ docker-machine ssh test
[docker@test ~]$ uname -a
Linux test 4.9.5-rancher #1 SMP Sat Jan 21 13:35:55 UTC 2017 x86_64 GNU/Linux
[docker@test ~]$ ros -v
ros version v0.8.0-rc5
[docker@test ~]$ ls -la /bin/ros
-rwxr-xr-x    1 root     root      17678832 Jan 22 12:55 /bin/ros

I’ll look into it tomorrow and make an rc7.

sorry, ignore that example - that was using docker-machine, which doesn’t install - it only uses the disk for state (and reboots to use the ISO)

but it is something where i need to improve the error reporting (to say that there is no upgrading a RO ISO when there’s only a STATE partition :/)

I’ll do some real testing tomorrow.

hi sven,

thanks for helping.
so you mean its not possible to upgrade from one rc version to another when i get you right?

cheers maik

yeah, it looks like upgrading from an installed 0.7.1 to 0.8.0-rc6 works, but from 0.8.0-rc5 doesn’t.

another detail that i need to figure out :slight_smile:

yup - see https://github.com/rancher/os/pull/1557

rc7 should resolve this.

hi sven,

thanks for the fix.
i guess this means for me a complete new installation to switch to the next rc version, right?

if so is there some handy script to backup all my stuff on the server (was something i wanted to look into next week but maybe u can give me a tipp)?

so far i use a privat catalog where i have my stacks, so this is fine.
for other services i use the rancher export function so im quick up with the functionality.
but last thing i need to check is whats the best way to get the state / data back.
i was thinking about a container which run some shell script to export container and volumes but i could think this problem is already solved.

p.s. on another machine i just upgraded from rc6 to rc7 without any problems.

the last rc should be able to upgrade previous rc’s now :slight_smile:

nice, thanks sven.
you guys do a awesome job with rancher, thanks for that.

Hello,

I’m facing the same type of issue:
rancher@rancher ~] sudo ros os version v1.1.3 [rancher@rancher ~] sudo ros os list
rancher/os:v1.4.0 local latest
rancher/os:v1.3.0 remote available
rancher/os:v1.2.0 remote available
rancher/os:v1.1.4 remote available
rancher/os:v1.1.3 remote available running
rancher/os:v1.1.2 remote available
rancher/os:v1.1.1 remote available
rancher/os:v1.1.0 remote available
rancher/os:v1.0.5 remote available
rancher/os:v1.0.4 remote available
rancher/os:v1.0.3 remote available
rancher/os:v1.0.2 remote available
rancher/os:v1.0.1 remote available
rancher/os:v1.0.0 remote available
rancher/os:v0.9.2 remote available
rancher/os:v0.8.1 remote available
rancher/os:v0.7.1 remote available
rancher/os:v0.6.1 remote available
rancher/os:v0.5.0 remote available
rancher/os:v0.4.5 remote available
[rancher@rancher ~] [rancher@rancher ~] sudo ros os upgrade
Upgrading to rancher/os:v1.4.0
Continue [y/N]: y
os-upgrade_1 | Installing from :v1.4.0
Continue with reboot [y/N]: y
INFO[0017] Rebooting
INFO[0018] Setting ros timeout to 60 (rancher.shutdown_timeout set to 60)
…^[ ] reboot:info: Setting reboot timeout to 60 (rancher.shutdown_timeout set to 60)
.=.[ ] reboot:info: Stopping /docker : 468787ce8910
…N…[ ] reboot:info: Stopping /ntp : bd0e840d47df
…>.[ ] reboot:info: Stopping /network : 1587ef2921e6
.<…[ ] reboot:info: Stopping /udev : 5ecd71c69333
.B.[ ] reboot:info: Stopping /system-cron : 53bb0faa43b2
…>…[ ] reboot:info: Stopping /syslog : a968924d0bd0
.<.[ ] reboot:info: Stopping /acpid : 32e5a9e8bc89
.acpid: exiting
.I…[ ] reboot:info: Console Stopping [/console] : 11585c3ae0a2
…[ 198.308368] reboot: Restarting system
[ 0.000000] Linux version 4.9.75-rancher (root@3aa51a65381a) (gcc version 5.4.0 20160609 (Ubuntu 5.4.0-6ubuntu1~16.04.5) ) #1 SMP Sat Jan 6 00:16:10 UTC 2018
[ 0.000000] Command line: console=ttyS0 BOOT_IMAGE=/boot/vmlinuz-4.9.75-rancher rancher.password=kvk.nqe0 printk.devkmsg=on rancher.state.dev=LABEL=RANCHER_STATE rancher.state.wait rancher.resize_device=/dev/sda
[ 0.000000] KERNEL supported cpus:
[ 0.000000] Intel GenuineIntel
[ 0.000000] AMD AuthenticAMD
[ 0.000000] Centaur CentaurHauls
[ 0.000000] x86/fpu: Supporting XSAVE feature 0x001: ‘x87 floating point registers’
[ 0.000000] x86/fpu: Supporting XSAVE feature 0x002: ‘SSE registers’
[ 0.000000] x86/fpu: Supporting XSAVE feature 0x004: ‘AVX registers’
[ 0.000000] x86/fpu: xstate_offset[2]: 576, xstate_sizes[2]: 256
[ 0.000000] x86/fpu: Enabled xstate features 0x7, context size is 832 bytes, using ‘standard’ format.
[ 0.000000] x86/fpu: Using ‘eager’ FPU context switches.
[ 0.000000] e820: BIOS-provided physical RAM map:
[ 0.000000] BIOS-e820: [mem 0x0000000000000000-0x000000000009fffe] usable
[ 0.000000] BIOS-e820: [mem 0x0000000000100000-0x00000000bfffffff] usable
[ 0.000000] BIOS-e820: [mem 0x0000000100000000-0x000000013fffffff] usable
[ 0.000000] NX (Execute Disable) protection: active
[ 0.000000] SMBIOS 2.6 present.
[ 0.000000] Kernel/User page tables isolation: disabled
[ 0.000000] e820: last_pfn = 0x140000 max_arch_pfn = 0x400000000
[ 0.000000] MTRR: Disabled
[ 0.000000] x86/PAT: MTRRs disabled, skipping PAT initialization too.
[ 0.000000] CPU MTRRs all blank - virtualized system.
[ 0.000000] x86/PAT: Configuration [0-7]: WB WT UC- UC WB WT UC- UC
[ 0.000000] e820: last_pfn = 0xc0000 max_arch_pfn = 0x400000000
[ 0.000000] found SMP MP-table at [mem 0x000f0000-0x000f000f] mapped at [ffff9e54800f0000]
[ 0.000000] Scanning 1 areas for low memory corruption
[ 0.000000] Using GB pages for direct mapping
[ 0.000000] RAMDISK: [mem 0x04b27000-0x07253fff]
[ 0.000000] ACPI: Early table checksum verification disabled

           ,        , ______                 _                 _____ _____TM

,------------|’------’| | ___ \ | | / _ / |
/ . ‘-’ |- | |
/ /
_ _ __ | | ___ _ __ | | | \ '–.
/| | | | // ’ | ’ \ / | ’ \ / _ \ '
’ | | | |’–.
| ..’----’ | |\ \ (_| | | | | (| | | | / | | _/ /_/ /
| | | | _| __
,|| |_|_
|| ||_
|_| _
/_
/
_
/ __/ Linux 4.9.75-rancher

     RancherOS #1 SMP Sat Jan 6 00:16:10 UTC 2018 rancher ttyS0
     docker-sys: 172.18.42.2 eth0: 192.168.2.16 lo: 127.0.0.1

rancher login: rancher
Password:
[rancher@rancher ~] sudo ros os version v1.1.3 [rancher@rancher ~]

I always end-up with 1.1.3
Have I missed something ?
Thanks

Looks like an issue on freenas

V.