SLES11 SP3 crashes after integrating into AD

Hello.
We’re running a virtual SLES11 SP3 server on VMware 5. After joining Samba to AD (Windows 2008 R2 Std SP1), the server crashes every Sunday at 02:00 AM.
How can I find and fix the error?

Thanks in advance for tips!

Best regards

Dirk Emmermacher

Hi Dirk,

[QUOTE=Emmermacher;17405]Hello.
We’re running a virtual SLES11 SP3 server on VMware 5. After joining Samba to AD (Windows 2008 R2 Std SP1), the server crashes every Sunday at 02:00 AM.
How can I find and fix the error?

Thanks in advance for tips!

Best regards

Dirk Emmermacher[/QUOTE]

do you have any details about the “crash”? Does the VM hang, is it being restarted, …?

The next things to look at are the system logs (/var/log/messages) and any VMware server logs for the VM.

As it is reproducible and hits at that exact time, checking cron jobs run at that time might provide a hint, as well.
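
One way to run that cron check is sketched below. It is only an illustration, not something from the posters' setups: the helper names `field_matches` and `jobs_at` are made up, the sample crontab in the usage note is hypothetical, and the matching is deliberately simplified. It looks only at the minute, hour and day-of-week fields, ignores day-of-month and month, and does not understand names such as `sun` or `@weekly` shortcuts.

```python
def field_matches(field, value, lo, hi):
    """True if a numeric cron field ('*', '*/15', '1-5', '0,30') includes value."""
    for part in field.split(","):
        rng, _, step = part.partition("/")
        step = int(step) if step else 1
        if rng == "*":
            start, end = lo, hi
        elif "-" in rng:
            a, b = rng.split("-")
            start, end = int(a), int(b)
        else:
            start = end = int(rng)
        if value in range(start, end + 1, step):
            return True
    return False


def jobs_at(crontab_text, minute, hour, weekday):
    """Return crontab lines that fire at minute/hour on weekday (0 = Sunday)."""
    hits = []
    for line in crontab_text.splitlines():
        line = line.strip()
        # Skip blanks, comments, '@reboot'-style shortcuts and VAR=value lines.
        if not line or line[0] in "#@" or "=" in line.split()[0]:
            continue
        fields = line.split(None, 5)
        if len(fields) < 6:
            continue
        m, h, _dom, _mon, dow = fields[:5]
        try:
            # Cron accepts both 0 and 7 for Sunday.
            day_ok = field_matches(dow, weekday, 0, 7) or (
                weekday == 0 and field_matches(dow, 7, 0, 7))
            if (day_ok and field_matches(m, minute, 0, 59)
                    and field_matches(h, hour, 0, 23)):
                hits.append(line)
        except ValueError:
            # Field uses syntax this sketch does not handle; check it by hand.
            continue
    return hits
```

Feeding it the text of `crontab -l` for each user (and the files under /etc/cron.d, where the extra user column simply ends up in the command part) with `jobs_at(text, 0, 2, 0)` would list candidates for Sunday 02:00. Jobs started indirectly, for example from a weekly run-parts directory, would not show up this way.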

Regards,
Jens

Good morning (Moin) Jens.

Thanks for your answer.
Here is /var/log/messages.
Nov 9 22:00:06 myserver syslog-ng[1448]: Log statistics; dropped=‘pipe(/dev/xconsole)=0’, dropped=‘pipe(/dev/tty10)=0’, processed=‘center(queued)=18201’, processed=‘center(received)=18201’, processed=‘destination(messages)=18201’, processed=‘destination(mailinfo)=0’, processed=‘destination(mailwarn)=0’, processed=‘destination(localmessages)=0’, processed=‘destination(newserr)=0’, processed=‘destination(mailerr)=0’, processed=‘destination(netmgm)=0’, processed=‘destination(warn)=0’, processed=‘destination(console)=0’, processed=‘destination(null)=0’, processed=‘destination(mail)=0’, processed=‘destination(xconsole)=0’, processed=‘destination(firewall)=0’, processed=‘destination(acpid)=0’, processed=‘destination(newscrit)=0’, processed=‘destination(newsnotice)=0’, processed=‘source(src)=18201’
Nov 9 22:10:02 myserver /usr/sbin/cron[30426]: (megaadm) CMD (/mega/megadb/export/export.sc >/mega/megadb/export/export.log 2>&1)
Nov 9 22:30:02 myserver -- MARK --
Nov 9 22:50:02 myserver -- MARK --
Nov 9 23:00:06 myserver syslog-ng[1448]: Log statistics; dropped=‘pipe(/dev/xconsole)=0’, dropped=‘pipe(/dev/tty10)=0’, processed=‘center(queued)=18205’, processed=‘center(received)=18205’, processed=‘destination(messages)=18205’, processed=‘destination(mailinfo)=0’, processed=‘destination(mailwarn)=0’, processed=‘destination(localmessages)=0’, processed=‘destination(newserr)=0’, processed=‘destination(mailerr)=0’, processed=‘destination(netmgm)=0’, processed=‘destination(warn)=0’, processed=‘destination(console)=0’, processed=‘destination(null)=0’, processed=‘destination(mail)=0’, processed=‘destination(xconsole)=0’, processed=‘destination(firewall)=0’, processed=‘destination(acpid)=0’, processed=‘destination(newscrit)=0’, processed=‘destination(newsnotice)=0’, processed=‘source(src)=18205’
Nov 9 23:20:07 myserver -- MARK --
Nov 9 23:40:07 myserver -- MARK --
Nov 10 00:00:06 myserver syslog-ng[1448]: Log statistics; dropped=‘pipe(/dev/xconsole)=0’, dropped=‘pipe(/dev/tty10)=0’, processed=‘center(queued)=18208’, processed=‘center(received)=18208’, processed=‘destination(messages)=18208’, processed=‘destination(mailinfo)=0’, processed=‘destination(mailwarn)=0’, processed=‘destination(localmessages)=0’, processed=‘destination(newserr)=0’, processed=‘destination(mailerr)=0’, processed=‘destination(netmgm)=0’, processed=‘destination(warn)=0’, processed=‘destination(console)=0’, processed=‘destination(null)=0’, processed=‘destination(mail)=0’, processed=‘destination(xconsole)=0’, processed=‘destination(firewall)=0’, processed=‘destination(acpid)=0’, processed=‘destination(newscrit)=0’, processed=‘destination(newsnotice)=0’, processed=‘source(src)=18208’
Nov 10 00:20:07 myserver -- MARK --
Nov 10 00:40:07 myserver -- MARK --
Nov 10 01:00:07 myserver -- MARK --
Nov 10 01:00:07 myserver syslog-ng[1448]: Log statistics; dropped=‘pipe(/dev/xconsole)=0’, dropped=‘pipe(/dev/tty10)=0’, processed=‘center(queued)=18212’, processed=‘center(received)=18212’, processed=‘destination(messages)=18212’, processed=‘destination(mailinfo)=0’, processed=‘destination(mailwarn)=0’, processed=‘destination(localmessages)=0’, processed=‘destination(newserr)=0’, processed=‘destination(mailerr)=0’, processed=‘destination(netmgm)=0’, processed=‘destination(warn)=0’, processed=‘destination(console)=0’, processed=‘destination(null)=0’, processed=‘destination(mail)=0’, processed=‘destination(xconsole)=0’, processed=‘destination(firewall)=0’, processed=‘destination(acpid)=0’, processed=‘destination(newscrit)=0’, processed=‘destination(newsnotice)=0’, processed=‘source(src)=18212’
Nov 10 01:20:07 myserver -- MARK --
Nov 10 01:40:07 myserver -- MARK --
Nov 10 02:00:07 myserver -- MARK --
Nov 10 02:00:07 myserver syslog-ng[1448]: Log statistics; dropped=‘pipe(/dev/xconsole)=0’, dropped=‘pipe(/dev/tty10)=0’, processed=‘center(queued)=18216’, processed=‘center(received)=18216’, processed=‘destination(messages)=18216’, processed=‘destination(mailinfo)=0’, processed=‘destination(mailwarn)=0’, processed=‘destination(localmessages)=0’, processed=‘destination(newserr)=0’, processed=‘destination(mailerr)=0’, processed=‘destination(netmgm)=0’, processed=‘destination(warn)=0’, processed=‘destination(console)=0’, processed=‘destination(null)=0’, processed=‘destination(mail)=0’, processed=‘destination(xconsole)=0’, processed=‘destination(firewall)=0’, processed=‘destination(acpid)=0’, processed=‘destination(newscrit)=0’, processed=‘destination(newsnotice)=0’, processed=‘source(src)=18216’
Nov 11 11:01:16 myserver syslog-ng[1487]: syslog-ng starting up; version=‘2.0.9’
Nov 11 11:01:20 myserver kernel: klogd 1.4.1, log source = /proc/kmsg started.
Nov 11 11:01:20 myserver kernel: [ 24.760219] fuse init (API version 7.16)
Nov 11 11:01:20 myserver kernel: [ 37.967079] type=1400 audit(1384164038.853:2): apparmor=“STATUS” operation=“profile_load” name="/bin/ping" pid=1088 comm=“apparmor_parser”
Nov 11 11:01:20 myserver kernel: [ 38.099860] type=1400 audit(1384164038.985:3): apparmor=“STATUS” operation=“profile_load” name="/sbin/klogd" pid=1093 comm=“apparmor_parser”
Nov 11 11:01:20 myserver kernel: [ 39.066541] type=1400 audit(1384164039.953:4): apparmor=“STATUS” operation=“profile_load” name="/sbin/syslog-ng" pid=1104 comm=“apparmor_parser”
Nov 11 11:01:20 myserver kernel: [ 39.224133] type=1400 audit(1384164040.109:5): apparmor=“STATUS” operation=“profile_load” name="/sbin/syslogd" pid=1110 comm=“apparmor_parser”
Nov 11 11:01:20 myserver kernel: [ 39.371465] type=1400 audit(1384164040.257:6): apparmor=“STATUS” operation=“profile_load” name="/usr/lib/PolicyKit/polkit-explicit-grant-helper" pid=1119 comm=“apparmor_parser”
Nov 11 11:01:20 myserver kernel: [ 40.085717] type=1400 audit(1384164040.973:7): apparmor=“STATUS” operation=“profile_load” name="/usr/lib/PolicyKit/polkit-grant-helper" pid=1132 comm=“apparmor_parser”
Nov 11 11:01:20 myserver kernel: [ 40.307438] type=1400 audit(1384164041.193:8): apparmor=“STATUS” operation=“profile_load” name="/usr/lib/PolicyKit/polkit-grant-helper-pam" pid=1147 comm=“apparmor_parser”
Nov 11 11:01:20 myserver kernel: [ 40.805424] type=1400 audit(1384164041.693:9): apparmor=“STATUS” operation=“profile_load” name="/usr/lib/PolicyKit/polkit-read-auth-helper" pid=1162 comm=“apparmor_parser”
Nov 11 11:01:20 myserver kernel: [ 41.131163] type=1400 audit(1384164042.017:10): apparmor=“STATUS” operation=“profile_load” name="/usr/lib/PolicyKit/polkit-resolve-exe-helper" pid=1190 comm=“apparmor_parser”
Nov 11 11:01:20 myserver kernel: [ 41.402943] type=1400 audit(1384164042.289:11): apparmor=“STATUS” operation=“profile_load” name="/usr/lib/PolicyKit/polkit-revoke-helper" pid=1194 comm=“apparmor_parser”
Nov 11 11:01:20 myserver kernel: [ 43.390025] audit_printk_skb: 9 callbacks suppressed
Nov 11 11:01:20 myserver kernel: [ 43.390056] type=1400 audit(1384164044.277:15): apparmor=“STATUS” operation=“profile_load” name="/usr/sbin/mdnsd" pid=1234 comm=“apparmor_parser”
Nov 11 11:01:20 myserver kernel: [ 44.181958] type=1400 audit(1384164045.073:16): apparmor=“STATUS” operation=“profile_load” name="/usr/sbin/nscd" pid=1248 comm=“apparmor_parser”
Nov 11 11:01:20 myserver kernel: [ 44.411005] type=1400 audit(1384164045.301:17): apparmor=“STATUS” operation=“profile_load” name="/usr/sbin/ntpd" pid=1255 comm=“apparmor_parser”
Nov 11 11:01:20 myserver kernel: [ 44.533482] type=1400 audit(1384164045.421:18): apparmor=“STATUS” operation=“profile_load” name="/usr/sbin/traceroute" pid=1265 comm=“apparmor_parser”
Nov 11 11:01:20 myserver kernel: [ 75.137096] microcode: CPU0 sig=0x6f7, pf=0x1, revision=0x6b
Nov 11 11:01:20 myserver kernel: [ 76.647499] microcode: CPU1 sig=0x6f7, pf=0x1, revision=0x6b
Nov 11 11:01:20 myserver kernel: [ 76.648000] microcode: Microcode Update Driver: v2.00 tigran@aivazian.fsnet.co.uk, Peter Oruba
Nov 11 11:01:20 myserver kernel: [ 77.672676] acpiphp: ACPI Hot Plug PCI Controller Driver version: 0.5
Nov 11 11:01:20 myserver kernel: [ 77.675035] acpiphp: Slot [32] registered
Nov 11 11:01:20 myserver kernel: [ 77.675225] acpiphp: Slot [33] registered
Nov 11 11:01:20 myserver kernel: [ 77.675325] acpiphp: Slot [34] registered
Nov 11 11:01:20 myserver kernel: [ 77.675364] acpiphp: Slot [35] registered
Nov 11 11:01:20 myserver kernel: [ 77.675410] acpiphp: Slot [36] registered
Nov 11 11:01:20 myserver kernel: [ 77.675454] acpiphp: Slot [37] registered
Nov 11 11:01:20 myserver kernel: [ 77.675495] acpiphp: Slot [38] registered
Nov 11 11:01:20 myserver kernel: [ 77.675535] acpiphp: Slot [39] registered
Nov 11 11:01:20 myserver kernel: [ 77.675576] acpiphp: Slot [40] registered
Nov 11 11:01:20 myserver kernel: [ 77.675617] acpiphp: Slot [41] registered
Nov 11 11:01:20 myserver kernel: [ 77.675660] acpiphp: Slot [42] registered
Nov 11 11:01:20 myserver kernel: [ 77.675702] acpiphp: Slot [43] registered
Nov 11 11:01:20 myserver kernel: [ 77.675744] acpiphp: Slot [44] registered
Nov 11 11:01:20 myserver kernel: [ 77.675786] acpiphp: Slot [45] registered
Nov 11 11:01:20 myserver kernel: [ 77.675828] acpiphp: Slot [46] registered
Nov 11 11:01:20 myserver kernel: [ 77.675869] acpiphp: Slot [47] registered
Nov 11 11:01:20 myserver kernel: [ 77.675910] acpiphp: Slot [48] registered
Nov 11 11:01:20 myserver kernel: [ 77.675950] acpiphp: Slot [49] registered
Nov 11 11:01:20 myserver kernel: [ 77.675991] acpiphp: Slot [50] registered
Nov 11 11:01:20 myserver kernel: [ 77.676032] acpiphp: Slot [51] registered
Nov 11 11:01:20 myserver kernel: [ 77.676073] acpiphp: Slot [52] registered
Nov 11 11:01:20 myserver kernel: [ 77.676116] acpiphp: Slot [53] registered
Nov 11 11:01:20 myserver kernel: [ 77.676157] acpiphp: Slot [54] registered
Nov 11 11:01:20 myserver kernel: [ 77.676198] acpiphp: Slot [55] registered
Nov 11 11:01:20 myserver kernel: [ 77.676243] acpiphp: Slot [56] registered
Nov 11 11:01:20 myserver kernel: [ 77.676285] acpiphp: Slot [57] registered
Nov 11 11:01:20 myserver kernel: [ 77.676326] acpiphp: Slot [58] registered
Nov 11 11:01:20 myserver kernel: [ 77.676367] acpiphp: Slot [59] registered
Nov 11 11:01:20 myserver kernel: [ 77.676408] acpiphp: Slot [60] registered
Nov 11 11:01:20 myserver kernel: [ 77.676454] acpiphp: Slot [61] registered
Nov 11 11:01:20 myserver kernel: [ 77.676495] acpiphp: Slot [62] registered
Nov 11 11:01:20 myserver kernel: [ 77.676538] acpiphp: Slot [63] registered
Nov 11 11:01:25 myserver rchal: CPU frequency scaling is not supported by your processor.
Nov 11 11:01:25 myserver rchal: boot with ‘CPUFREQ=no’ in to avoid this warning.
Nov 11 11:01:25 myserver rchal: Cannot load cpufreq governors - No cpufreq driver available
Nov 11 10:01:41 myserver ifup: lo
Nov 11 10:01:42 myserver ifup: lo
Nov 11 10:01:42 myserver ifup: IP address: 127.0.0.1/8
Nov 11 10:01:42 myserver ifup:
Nov 11 10:01:42 myserver ifup:
Nov 11 10:01:42 myserver ifup: IP address: 127.0.0.2/8
Nov 11 10:01:42 myserver ifup:
Nov 11 10:01:43 myserver ifup: eth1 device: Intel Corporation 82545EM Gigabit Ethernet Co
Nov 11 10:01:43 myserver kernel: [ 96.859759] e1000: eth1 NIC Link is Up 1000 Mbps Full Duplex, Flow Control: None
Nov 11 10:01:43 myserver ifup: eth1
Nov 11 10:01:43 myserver ifup: IP address: 192.168.xxx.xxx/24
Nov 11 10:01:43 myserver ifup:
Nov 11 10:01:51 myserver SuSEfirewall2: SuSEfirewall2 not active
Nov 11 10:01:52 myserver rpcbind: cannot create socket for udp6
Nov 11 10:01:52 myserver rpcbind: cannot create socket for tcp6
Nov 11 10:01:52 myserver sm-notify[2559]: Version 1.2.3 starting
Nov 11 10:02:09 myserver haveged: haveged starting up
Nov 11 10:02:09 myserver haveged: arch: x86 vendor: intel generic: 0 i_cache: 32 d_cache: 32 loop_idx: 29 loop_idxmax: 40 loop_sz: 31874 loop_szmax: 113090 etime: 103866 havege_ndpt 0
Nov 11 10:02:09 myserver auditd[2736]: Started dispatcher: /sbin/audispd pid: 2738
Nov 11 10:02:09 myserver auditd[2736]: Init complete, auditd 1.8 listening for events (startup state disable)
Nov 11 10:02:09 myserver audispd: priority_boost_parser called with: 4
Nov 11 10:02:09 myserver audispd: max_restarts_parser called with: 10
Nov 11 10:02:10 myserver audispd: af_unix plugin initialized
Nov 11 10:02:10 myserver audispd: audispd initialized with q_depth=80 and 1 active plugins
Nov 11 10:02:11 myserver logger: /etc/init.d/xdm: No changes for /etc/X11/xdm/Xservers
Nov 11 10:02:11 myserver logger: /etc/init.d/xdm: No changes for /etc/X11/xdm/xdm-config
Nov 11 10:02:14 myserver su: (to informix) root on /dev/pts/7
Nov 11 10:02:45 myserver ntpd[3095]: ntpd 4.2.4p8@1.1612-o Thu Nov 22 17:39:18 UTC 2012 (1)
Nov 11 10:02:45 myserver ntpd[3098]: precision = 1.000 usec
Nov 11 10:02:45 myserver ntpd[3098]: ntp_io: estimated max descriptors: 1024, initial socket boundary: 16
Nov 11 10:02:45 myserver ntpd[3098]: Listening on interface #0 wildcard, 0.0.0.0#123 Disabled
Nov 11 10:02:45 myserver ntpd[3098]: Listening on interface #1 lo, 127.0.0.1#123 Enabled
Nov 11 10:02:45 myserver ntpd[3098]: Listening on interface #2 lo, 127.0.0.2#123 Enabled
Nov 11 10:02:45 myserver ntpd[3098]: Listening on interface #3 eth1, 192.168.xxx.xxx#123 Enabled
Nov 11 10:02:45 myserver ntpd[3098]: kernel time sync status 2040
Nov 11 10:02:45 myserver ntpd[3098]: frequency initialized 9.060 PPM from /var/lib/ntp/drift/ntp.drift
Nov 11 10:02:53 myserver /usr/sbin/cron[3239]: (CRON) STARTUP (V5.0)
Nov 11 10:02:55 myserver smartd[3255]: smartd 6.0 2012-10-10 r3643 [i686-linux-3.0.93-0.8-default] (SUSE RPM)
Nov 11 10:02:55 myserver smartd[3255]: Copyright (C) 2002-12, Bruce Allen, Christian Franke, www.smartmontools.org
Nov 11 10:02:55 myserver smartd[3255]: Opened configuration file /etc/smartd.conf
Nov 11 10:02:55 myserver smartd[3255]: Drive: DEVICESCAN, implied ‘-a’ Directive on line 26 of file /etc/smartd.conf
Nov 11 10:02:55 myserver smartd[3255]: Configuration file /etc/smartd.conf was parsed, found DEVICESCAN, scanning devices
Nov 11 10:02:55 myserver smartd[3255]: Device: /dev/sda, opened
Nov 11 10:02:55 myserver smartd[3255]: Device: /dev/sda, [VMware Virtual disk 1.0 ], 128 GB
Nov 11 10:02:55 myserver smartd[3255]: Device: /dev/sda, Bad IEC (SMART) mode page, err=-5, skip device
Nov 11 10:02:55 myserver smartd[3255]: Unable to monitor any SMART enabled devices. Try debug (-d) option. Exiting…
Nov 11 08:02:54 myserver sshd[3320]: Server listening on 0.0.0.0 port 22.
Nov 11 08:02:56 myserver xinetd[3337]: Reading included configuration file: /etc/xinetd.d/chargen [file=/etc/xinetd.conf] [line=26]
Nov 11 08:02:56 myserver xinetd[3337]: Reading included configuration file: /etc/xinetd.d/chargen-udp [file=/etc/xinetd.d/chargen-udp] [line=14]
Nov 11 08:02:56 myserver xinetd[3337]: Reading included configuration file: /etc/xinetd.d/cups-lpd [file=/etc/xinetd.d/cups-lpd] [line=15]
Nov 11 08:02:57 myserver xinetd[3337]: Reading included configuration file: /etc/xinetd.d/daytime [file=/etc/xinetd.d/daytime] [line=11]
Nov 11 08:02:57 myserver xinetd[3337]: Reading included configuration file: /etc/xinetd.d/daytime-udp [file=/etc/xinetd.d/daytime-udp] [line=14]
Nov 11 08:02:57 myserver xinetd[3337]: Reading included configuration file: /etc/xinetd.d/discard [file=/etc/xinetd.d/discard] [line=15]
Nov 11 08:02:57 myserver xinetd[3337]: Reading included configuration file: /etc/xinetd.d/discard-udp [file=/etc/xinetd.d/discard-udp] [line=14]
Nov 11 08:02:57 myserver xinetd[3337]: Reading included configuration file: /etc/xinetd.d/echo [file=/etc/xinetd.d/echo] [line=15]
Nov 11 08:02:57 myserver xinetd[3337]: Reading included configuration file: /etc/xinetd.d/echo-udp [file=/etc/xinetd.d/echo-udp] [line=14]
Nov 11 08:02:57 myserver xinetd[3337]: Reading included configuration file: /etc/xinetd.d/netstat [file=/etc/xinetd.d/netstat] [line=15]
Nov 11 08:02:57 myserver xinetd[3337]: Reading included configuration file: /etc/xinetd.d/pure-ftpd [file=/etc/xinetd.d/pure-ftpd] [line=16]
Nov 11 08:02:57 myserver xinetd[3337]: Reading included configuration file: /etc/xinetd.d/rexec [file=/etc/xinetd.d/rexec] [line=21]
Nov 11 08:02:58 myserver xinetd[3337]: Reading included configuration file: /etc/xinetd.d/rlogin [file=/etc/xinetd.d/rlogin] [line=20]
Nov 11 08:02:58 myserver xinetd[3337]: Reading included configuration file: /etc/xinetd.d/rsh [file=/etc/xinetd.d/rsh] [line=24]
Nov 11 08:02:58 myserver xinetd[3337]: Reading included configuration file: /etc/xinetd.d/rsync [file=/etc/xinetd.d/rsync] [line=22]
Nov 11 08:02:59 myserver xinetd[3337]: Reading included configuration file: /etc/xinetd.d/sane-port [file=/etc/xinetd.d/sane-port] [line=12]
Nov 11 08:03:00 myserver xinetd[3337]: Reading included configuration file: /etc/xinetd.d/servers [file=/etc/xinetd.d/servers] [line=14]
Nov 11 08:03:00 myserver xinetd[3337]: Reading included configuration file: /etc/xinetd.d/services [file=/etc/xinetd.d/services] [line=14]
Nov 11 08:03:00 myserver xinetd[3337]: Reading included configuration file: /etc/xinetd.d/swat [file=/etc/xinetd.d/swat] [line=14]
Nov 11 08:03:00 myserver xinetd[3337]: Reading included configuration file: /etc/xinetd.d/systat [file=/etc/xinetd.d/systat] [line=12]
Nov 11 08:03:00 myserver xinetd[3337]: Reading included configuration file: /etc/xinetd.d/tftp [file=/etc/xinetd.d/tftp] [line=17]
Nov 11 08:03:01 myserver xinetd[3337]: Reading included configuration file: /etc/xinetd.d/time [file=/etc/xinetd.d/time] [line=15]
Nov 11 08:03:01 myserver xinetd[3337]: Reading included configuration file: /etc/xinetd.d/time-udp [file=/etc/xinetd.d/time-udp] [line=15]
Nov 11 08:03:01 myserver xinetd[3337]: Reading included configuration file: /etc/xinetd.d/vnc [file=/etc/xinetd.d/vnc] [line=15]
Nov 11 08:03:01 myserver xinetd[3337]: xinetd Version 2.3.14 started with libwrap loadavg options compiled in.
Nov 11 08:03:01 myserver xinetd[3337]: Started working: 1 available service
Nov 11 08:03:24 myserver gnome-session[3331]: WARNING: Application ‘gnome-settings-daemon.desktop’ failed to register before timeout
Nov 11 08:03:37 myserver pulseaudio[3508]: pid.c: Stale PID file, overwriting.

There are no weekly cron jobs defined on the server. After checking my backup software, I disabled the weekly backup that captures the whole machine through the VMware mechanism rather than saving individual files. We’re using Barracuda Backup. So I will wait a few days and hope that this was the cause of the failure. The system freezes completely; it is not possible to log on.

Best regards from Hannover

Dirk Emmermacher

Moin, moin Dirk,

the syslog does show that the server is still alive at 2 AM (latest message at 2:00:07) :wink:
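
For longer logs, the point where the entries stop can be found mechanically. The following is a minimal sketch under some assumptions: the function names `parse_ts` and `largest_gap` are invented for illustration, the log uses the classic `Mon D HH:MM:SS` prefix seen above, month names are parsed with an English/C locale, and the year (which syslog does not record) is passed in. 2013 is used as the default here because the audit timestamps in the boot messages fall in November 2013.

```python
from datetime import datetime, timedelta


def parse_ts(line, year):
    """Parse the leading 'Mon D HH:MM:SS' stamp of a syslog line, or return None."""
    parts = line.split(None, 3)
    if len(parts) < 3:
        return None
    stamp = "%d %s %s %s" % (year, parts[0], parts[1], parts[2])
    try:
        return datetime.strptime(stamp, "%Y %b %d %H:%M:%S")
    except ValueError:
        return None


def largest_gap(lines, year=2013, minimum=timedelta(minutes=30)):
    """Return (gap, last_before, first_after) for the longest silence, or None.

    Gaps shorter than `minimum` are ignored, since the 20-minute MARK
    interval is normal. Timestamps that jump backwards (as during boot,
    when the clock is adjusted) give negative gaps and are ignored too.
    """
    best = None
    prev = None
    for line in lines:
        ts = parse_ts(line, year)
        if ts is None:
            continue
        if prev is not None and ts - prev >= minimum:
            if best is None or ts - prev > best[0]:
                best = (ts - prev, prev, ts)
        prev = ts
    return best
```

Called as `largest_gap(open("/var/log/messages"))`, it would point at the silence between the 02:00:07 entry on Nov 10 and the syslog-ng start-up at 11:01:16 on Nov 11 in the excerpt above.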

[disabled backup run] So I will wait some days.

If that backup run was active around 2 AM, then this sounds pretty reasonable (I take it that by “weekly backup of machine using the vmware mechanism” you meant backing up VMware snapshots of the VM’s disk(s)). If that doesn’t help, in your position I’d start checking your monitoring tools to see when exactly the server went down (i.e. no longer responding to pings, services not reachable, etc.) and double-check the VMware logs of the VM for any hints around that time. You could also have a look at the VMware console to correlate disk/CPU activity with the service loss.

Might it be that your backup is driving the network load too high, creating latencies that lead to a time-out situation when accessing the virtual disk, which is in turn reported to the VM and makes it stall? That’s something your network monitoring might reveal and/or lead to messages in the VMware VM log.

Regards,
Jens

Moin Jens.

Thanks for your answer. In the notifications I’ve got from my backup system there is a warning, “Changed block tracking is not enabled”. I’m not sure that this is the reason for the machine freezing (with ping I’ve got an answer but no more…). On the VMware server I found the message “Errorcode 4: Quiesce aborted”. I’ll look into that error.

Best regards.

Dirk

Hi Dirk,

with ping I’ve got an answer but no more…

this may sound like nitpicking, but that VM isn’t frozen: at least the kernel and the IP stack are still responsive. Chances are high that some (virtual) disk access problem put the file system into read-only (r/o) mode, so nothing more could be written to syslog afterwards.

Do you have a chance of leaving a remote connection to the server open, running “dmesg” in a loop (probably starting at about 1:55 AM :wink: ), and of setting up syslog to send messages to a remote syslog server, so you’ll have a copy of the messages even after the local FS becomes unavailable?
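
The dmesg loop could look roughly like the sketch below. Treat it as an illustration with assumptions: `new_lines` and `watch` are made-up names, it needs Python 3.7 or newer (which a stock SLES 11 installation may not provide, so it might have to run elsewhere with a different command passed in), and it must be started from a session that is already open before the problem hits. The output goes to the terminal only, so it stays visible even if the local file system turns read-only.

```python
import subprocess
import sys
import time


def new_lines(previous, current):
    """Return the lines of `current` that follow the last line of `previous`."""
    if not previous:
        return list(current)
    last = previous[-1]
    # Search backwards so a repeated message matches its newest occurrence.
    for i in range(len(current) - 1, -1, -1):
        if current[i] == last:
            return list(current[i + 1:])
    # The kernel ring buffer wrapped past everything seen so far.
    return list(current)


def watch(command=("dmesg",), interval=5.0):
    """Poll `command` forever and print only kernel messages not seen before."""
    seen = []
    while True:
        result = subprocess.run(list(command), capture_output=True, text=True)
        current = result.stdout.splitlines()
        for line in new_lines(seen, current):
            # Prefix with wall-clock time to correlate with the VMware logs.
            print(time.strftime("%H:%M:%S"), line, flush=True)
        if current:
            seen = current
        time.sleep(interval)


if __name__ == "__main__" and "--watch" in sys.argv:
    watch()
```

Started with the `--watch` argument shortly before 2 AM, it prints each new kernel message once, prefixed with the local time. A plain shell loop around `dmesg` would serve the same purpose; the diffing here only keeps the output short.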

Regards,
Jens

Hi Jens.

The backup of the complete VM was the reason for the frozen machine. I disabled the backup last week, and this morning the server could be used normally. Next I will look for the cause of the backup error.
I’ll be back when I’ve got new information.

Best regards.

Dirk

Hi Dirk,

[QUOTE=Emmermacher;17520]Hi Jens.

The backup of the complete VM was the reason for the frozen machine. I disabled the backup last week, and this morning the server could be used normally. Next I will look for the cause of the backup error.
I’ll be back when I’ve got new information.

Best regards.

Dirk[/QUOTE]

great, you’re on track! And you know how to reproduce the error at some off-peak time… rather than Sunday morning, 2 AM :smiley: Thanks for reporting back, and let us know if we can assist with further troubleshooting.

Regards,
Jens

I have a big hunch that it has to do with CPU frequency scaling… I have the same problem.

Nov 11 11:01:25 myserver rchal: CPU frequency scaling is not supported by your processor.
Nov 11 11:01:25 myserver rchal: boot with ‘CPUFREQ=no’ in to avoid this warning.
Nov 11 11:01:25 myserver rchal: Cannot load cpufreq governors - No cpufreq driver available

On 07.02.2017 at 04:14, archproject wrote:

[QUOTE=archproject]I have a big hunch that it has to do with CPU frequency scaling… I have the same problem.

Nov 11 11:01:25 myserver rchal: CPU frequency scaling is not supported by your processor.
Nov 11 11:01:25 myserver rchal: boot with ‘CPUFREQ=no’ in to avoid this warning.
Nov 11 11:01:25 myserver rchal: Cannot load cpufreq governors - No cpufreq driver available[/QUOTE]

I really don’t see how this has anything to do with the original, 4-year-old thread?! Also, why do you think it’s a problem? That message is completely normal in a virtual environment, and causes no problems.

CU,

Massimo Rosen
Micro Focus Knowledge Partner
No emails please!
http://www.cfc-it.de