Skip to main content

How to fix | ESXI Virtual SAN Health service installation


I encountered an issue with the ESXi Virtual SAN Health service installation in one of the vSAN cluster,


ESXI Virtual SAN Health service installation_error










Step 1 :
I checked whether all the ESXi hosts are running on the same version or not,
VMware ESXi 6.0.0 build-5224934 VMware ESXi 6.0.0 Update 3 on ESX1
VMware ESXi 6.0.0 build-5224934 VMware ESXi 6.0.0 Update 3 on ESX2
VMware ESXi 6.0.0 build-5224934 VMware ESXi 6.0.0 Update 3 on ESX3
VMware ESXi 6.0.0 build-5224934 VMware ESXi 6.0.0 Update 3 on ESX5
VMware ESXi 6.0.0 build-5224934 VMware ESXi 6.0.0 Update 3 on ESX4
They are in the same version so we can go check whether vSAN health VIB is installed or not.
From the KB https://kb.vmware.com/s/article/2109874,
On vSphere 6.0 Update 2 release, none of the other health checks will be conducted until all the hosts are upgraded to 6.0 Update 2 (when running the latest version, vSAN 6.2) release to avoid false alarms.  But we have all the ESXi hosts in ESXi 6.0 Update 3
"Install the vSAN Health Service VIB on all the ESXi hosts that are participating in the vSAN cluster, or update the ESXi host to the version that is indicated in the error field. "
Step 2: 
We have all the ESXi hosts in the same version so no need this option is ignored. The only available option is to check if the vSAN health service VIB installed on the host or not.
vsanhealth                6.0.0-3000000.3.0.3.58.5224738         VMware  VMwareCertified   2017-04-13   on ESX1
vsanhealth                6.0.0-3000000.3.0.3.58.5224738         VMware  VMwareCertified   2017-04-12   on ESX2
vsanhealth                6.0.0-3000000.3.0.3.58.5224738         VMware  VMwareCertified   2017-04-13   on ESX3
vsanhealth                6.0.0-3000000.3.0.3.58.5224738         VMware  VMwareCertified   2017-04-13   on ESX5
vsanhealth                6.0.0-3000000.3.0.3.58.5224738         VMware  VMwareCertified   2017-04-13   on ESX4
And the VIB is installed on all the ESXi hosts in the cluster.
Step 3 : 
So te next is to check the vsangmgtd health state on the ESXi host ESX4 where ESXi Virtual SAN Health service installation status is uknown.
[root@esx4:/var/log] /etc/init.d/vsanmgmtd status
vsanperfsvc is running
[root@esx4:/var/log] cat /var/log/vsanmgmt.log
++++++++++++++++++++++++++++++++++++++++++++++++++++++++
2017-11-28T04:48:23Z VSANMGMTSVC: ERROR vsanperfsvc[MainThread] [statsdaemon::logDaemonMemoryStats] Failed to get memory stats: Bad parameter
Traceback (most recent call last):
  File "/build/mts/release/bora-5224738/bora/build/esx/release/vsanhealth/usr/lib/vmware/vsan/perfsvc/statsdaemon.py", line 580, in logDaemonMemoryStats
  File "/build/mts/release/bora-5224738/bora/build/esx/release/vsanhealth/usr/lib/vmware/vsan/perfsvc/statsdaemon.py", line 593, in _checkAndLogMemoryStats
ValueError: Bad parameter
2017-11-28T04:48:28Z VSANMGMTSVC: ERROR vsanperfsvc[MainThread] [cliutils::GetVSANClusterInfo] Failed to query cluster info: Failed to alloc memory for text
Traceback (most recent call last):
  File "/build/mts/release/bora-5224738/bora/build/esx/release/vsanhealth/usr/lib/vmware/vsan/perfsvc/cliutils.py", line 69, in GetVSANClusterInfo
  File "/usr/lib/vmware/hostd/hmo/VsanInternalSystem.py", line 205, in DecodeEntry
    return eval(pyCMMDS.BinToTextPolicy(e.dataStr, True, style))
RuntimeError: Failed to alloc memory for text
2017-11-28T04:48:28Z VSANMGMTSVC: ERROR vsanperfsvc[MainThread] [statsdaemon::logDaemonMemoryStats] Failed to get memory stats: Bad parameter
Traceback (most recent call last):
  File "/build/mts/release/bora-5224738/bora/build/esx/release/vsanhealth/usr/lib/vmware/vsan/perfsvc/statsdaemon.py", line 580, in logDaemonMemoryStats
  File "/build/mts/release/bora-5224738/bora/build/esx/release/vsanhealth/usr/lib/vmware/vsan/perfsvc/statsdaemon.py", line 593, in _checkAndLogMemoryStats
ValueError: Bad parameter
2017-11-28T04:48:33Z VSANMGMTSVC: ERROR vsanperfsvc[MainThread] [cliutils::GetVSANClusterInfo] Failed to query cluster info: Failed to alloc memory for text
Traceback (most recent call last):
  File "/build/mts/release/bora-5224738/bora/build/esx/release/vsanhealth/usr/lib/vmware/vsan/perfsvc/cliutils.py", line 69, in GetVSANClusterInfo
  File "/usr/lib/vmware/hostd/hmo/VsanInternalSystem.py", line 205, in DecodeEntry
    return eval(pyCMMDS.BinToTextPolicy(e.dataStr, True, style))
RuntimeError: Failed to alloc memory for text
++++++++++++++++++++++++++++++++++++++++++++++++++++++
[root@esx4:~] /etc/init.d/vsanmgmtd restart
watchdog-vsanperfsvc: Terminating watchdog process with PID 34190
vsanperfsvc started
After restarting the vsanmgmtd service, ESXI Virtual SAN Health service installation error has been disappeared and all the vSAN healths are available now.

ESXI Virtual SAN Health service installation_Green












In case if you want to install the health service on ESXi there are four installation options for the health VIB:
1. Enable health check plugin via vSphere web client
2. RVC, the Ruby vSphere Console
3.VUM, the VMware Update Manager. Use of VUM is outside the scope of this document and users are directed to the official vSphere documentation on how to use VUM to install VIBs.
4.Manual install at the ESXi command line. This is done using ESXCLI software VIB commands.
VMware recommends option 1 for ease of install, but the steps to install via option 2 are also covered here.

Read: https://www.vmware.com/content/dam/digitalmarketing/vmware/en/pdf/products/products/vsan/vmw-gdl-vsan-health-check.pdf ( From Page 12)

Comments

  1. Interesting Article. Hoping that you will continue posting an article having a useful information. Executive Health Check up tests in Sydney

    ReplyDelete
  2. Sometimes it is difficult to know when your child is going through a difficult phase online child counseling

    ReplyDelete
  3. Singapore & Malaysia Veg Tour Group Package from Mumbai | NFTT WORLD

    International Veg Tour Packages - Explore your dream holiday destination (Singapore, Malaysia) with your family, friends, and special ones at a discounted price by NFTT WORLD.

    For more Details Visit our Website : Himachal Manali Shimla Tour

    ReplyDelete
  4. Point Cook Health Centre

    We’re here for you with a range of services. Chiropractic, Massage, Podiatry and Homeopathy. Conveniently located at 17 Dunnings Road, Point Cook.

    For more Details Visit our Website : Point Cook Chiropractor

    ReplyDelete
  5. Hair Lost & Hair Growth Treatment For Men - Bioscor International

    What is the most effective treatment for regrowing hair and preventing hair loss? We evaluated value by testing products, studying trial results, and analysing trial results. Contact us for more information.

    ReplyDelete
  6. This comment has been removed by the author.

    ReplyDelete
  7. Chiro Thomastown

    Looking for Family Chiropractic Care in Thomas Town? The Thomas Town family chiropractors at Wellbeing provide treatment for various conditions that affect the nervous system and the musculoskeletal system.

    ReplyDelete
  8. Werribee Chiropractor

    Chiropractic Care Chiropractic care deals with a range of conditions affecting the musculoskeletal system. It involves diagnosing, treating, and preventing mechanical disorders. With many techniques used in chiropractic treatment, our Werribee Chiropractors use a number of combinations to identify the right treatment for patients in our care.

    ReplyDelete
  9. Wollert Chiropractor

    Wellbeing has a dedicated team of experienced chiropractors, known for providing research-based treatments. Having treated numerous patients requiring chiropractic care, our team of Wollert Chiropractors has helped many patients with their musculoskeletal health and injury concerns. To know more call us at (03) 9020 2072 to book your chiropractic appointment today.

    ReplyDelete
  10. Transform Your Look with Melbourne's Premier Hair Loss and Skin Clinic - Bioscor International

    Bioscor International offers a range of natural hair regrowth treatments and specialist hair loss services at our Australian Clinics in Melbourne & Perth.

    Website : https://bioscor.com.au/

    ReplyDelete
  11. Point Cook Chiropractors | Available 7 Days | Wellbeing Chiropractic

    Our dedicated team of Point Cook Chiropractors is known for offering research-based treatments that helped many patients with their musculoskeletal health and injury concerns. Call us today.

    ReplyDelete
  12. Reservoir Chiropractors | Available 7 Days | Wellbeing Chiropractic

    Our dedicated team of Reservoir chiropractors is known for offering research-based treatments that helped many patients with their musculoskeletal health and injury concerns. Call us today.

    ReplyDelete
  13. Our Services | Wellbeing Ringwood Chiropractic

    We are licensed Chiropractors in Ringwood. We offer range of services to the patient such as, massage, chiro treatment, emergency room hire and many more.

    ReplyDelete
  14. Wyndham Vale Chiropractors | Available 7 Days | Wellbeing Chiropractic

    Our dedicated team of Wyndham Vale Chiropractors is known for offering research-based treatments that helped many patients with their musculoskeletal health and injury concerns. Call us today.

    ReplyDelete
  15. Brunswick Chiropractors | Available 7 Days | Wellbeing Chiropractic

    Our dedicated team of Brunswick Chiropractors is known for offering research-based treatments that helped many patients with their musculoskeletal health and injury concerns. Call us today.

    ReplyDelete
  16. Our Services | Wellbeing Chiropractor

    Wellbeing Chiropractor provides you quality and thorough Chiropractic Services in Roxburgh Park. Schedule your appointment at Wellbeing Chiropractor today. Call (03) 90202614.

    ReplyDelete
  17. Our Services | Wellbeing Chiropractor

    Wellbeing Chiropractor provides thorough Chiropractic Services in St Albans. Call us at (03) 90202614 for appointments at Wellbeing St Albans Chiropractor.

    ReplyDelete
  18. Our Services | Wellbeing Chiropractor

    Wellbeing Chiropractor employs effective chiropractic techniques. Get a thorough examination, diagnosis, and treatment on your first visit. Call (03) 8001 4015.

    For more Details Visit our Website : Sunbury Chiropractor

    ReplyDelete
  19. Our Services | Wellbeing Chiropractor

    Wellbeing Chiropractor is your local Chiropractor in Tarneit that provides trusted chiropractic services to people of all ages. Call (03) 9020 7349 to book an appointment.

    ReplyDelete
  20. Our Services | Wellbeing Chiropractor

    Wellbeing Chiropractor provides thorough Chiropractic Services in Richmond. Call us at (03) 9021 8606 for appointments at Wellbeing St Albans Chiropractor.

    ReplyDelete
  21. About Us | Wellbeing Chiropractor

    Wellbeing Chiropractor focuses on providing you the high-quality Chiropractic Care in Manor Lakes. Call us at (03) 9008 5992 or book an appointment online now.

    ReplyDelete
  22. Rosanna Chiropractor | Wellbeing Chiropractor

    Wellbeing Chiropractor is a leading Chiropractic Centre and has been serving the Rosanna & local areas for years. Call and book an appointment at (03) 9458 3255.

    For more Details Visit our Website : Rosanna Chiropractor

    ReplyDelete
  23. The integration of virtual reality in diverse industries set the stage for vartuial's expansion https://petrogalant.com/

    ReplyDelete

Post a Comment

Please leave your valuable comment to improvise the content.

Popular posts from this blog

vSAN Disk group is in "Unhealthy State"

If you are running VMware vSAN 6.0, 6.1 and 6.2 then there is a high chance that you will be seeing this issue with the following RAID controllers, Cisco 12G SAS Modular Raid Controller DELL FD332-PERC (Dual ROC) DELL FD332-PERC (Single ROC) DELL PERC H730 Adapter DELL PERC H730 Mini ==> We are using with Dell R620/630 serves with this RAID controller DELL PERC H730P Adapter  DELL PERC H730P Mini Huawei Technologies Co. Ltd. SR 430C Lenovo ThinkServer RAID 720i AnyRAID Adapter Lenovo ThinkServer RAID 720ix AnyRAID Adapter Lenovo ServeRAID 5210e SAS/SATA Controller Lenovo ServeRAID M5210 SAS/SATA Controller LSI MegaRAID SAS 9361-8i LSI MegaRAID SAS 9362-8i Supermicro SMC3108 But this can happen due to Physical Disk Drive failure and RAID Controllers from above list resetting the Disk Drives. In some scenario only one disk group will go to unhealthy state or all the disk groups will go to unhealthy state on the ESXi host in the vSAN cluster. Th

How to Fix | Virtual SAN Health Alarm 'Performance data collection'' status is Red

Virtual SAN Health Alarm 'Performance data collection'' status is Red vSAN CLuster ==> Monitor ==> Virtual SAN==> Health ==> Performance Service ==> Performance Data Collection==>  Stats Gathering ==>  Failed Stats persistence==> Failed The causes for this error is unknown but there are two fixes available to this issue, 1)  Restarting the vsanmgmtd and vsanvpd service on all the ESXi hosts in the vSAN Cluster.  There is no impact of restarting these two services on the ESXi,  /etc/init.d/vsanmgmtd  restart /etc/init.d/vsanvpd restart Make sure the service is is running state after the restart,  /etc/init.d/vsanmgmtd  status /etc/init.d/vsanvpd status Post restart of the services retest the vsan health , vSAN CLuster ==> Monitor ==> Virtual SAN==> Health==>Retest and the Performance Data Collection should be green. 2) To resolve this issue, re-enable the performance service from the cluster level a.

How to fix | vSAN CLOMD Liveness - Part I

In the following scenarios, you will see the CLOMD service liveness on ESXi hosts, If any of the ESXi hosts are disconnected, the CLOMD liveness state of the disconnected host is shown as unknown .If the Health service is not installed on a particular ESXi host, the CLOMD liveness state of all the ESXi hosts is also reported as unknown. If the CLOMD service is not running on a particular ESXi host, the CLOMD liveness state of one host is abnormal. The Cluster Health – CLOMD liveness check in the vSAN Health Service, and provides details on why it might report an error.This checks if the Cluster Level Object Manager ( CLOMD ) daemon is alive or not. It does so by first checking that the service is running on all ESXi hosts, and then contacting the service to retrieve run-time statistics to verify that CLOMD can respond to inquiries.  CLOMD (Cluster Level Object Manager Daemon) plays a key role in the operation of a vSAN cluster. It runs on every ESXi host

How to Fix | Virtual SAN Health - Physical Disk Health Retrieval Issues

Physical Disk Health – Physical Disk Health Retrieval Issues In Virtual SAN cluster, there is one more common issue is the Virtual SAN health test failing to retrieve the Physical Disk Health on an ESXi host.It is informing the administrator that it cannot get physical disk-related information from the ESXi host in question in order to perform a check on the health of the physical disks. If the Virtual SAN management service vsanmgmtd on the ESXi host is nonresponsive then you will encounter this issue, in the vsanmgmt.log you will see the following snippets, ++++++++++++++++++++++++++++++++++++++++++++ [root@esxihost-1:/var/log] cat vsanmgmt.log  2017-11-15T03:08:46Z VSANMGMTSVC: INFO vsanperfsvc[Thread-1] [VsanLsomHealth::getHealthStats] Get issued comps = {}  2017-11-15T03:08:46Z VSANMGMTSVC: WARNING vsanperfsvc[Thread-1] [VsanHealthUtil::InvokeMethod] Invoke: mo=ServiceInstance, info=RetrieveContent  2017-11-15T03:08:46Z VSANMGMTSVC: ERROR vsanperfsvc[Thread-1] [Vs

Horizon View Pools stuck in Deleting state

Recently, had an issue with 2 view Desktop Pools that were stuck in Deleting state in horizon view manager. We are running Horizon View 7.2 and this issue happening since View 4.x. Out of 2 Pools, I was able to delete one pool by just removing the VM from Resources-->Machines--> Filtered using Pool name.But when doing the same thing for the TEST-Pool I was getting an error as below, "Machine","Desktop Pool","DNS Name","User","Host","Agent Version","Datastore","Status" "TEST-POOL-046","TEST-POOL","TEST-POOL-046.TEST.LOCAL","","esx3.TEST.LOCAL","Unknown","[TEST-VCENTER1VSAN]","Status:Error Status Errors:Nov 30, 2017 10:38:20 PM PST: Failed to delete VM - null" So I logged in to the connection server and found the following error logs, C:\programdata\vmware\vdm\logs\debug-2017-11-30-221023.txt ++++++++

vSAN Component Failure State - Degraded vs Absent - Part I

Failure States of Virtual SAN Components: Virtual SAN  handles failures of the host, network and storage devices in the cluster based on the severity of the failure. When these fail they directly affect the components in the  vSAN cluster.  Virtual SAN has 2 types of failure states for components ABSENT and DEGRADED. According to the component state, it uses different approaches to recover the affected components. Degraded: "A component is in degraded state if Virtual SAN detects a permanent component failure and assumes that the component is not going to recover to working state." Absent: "A component is in absent state if Virtual SAN detects a temporary component failure where the component might recover and restore its working state." An ABSENT state may or not resolve itself over time, but a  DEGRADED state is a permanent state. From the above image, left side a disk has been unplugged or offline may be reinserted or brought online, Virtual

How to fix | vSAN CLOMD Liveness - Part II (Virtual machine creation failed)

vSAN CLOMD daemon may fail when trying to repair objects with 0 byte components When Cloning a VM from a template from VRA and vCenter vMotion failed with the following errors. And the vApp deployment failed due to the clomd service is failed on the host, Read the importance of clomd here,  https://virtuawisdom.blogspot.in/2017/11/how-to-fix-vsan-clomd-liveness-part-i.html Task Details: Name: clone Status: Cannot complete file creation operation. Start Time: May 30, 2017 5:34:13 AM Completed Time: May 30, 2017 5:35:13 AM State: error Error Stack:  A CLOM is not attached. This could indicate that the clomd daemon is not running.Failed to create the object. Additional Task Details: Error Type: CannotCreateFile Task Id: Task Cancelable: true Canceled: false Description Id: VirtualMachine.clone Event Chain Id: 291778 /var/log/clomd.log +++++++++++++++++++++++++++++++++++++++++++++ 2017-05-25T18:20:12.755Z 26738391 (111018916128)(opID:0)main: Clomd is star

How to Fix | Controller utility is installed on host "Warning"

The controller utilities enable additional health checks based on controller settings. The yellow check status indicates that vSAN Health Service is not able to find the appropriate controller utility for the storage controller on the host. Typically, the controller utility is used to configure and view configuration data. When vSAN Health Service can retrieve controller configuration data, it can further analyze configuration issues for the current vSAN setup. Host with PERCCLI installed: [root@esx26:~] esxcli software  vib list | grep perccli vmware-esx-perccli-1.05.08     1.05.08-01                             LSI     PartnerSupported  2017-08-03 Host without PERCCLI: [root@esx7:~] esxcli software  vib list | grep perccli ********NO OUTPUT******** Based on the KB https://kb.vmware.com/s/article/2148867 1) Download the PERCCLI for ESXi from the below link,  http://www.dell.com/support/home/in/en/inbsdt1/Drivers/DriversDetails?driverId=XY978 2) Put the host

vCloud vApp fail to start with error "Edge deployment failed on host as the message bus infra on the host is not green"

Recently, we faced an issue when starting a vApp in the org due to an error " Edge deployment failed on host host-33 as the message bus infra on the host is not green. Please call API to re-sync the message bus and after successful re-sync, try edge installation., error code 10921" This issue is completely from the ESXi / NSX end and there is no issue in the vCloud director. But this issue severely affects the power-on opeation for the Tenants in the vcloud director. Error stack : ++++++++++++++++++++++++++++++++++++++++++++++++++++ [ 3455c8b1-d86f-4ee1-b982-d99f2737a5a3 ] Internal Server Error  - java.util.concurrent.ExecutionException: com.vmware.ssdc.util.LMException: Unable to start vApp "Test-01".  - com.vmware.ssdc.util.LMException: Unable to start vApp "Test-01".  - Unable to start vApp "Test-01".  - Unable to deploy network "Fence(urn:uuid:577df80b-f352-45f3-9739-8819fae1a02a)". com.vmware.vcloud.common.network.VsmEx