During a recent Vmware SRM (version 5.5) planned fail over I received following error during step 8 : Error – Failed to recover datastore ‘<datastorename>’. VMFS volume residing on recovered devices ‘”<device_wwn>”‘ cannot be found. Datastore was from EMC VNX replicated with EMC RecoverPoint.
I have found another way:
- Detach the LUN from all recovery hosts (Configuration – > Storage Adapters -> select HBA -> Select LUN -> right click -> Detach)
- Rerun the recovery plan.
I have not investigated this error properly so I don’t know what causes it but a quick fix for this error is:
1) Rescan all hosts in the cluster
2) Select one host – go to Configuration -> Storage -> Add Storage
3) Select Disk/LUN -> your LUN should appear on the list -> select it -> Next
4) You should be presented with mounting options -> select “Assign a new signature”
5) After finishing the wizard your datastore should appear in the datastore list.
6) Rescan storage on all hosts in the cluster
7) When LUN has “recovered” on all hosts rerun the recovery plan.
Recently I reinstalled my infra test environment and stumbled on a problem in VMware SRM while creating protection groups. My problem was that the location where I should have selected the Array Pair for which I wanted to create protection groups was empty.
After doing some digging in logs I discovered following lines in SRM log:
[01212 error ‘DatastoreGroupManager’ opID=420f60c] Device ’82’ matches two different devices ‘naa.60060e8013294d005020294d00000052’ and ‘hitachi_hus_vm0_0052’
[01212 error ‘DatastoreGroupManager’ opID=420f60c] Device ’83’ matches two different devices ‘naa.60060e8013294d005020294d00000053’ and ‘hitachi_hus_vm0_0053’
[01212 verbose ‘DatastoreGroupManager’ opID=420f60c] Matched 0 devices of total 60
[01212 warning ‘DatastoreGroupManager’ opID=420f60c] No replicated datastores found for array pair ‘array-pair-7037’
[01212 verbose ‘DatastoreGroupManager’ opID=420f60c] Recomputed datastore groups for array pair ‘array-pair-7037’: 0 replicated datastores, 0 replicated RDMs, 0 free devices, 0 datastore groups
Device 82 and device 83 were the luns which are replicated and which contain VMs. The problem was that these devices were identified into two different ESXi devices. After checking ESXi hosts I figured out the problem – out of three hosts two hosts had 3rd party multipath software installed. This caused the luns to be identified differently. After I unpresented the luns from the host which did not had 3rd party multipathing software installed the array pair showed up in SRM.
It seems that it’s needed to keep ESXi hosts similar if they have been presented replicated luns which you want to use with VMware SRM.
Two weeks ago I discovered that in VMware Site Recovery Manager 5.8 it was not possible to configure network mappings when number of port groups was 40+. Instead of list of port groups only thing that was shown was line “Portgroups”. After reducing the amount of port groups to 15 I was able to do the mappings. I reported the error to VMware and today they confirmed that it is now a verified bug. No word on the fix yet.
There is also a thread in VMware forums about this issue: https://communities.vmware.com/thread/491260
Update – VMware has informed me that this bug will be fixed in the next release. No exact ETA but indication that it will be before the end of the year.
Update 2: Site Recovery Manager 18.104.22.168 has been released which fixes the issue.