Channel: VMware Communities : Discussion List - ESXi
Viewing all 8132 articles

Sharing esata disk (Windows LAN)

I currently run several VMs in Workstation on Windows 10. The CPU is an i7-3930K, i.e. no VT-d. The PC also has a 24 TB RAID 5 external array (Areca), connected via eSATA and shared as a read/write folder in Windows so that other PCs can use it as a file server. I read that switching to ESXi allows better memory management and therefore better VM performance. I have the following questions:

1. Will the performance difference be noticeable, especially if I increase the number of VMs to 10+?
2. Will it be possible to share the eSATA disk from one of the VMs, or through ESXi itself, so that other Windows machines see it as a shared folder?
3. Since the PC is pretty old (Sabertooth X79), should I use an older version of ESXi for driver compatibility, or will 7.0 work just fine?

 

thx!


When is the next release for HPE Custom ESXi ISO for ESXi 6.7 Gen 9 +?

The latest ESXi 6.7 U3 release on the HPE site (https://www.hpe.com/us/en/servers/hpe-esxi.html) is from March 2020.

 

Name: VMware-ESXi-6.7.0-Update3-15160138-HPE-Gen9plus-670.U3.10.5.5.25-Mar2020.iso
Release Date: 2020-03-31
Build Number: 15160138

 

Source: https://my.vmware.com/group/vmware/downloads/details?downloadGroup=OEM-ESXI67U3-HPE&productId=742

 

The latest VMware patch for ESXi 6.7 U3 was released in August 2020.

Name: ESXi670-202008001
Product: ESXi (Embedded and Installable) 6.7.0
Download Size: 475.3 MB
Release Date: 08/20/2020
Build Number: 16713306

 

Source: VMware ESXi 6.7, Patch Release ESXi670-202008001

ESXi 6.5.0 #PF Exception 14 in world 66075:tq:tcpip4 IP

Greetings,

I had a PSOD on my ESXi 6.5 host and was hoping someone could help me resolve the problem. I can't find any information related to tq:tcpip4, so I hope someone can help me with that.

Thanks for your time.

 

2020-08-29T23:55:25.508Z cpu2:66075)VMware ESXi 6.5.0 [Releasebuild-7967591 x86_64]

#PF Exception 14 in world 66075:tq:tcpip4 IP 0x4180173c3c55 addr 0x0

PTEs:0x166032027;0x16602e027;0x0;

2020-08-29T23:55:25.508Z cpu2:66075)cr0=0x8001003d cr2=0x0 cr3=0x4c1000 cr4=0x216c

2020-08-29T23:55:25.508Z cpu2:66075)frame=0x4390d0d9bac0 ip=0x4180173c3c55 err=0 rflags=0x10297

2020-08-29T23:55:25.508Z cpu2:66075)rax=0x0 rbx=0x1 rcx=0x0

2020-08-29T23:55:25.509Z cpu2:66075)rdx=0x1 rbp=0x1 rsi=0x4305eda83510

2020-08-29T23:55:25.509Z cpu2:66075)rdi=0x0 r8=0x1 r9=0x4305eda835e0

2020-08-29T23:55:25.509Z cpu2:66075)r10=0x4390c01a7100 r11=0x4305ed961c88 r12=0x0

2020-08-29T23:55:25.509Z cpu2:66075)r13=0x0 r14=0x1 r15=0x0

2020-08-29T23:55:25.509Z cpu2:66075)pcpu:0 world:67770 name:"lwsmd" (U)

2020-08-29T23:55:25.509Z cpu2:66075)pcpu:1 world:2608807 name:"vmm0:XXX" (V)

2020-08-29T23:55:25.509Z cpu2:66075)pcpu:2 world:66075 name:"tq:tcpip4" (S)

2020-08-29T23:55:25.509Z cpu2:66075)pcpu:3 world:2609108 name:"vmm1:YYY" (V)

2020-08-29T23:55:25.509Z cpu2:66075)pcpu:4 world:65562 name:"netCoalesce2World" (S)

2020-08-29T23:55:25.509Z cpu2:66075)pcpu:5 world:66100 name:"DVFilter-ReplyWorld" (S)

2020-08-29T23:55:25.509Z cpu2:66075)pcpu:6 world:2609105 name:"vmm0:XXX" (V)

2020-08-29T23:55:25.509Z cpu2:66075)pcpu:7 world:2608810 name:"vmm1:YYY" (V)

2020-08-29T23:55:25.509Z cpu2:66075)pcpu:8 world:65768 name:"VSCSIPoll" (S)

2020-08-29T23:55:25.509Z cpu2:66075)pcpu:9 world:2608812 name:"vmm3:XXX" (V)

2020-08-29T23:55:25.509Z cpu2:66075)pcpu:10 world:2609111 name:"vmx-mks:YYY" (U)

2020-08-29T23:55:25.509Z cpu2:66075)pcpu:11 world:65896 name:"vmsyslogd" (U)

2020-08-29T23:55:25.509Z cpu2:66075)pcpu:12 world:2608811 name:"vmm2:XXX" (V)

2020-08-29T23:55:25.509Z cpu2:66075)pcpu:13 world:66078 name:"Tcpip4 wtask" (S)

2020-08-29T23:55:25.509Z cpu2:66075)pcpu:14 world:65663 name:"itRebalance" (S)

2020-08-29T23:55:25.509Z cpu2:66075)pcpu:15 world:67145 name:"rhttpproxy-work" (U)

2020-08-29T23:55:25.509Z cpu2:66075)@BlueScreen: #PF Exception 14 in world 66075:tq:tcpip4 IP 0x4180173c3c55 addr 0x0

PTEs:0x166032027;0x16602e027;0x0;
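
For anyone triaging a similar screen, the header line can be split into its fields. The following is a minimal Python sketch, not a VMware tool: the regular expression and the function name parse_psod are assumptions made for illustration, based only on the line format shown in this log.

```python
import re

# Pattern inferred from the "#PF Exception ..." line in the log above (an assumption,
# not an official format description).
PSOD_RE = re.compile(
    r"#PF Exception (?P<vector>\d+) in world (?P<world_id>\d+):(?P<world_name>.+?) "
    r"IP (?P<ip>0x[0-9a-fA-F]+) addr (?P<addr>0x[0-9a-fA-F]+)"
)

def parse_psod(line):
    """Return the exception vector, world and addresses from a PSOD header line."""
    m = PSOD_RE.search(line)
    if m is None:
        return None
    return {
        "vector": int(m["vector"]),          # 14 is the x86 page-fault vector
        "world_id": int(m["world_id"]),
        "world_name": m["world_name"],
        "ip": int(m["ip"], 16),              # instruction pointer at the fault
        "fault_addr": int(m["addr"], 16),    # address that was accessed
    }

line = "#PF Exception 14 in world 66075:tq:tcpip4 IP 0x4180173c3c55 addr 0x0"
info = parse_psod(line)
print(info)
```

Here the faulting address is 0x0 (matching cr2=0x0 in the register dump), which would be consistent with a NULL pointer access inside the tq:tcpip4 world. Identifying the responsible module still needs the backtrace, which is not included in the post.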

Update qlnativefc driver version HBA

Hello,

 

In the predictive Skyline findings, I am seeing the following:

 

An ESXi 6.7 host running with a qlnativefc driver version below 3.1.36.0 may experience a purple diagnostic screen error. Click the Ask VMware link above for more details and a resolution.
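
The rule in that finding ("below 3.1.36.0") is a numeric, field-by-field comparison, not a string comparison. A minimal Python sketch of the check follows; the function names and the sample version strings are hypothetical, and the assumption that an installed version may carry a suffix after a dash is mine, not something stated in the finding.

```python
def parse_version(text):
    """Turn '3.1.36.0' (optionally followed by '-suffix') into a tuple of ints."""
    dotted = text.split("-", 1)[0]          # drop any build suffix (assumed format)
    return tuple(int(part) for part in dotted.split("."))

def is_affected(installed, fixed="3.1.36.0"):
    """True if the installed driver version is below the version named in the finding."""
    return parse_version(installed) < parse_version(fixed)

# Hypothetical installed versions, for illustration only.
for candidate in ["3.1.8.0", "3.1.9.0-1OEM", "3.1.36.0"]:
    print(candidate, "affected" if is_affected(candidate) else "not affected")
```

Comparing tuples of integers avoids the trap where "3.1.9.0" sorts after "3.1.36.0" as plain text.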

 

As I'm a newbie, I'm not sure how to update HBA drivers.

Could you help me? This is a critical change, because taking an ESXi host offline would be a problem for me.

 

 

regards,

Building VMWare homelab

So, I have been running ESXi / vSphere in a home lab for the past year. I have learned a TON. But at the moment I am looking for some help and guidance on the best way to move my lab to the next level. When I initially set up the servers, I was concerned with developing in an isolated environment that would allow me to take down portions (on purpose or by mistake) without impacting the production environment. (For reference: the production side of my home lab is the portion that runs the services, internet, etc. for the family.)

I now have a decent understanding of some of the basics, but I am thinking that there must be a better design than what I am using now. The current setup is 2 physical systems that are essentially the same but isolated from each other. I want to delve more into clusters and all that goes with that architecture.

Would anyone be able to point me towards a good tutorial or reading on how to redesign my lab? Or maybe take some time to assist me in the design process? I appreciate any input.

 

Cheers!

How to manage NTP from command line in ESXi 6?

Is there a way to manage NTP from the ESXi shell without setting up DCLI or having access to the vicfg-ntp command?

My VMs replaced by numbers

We had an extended power outage that knocked everything down. When the power came back on and I started my ESXi 6.7 host, my VMs had all been replaced by numbers and I cannot open the VMs, including my vCenter. The ESXi server comes up fine, but I cannot read the VM configuration files for some reason. Can anybody help me?

 

VMs Gone.PNG

 

When I click on the VM to try to start it, I get the following error message:

ErrMsg.PNG

Error message on esxi 7.0

Hi all

 

I'm very new to ESXi.

I'm running ESXi 7.0 on an i9-10900 CPU with 32 GB RAM.

6 VMs: 2x Windows Server, 2x Windows 10 Pro, 1x pfSense and 1x FreeNAS

1x Windows Server is DHCP and DNS

1x Windows Server is IIS and spare for other services

1x Windows 10 is torrent and Plex

1x Windows 10 is home automation

The pfSense VM has 2 Ethernet ports passed through (for WAN and LAN)

The FreeNAS VM has the integrated SATA controller passed through, which holds 1x 5 TB HDD. (I had to manually map this controller to make it available for passthrough.)

 

Everything was working great on Friday.

I went away for the weekend and came back to a non-functioning network.

After a reboot and a few hours of doing nothing, things started working again.

Today I shut down all the VMs in order to turn the power off for some electrical work, and suddenly lost the connection to ESXi.

 

My desktop and esxi are connected by ethernet to the same switch. Both on static IPs, same subnet etc.

 

I looked at the ESXi host screen and saw the attached error message.

I haven't turned the power back on yet, as work is ongoing, but hopefully it will resolve itself once I do.

I would like some input as to what the error message means. I have no idea where to begin. Any help would be great.

 

Thanks.


Help configuring passthru gfx card

Hello,

 

I have installed ESXi 7 with a Windows 10 guest VM to play and test some games. I added an NVIDIA Quadro card to my ESXi host.

What I succeeded in doing:

- Installed the Quadro

- Added the PCI device to my VM; the Quadro shows up in the guest's Device Manager (using the NVIDIA drivers)

I added hypervisor.cpuid.v0=FALSE to the configuration of the Windows 10 VM.

 

What is strange / where I need help:

- In my Windows 10 guest, Device Manager shows the Quadro card AND the SVGA card. Can I remove the latter? I tried svga.present=FALSE, but then Windows 10 doesn't boot.

- When I restart my guest, the Quadro is lost in the guest OS, and I need to restart ESXi and reconfigure the PCI devices. Doing this on each guest boot is not great!

- When I launch a 'small game' like Siberia, the FPS is low. It is very slow, which is not normal with the Quadro; it behaves as if it were using the SVGA card, even though Device Manager sees the Quadro. Strange!

- In the VM configuration, under the graphics card, I enabled 3D support. Is that necessary when using the Quadro hardware?

What are my configuration mistakes?

Any help?

 

thanks

Bruno

Applying a patch... do I need to update first?

Hi folks,

 

Rookie question...

 

I was recently told by VMware support to install patch ESXi 6.7 EP14 (Build-15820472).

The host is currently running ESXi 6.7 GA (Build-8169922).

Does that mean I have to update to ESXi 6.7 U3 before I download and install the VIB for EP14? (I download it separately because the facility has no internet connection.)

 

Appreciate your help.

 

Regards

Lum

Dependency error on patching with 'esxcli software vib' command

I got a 'DependencyError' when applying a new patch downloaded from the VMware patch list.

patch_error.png

 

Can anyone help?

 

TIA

VMDK / API Snapshot woes

Where do I begin.

 

I feel like I am always a newbie with VMware, despite working with it for a few years. We are running a vSAN environment on 6.5 and performing backups with Veeam. About two weeks ago, some of our VMs in the backup started throwing errors. Due to some events outside of my control, I only started looking at this today. Veeam support said the error was caused by a corrupt VMX file; the recommended solution was to shut down the machine, remove it from inventory, create a new machine using the existing disks, and bring it back up. We performed this procedure on a non-critical machine, and it worked great. We did it to a semi-critical machine, and it worked great again. We did it to our Exchange server and... it wasn't great.

 

The server came back up; however, after a few hours of operation, a large number of people reported missing about two weeks of email. We had the machine up for about 5 hours, poking around in logs, before I shut it down to focus on the VMware side of things. After a ton of digging in the guest as well as in the host environment, I figured out the root cause: despite there being no snapshots in the snapshot manager, the system had been running off a snapshot left behind by the failed backup. I made the mistake of attaching the original vmdk files rather than the 000001.vmdk files when booting the new machine. That was my own mistake of making assumptions, thinking those files were somehow orphaned since the snapshot manager listed no snapshots. The previous, successful machines either didn't have a snapshot file, or historical data didn't matter on those guests.

 

After talking with VMware support, they basically said that since the original vmdks were booted, the damage is done and the data should be considered lost. They did say I can try to remove the drives from the guest and re-add the snapshot versions, but they had little faith that it would work, and warned of a high chance of corrupting both the vmdk and the snapshot vmdk. Since the last shutdown, I've kept the server powered off and have been looking for any option to get this machine back to life with its current data, and have run into a brick wall every time. Being cautious about any further steps because of the corruption warnings, I've copied all files except the snapshot files from their original location on the datastore to a different location, to mitigate the risk of further corruption. The snapshot files, however, will simply not budge. Web client copy, SSH copy, vmkfstools -i: nothing will get those files somewhere else at their original size (though I can download what looks to be the header with WinSCP).

 

I'm desperately trying to safeguard the snapshot data before doing something that may corrupt the whole guest, and to get this thing back into an up-to-date, running condition. Since this is an Exchange server, the files are quite large. Just copying out the files took 3 hours. I'm now attempting a clone, as I've read that a clone may merge snapshot files automatically, with the hope that it won't impact the original files. If the clone doesn't work, my last resort would be to try to boot off the snapshots, knowing I may lose everything. Finally I've landed here, having seen some users get results with help from some of you truly amazing experts. The final kick in the rear is that our management is getting ready to accept the data loss just to get the server back on and email flowing, so their patience is thin. I'm casting a bottle into the sea here, hoping it comes back with some much-needed help in time. Attaching relevant info that I've seen requested in other posts:

 

Directory ls -lh of original files:

 

-rw-r--r--    1 root     root          92 Oct 24  2018 CAKEXK01-8d4db6ef.hlog

-rw-------    1 root     root       32.6K Nov 15 08:02 CAKEXK01-Snapshot557.vmsn

-rw-r--r--    1 root     root          13 May  8  2019 CAKEXK01-aux.xml

-rw-------    1 root     root        8.5K Nov 14 08:12 CAKEXK01.nvram

-rw-------    1 root     root          45 Nov 14 08:12 CAKEXK01.vmsd

-rwx------    1 root     root        4.6K Dec  6 21:22 CAKEXK01.vmx

-rw-------    1 root     root        3.3K May 17  2018 CAKEXK01.vmxf

-rw-------    1 root     root        5.0M Dec  6 21:22 CAKEXK01_3-000001-ctk.vmdk

-rw-------    1 root     root         408 Nov 15 08:02 CAKEXK01_3-000001.vmdk

-rw-------    1 root     root         600 Dec  7 04:12 CAKEXK01_3.vmdk

-rw-------    1 root     root        5.9M Dec  6 21:22 CAKEXK01_4-000001-ctk.vmdk

-rw-------    1 root     root         409 Nov 15 08:02 CAKEXK01_4-000001.vmdk

-rw-------    1 root     root         576 Dec  7 04:12 CAKEXK01_4.vmdk

-rw-------    1 root     root        2.0M Dec  6 21:22 CAKEXK01_5-000001-ctk.vmdk

-rw-------    1 root     root         407 Nov 15 08:09 CAKEXK01_5-000001.vmdk

-rw-------    1 root     root         598 Dec  7 04:12 CAKEXK01_5.vmdk

drwxr-xr-x    1 root     root         280 Dec  7 06:38 bak

-rw-------    1 root     root      299.5K May 17  2018 vmware-3.log

-rw-------    1 root     root       15.2M Sep 21  2018 vmware-4.log

-rw-------    1 root     root        3.0M Oct 18  2018 vmware-5.log

-rw-------    1 root     root      393.2K Oct 22  2018 vmware-6.log

-rw-------    1 root     root      467.3K Oct 24  2018 vmware-7.log

-rw-------    1 root     root      244.0K Oct 24  2018 vmware-8.log

-rw-------    1 root     root       45.4M Dec  6 21:22 vmware.log

 

Directory ls -lh of newly created machine that is pointing to the above vmdk's:

 

-rw-r--r--    1 root     root         295 Dec  6 21:35 CAKEXK01-35be335f.hlog

-rw-------    1 root     root        8.5K Dec  7 05:25 CAKEXK01.nvram

-rw-r--r--    1 root     root           0 Dec  6 21:35 CAKEXK01.vmsd

-rwxr-xr-x    1 root     root        3.8K Dec  7 05:25 CAKEXK01.vmx

-rw-------    1 root     root        3.1K Dec  6 21:45 CAKEXK01.vmxf

-rw-r--r--    1 root     root        1.0M Dec  7 03:08 vmware-1.log

-rw-r--r--    1 root     root      322.3K Dec  7 05:25 vmware.log
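
The situation described in this post (snapshot descriptors present on disk while the snapshot manager shows nothing) can be spotted from the file names alone. Below is a minimal Python sketch that groups the descriptor names from the first listing by base disk; the function name snapshot_chains and the six-digit suffix pattern are assumptions based on the names shown here. It only inspects names: the link that actually matters is the parentFileNameHint entry inside each snapshot descriptor, which this sketch does not read.

```python
import re
from collections import defaultdict

# Descriptor file names copied from the first listing in the post.
names = [
    "CAKEXK01_3-000001-ctk.vmdk",
    "CAKEXK01_3-000001.vmdk",
    "CAKEXK01_3.vmdk",
    "CAKEXK01_4-000001-ctk.vmdk",
    "CAKEXK01_4-000001.vmdk",
    "CAKEXK01_4.vmdk",
    "CAKEXK01_5-000001-ctk.vmdk",
    "CAKEXK01_5-000001.vmdk",
    "CAKEXK01_5.vmdk",
]

# "<base>-NNNNNN.vmdk" is treated as a snapshot descriptor; "-ctk.vmdk" files
# (change tracking) do not match because they do not end in "-NNNNNN.vmdk".
SNAPSHOT_RE = re.compile(r"^(?P<base>.+)-(?P<seq>\d{6})\.vmdk$")

def snapshot_chains(file_names):
    """Map each base disk name to the snapshot descriptors found next to it."""
    chains = defaultdict(list)
    for name in sorted(file_names):
        m = SNAPSHOT_RE.match(name)
        if m:
            chains[m["base"] + ".vmdk"].append(name)
    return dict(chains)

chains = snapshot_chains(names)
for base, deltas in chains.items():
    print(f"{base} has snapshot descriptor(s): {deltas}")
```

A check like this before re-registering a VM would flag all three disks here as having a snapshot descriptor, regardless of what the snapshot manager displays.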

VMware vSphere compatibility on DELL Embedded PC3000 hardware

I am planning to install VMware vSphere on DELL Embedded PC3000 hardware, which has an Intel® Atom Processor E3845 (Quad Core, 2M Cache, 1.91 GHz, 10W), 8GB RAM and 256 SSD storage. Do you see any possibility of installing a VMware vSphere ESXi host on this hardware and then hosting a Linux virtual machine on it?

ESXi 6.7 - OVF Export Fails on VMDK export

I just installed VMware ESXi on a Dell R620 to test with. I have an openSUSE 42.3 Linux guest VM that is powered off. When I go to export it as OVF, I can fetch the .ovf file, but fetching the associated VMDK fails. In Google Chrome I get "Failed - Network Error." When I use Firefox, it fetches about 80-90 MB of the VMDK and then stops. There is no error shown. The ESXi host shows the request for the export but does not log any actual error.

 

I tried downloading the VMDK via the datastore browser but that didn't fetch the actual vmdk.

Cannot boot from ESXi 7.0

Hi,

 

I tried a clean 7.0 install first and couldn't boot after installation; the BIOS couldn't see the disk as bootable.

So I decided to install 6.7u3b and upgrade afterwards, but the problem still persists: I could boot fine under 6.7, but as soon as I upgraded to 7.0 the disk was not bootable anymore.

 

No warnings during installation.

 

Any help would be much appreciated.

 

Thanks.


Issue with smx-providers on VMware 6.5 U3

Hi VMware-Folks,

 

I ran into the following issue and would like to get the VMware view of the problem.

Maybe someone has run into the same issue:

We have several HPE DL360 Gen10 servers, installed with the HPE customized image: VMware-ESXi-6.5.0-Update3-14990892-HPE-Gen9plus-650.U3.10.5.0.67-Dec2019.iso

All share the same issue: if we try to poll status via WBEM, nothing is returned, as if the package were not installed.

I could narrow it down to the following:

 

The WBEM provider list shows a strange ".#vmw_smx-provider" entry:

esxcli system wbem provider list

Name                Enabled  Loaded

------------------  -------  ------

.#vmw_smx-provider     true    false

sfcb_base              true    true

vmw_base               true    true

vmw_hdr                true    true

vmw_hhrcwrapper        true    true

vmw_iodmProvider       true    true

vmw_kmodule            true    true

vmw_omc                true    true

vmw_pci                true    true

vmw_smx-provider       true    true

vmw_vi                 true    true
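
To spot the odd entry without reading the table by eye, the output can be filtered for providers that are enabled but not loaded. This is a minimal Python sketch that works on the pasted text only; the function name not_loaded is hypothetical, and the three-column layout is assumed from the output shown here.

```python
# Output of "esxcli system wbem provider list" as pasted in the post.
PROVIDER_LIST = """\
Name                Enabled  Loaded
------------------  -------  ------
.#vmw_smx-provider     true    false
sfcb_base              true    true
vmw_base               true    true
vmw_hdr                true    true
vmw_hhrcwrapper        true    true
vmw_iodmProvider       true    true
vmw_kmodule            true    true
vmw_omc                true    true
vmw_pci                true    true
vmw_smx-provider       true    true
vmw_vi                 true    true
"""

def not_loaded(output):
    """Return names of providers that are enabled but reported as not loaded."""
    rows = [line.split() for line in output.splitlines() if line.strip()]
    flagged = []
    for name, enabled, loaded in rows[2:]:   # skip the header and the dashed rule
        if enabled == "true" and loaded != "true":
            flagged.append(name)
    return flagged

print(not_loaded(PROVIDER_LIST))
```

On this output, only the ".#vmw_smx-provider" entry is flagged, while the regular vmw_smx-provider entry is reported as loaded.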

 

Looking at /var/lib/sfcb/registration gives:

 

-rw-rw-rw-    1 root     root           112 Sep  3 13:38 loaded
drwxr-xr-x    1 root     root           512 Sep  3 13:37 repository
-r--r--r--    1 root     root          3513 Nov  1  2019 sfcb_base-providerRegister
-r--r--r--    1 root     root          4307 Nov  1  2019 vmw_base-providerRegister
-r--r--r--    1 root     root           374 Nov  1  2019 vmw_hdr-providerRegister
-r--r--r--    1 root     root           897 Nov  1  2019 vmw_hhrcwrapper-providerRegister
-r--r--r--    1 root     root          4220 Nov  1  2019 vmw_iodmProvider-providerRegister
-r--r--r--    1 root     root           164 Nov  1  2019 vmw_kmodule-providerRegister
-r--r--r--    1 root     root         15650 Jun 21  2019 vmw_omc-providerRegister
-r--r--r--    1 root     root          1550 Nov  1  2019 vmw_pci-providerRegister
-r--r--r--    1 root     root          1308 Sep  3 13:38 vmw_smx-provider-providerRegister
-r--r--r--    1 root     root          2898 Mar 15  2018 vmw_vi-providerRegister

 

However there is the hidden file mentioned above:

 

ls -l /var/lib/sfcb/registration/.#vmw_smx-provider-providerRegister
-r--r--r-T    1 root     root         77881 Aug  9  2019 /var/lib/sfcb/registration/.#vmw_smx-provider-providerRegister

 

This file has the same byte size as vmw_smx-provider-providerRegister on Gen9 servers and obviously contains the HPE-specific data.

If I overwrite vmw_smx-provider-providerRegister with the contents of .#vmw_smx-provider-providerRegister, the polling works as intended.

That lasts only until the next reboot, or even just until a restart of the management network ...

 

I found: https://www.virtuallyghetto.com/2011/08/how-to-persist-configuration-changes-in.html

 

... and tried to use the sticky bit, but nothing kept the file from being overwritten with the default(?) version.

I also tried downgrading the smx-provider package by up to three versions, and downgraded the whole management bundle in a separate attempt.

Nothing helped.

Gen9 systems with the same image do not have this issue at all. (As mentioned, there is just one vmw_smx-provider-providerRegister, with the correct byte size, which matches the .#vmw_smx-provider-providerRegister version on the Gen10 systems.)

Does anyone have an idea how to replace vmw_smx-provider-providerRegister with the correct version and make that version stick in the bootbank?

 

Upgrading to 6.7 Ux will happen, but is currently not an option for a quick fix.

The mentioned systems are the only Gen10 servers in this environment.

Cannot edit VM with passthrough devices

Hello,

I have installed the newest patch for ESXi 6.5 (ESXi650-202007001 / Build 16576891). Since the update I cannot edit VMs with passthrough devices (e.g. an FC card). Is this a bug in this patch, is it already known, and is a fix already in progress?

 

Many Thanks.

Lost access to volume … due to connectivity issues

Hello,

 

I have a brand new SYS-E200-8D micro server equipped with 64 GB RAM and a 1 TB Samsung EVO 850 PRO SATA 6 SSD, and installed ESXi 6.5 on it.

I have currently deployed one Windows Server 2016 RTM virtual machine on the SSD, which acts as the datastore.

 

But as soon as I do storage performance testing within the VM (I use CrystalDiskMark) I get the following warnings in the Monitor tab of the Datastore:

 

Successfully restored access to volume 58399b3f-53265d09-9851-0cc47aca3b52 (datastore1) following connectivity issues. | Saturday, November 26, 2016, 20:47:22 +0100 | Warning
Lost access to volume 58399b3f-53265d09-9851-0cc47aca3b52 (datastore1) due to connectivity issues. Recovery attempt is in progress and outcome will be reported shortly. | Saturday, November 26, 2016, 20:47:22 +0100 | Warning
Successfully restored access to volume 58399b3f-53265d09-9851-0cc47aca3b52 (datastore1) following connectivity issues. | Saturday, November 26, 2016, 20:46:59 +0100 | Warning
Lost access to volume 58399b3f-53265d09-9851-0cc47aca3b52 (datastore1) due to connectivity issues. Recovery attempt is in progress and outcome will be reported shortly. | Saturday, November 26, 2016, 20:46:58 +0100 | Warning
Successfully restored access to volume 58399b3f-53265d09-9851-0cc47aca3b52 (datastore1) following connectivity issues. | Saturday, November 26, 2016, 20:45:02 +0100 | Warning
Lost access to volume 58399b3f-53265d09-9851-0cc47aca3b52 (datastore1) due to connectivity issues. Recovery attempt is in progress and outcome will be reported shortly. | Saturday, November 26, 2016, 20:45:01 +0100 | Warning
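
To see how long each interruption lasted, the "lost" and "restored" events can be paired up. This is a minimal Python sketch using only the six timestamps listed in the post; the function name outage_seconds and the ("lost"/"restored", timestamp) tuple layout are assumptions made for illustration.

```python
from datetime import datetime

FMT = "%A, %B %d, %Y, %H:%M:%S %z"
DAY = "Saturday, November 26, 2016, "

# The six events from the post, newest first, as shown in the Monitor tab.
events = [
    ("restored", DAY + "20:47:22 +0100"),
    ("lost",     DAY + "20:47:22 +0100"),
    ("restored", DAY + "20:46:59 +0100"),
    ("lost",     DAY + "20:46:58 +0100"),
    ("restored", DAY + "20:45:02 +0100"),
    ("lost",     DAY + "20:45:01 +0100"),
]

def outage_seconds(events_newest_first):
    """Pair each 'lost' event with the next 'restored' event; return durations in seconds."""
    durations = []
    lost_at = None
    # Walk oldest to newest by reversing, so same-second pairs keep their order.
    for kind, stamp in reversed(events_newest_first):
        when = datetime.strptime(stamp, FMT)
        if kind == "lost":
            lost_at = when
        elif kind == "restored" and lost_at is not None:
            durations.append((when - lost_at).total_seconds())
            lost_at = None
    return durations

durations = outage_seconds(events)
print(durations)
```

At the one-second resolution of these timestamps, every interruption lasts about a second or less. That is an observation from these six events only, not a diagnosis.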

 

CrystalDiskMark also reports very bad throughput for the C drive, which is physically stored through the vmdk file on the Datastore (which makes sense because of the above mentioned warnings).

 

Any ideas what could be happening here and why I get these warnings?

I have also a Mac Mini setup (16 GB RAM, 256 GB Samsung SATA 6 SSD, ESXi 6.5 installation) where these warnings don't occur when I do storage performance tests.

 

Thanks for your help & input,

 

-Klaus

Is it possible to either get ESXi 7 working with Intel 82576 network controller, or possible to still get a trial of ESXi 6.7?

Hello, I recently signed up for a 60 day trial of the Product Evaluation Center for VMware vSphere 7.0. I have never used vSphere before, and am trying to learn it to expand my employment opportunities.

 

I have a Supermicro H8DGU-F, which has the Intel 82576 network controller. The server also has the Intel Gigabit ET2 Quad Port Server Adapter, and has the model E1G44ET2BLK listed on the label. Searching for that model brings up an ark.intel.com page for the ET2, and if it is the correct page for this model ("E1G44ET2BLK" is nowhere on the page, in spite of it being one of the top results when using it as the search term), then it also uses the Intel 82576 network controller.

 

During setup, I see the following errors:
nfs41client failed to load

vmfs3 failed to load

Util: 1338: Failed trying to get a valid VMKernel MAC address: not found [...] Add/configure VMKernel NICs

 

If I am interpreting the results of searching the VMware Compatibility Guide for "82576", this hardware will support only up to ESXi 6.7. Am I correct?

 

Assuming I cannot run ESXi 7, I guess I'd like to try the newest compatible version. I cannot find where to register to evaluate vSphere 6.7. Is it still possible to do this?

 

Looking forward to any assistance that can be provided. Thank you.

Sysbench file I/O test on nvme crash

My environment is as follows:

- Supermicro H11SSL-NC with EPYC 7232P CPU

- ESXi-7.0b-16324942-standard

- Western Digital SN750 1T disk

- Debian 10 guest

 

I'm testing the WD disk with this configuration, but I experience a crash every time I test file I/O with sysbench. I'm using the following command:

 

sysbench fileio --file-test-mode=seqwr --file-total-size=10G --file-block-size=16K --threads=8 --time=60 run

 

Please take a look at the following kernel log:

2020-09-07T09:10:19.407Z cpu8:1049379)WARNING: NVMEIO:2223 Controller 256 receiv - Pastebin.com

 

You can see that there is a critical warning 0x2 at the beginning, and after that everything goes down. APD starts and All Paths Down is finally reached. The disk never recovers and I have to power off the guest. It is in an invalid state after that, and I need to reboot the host to recover.

 

It appears that increasing --file-block-size makes the system crash earlier. With a 4K block size it seems to work.

 

Any ideas what the issue is and/or how to work around it? What is that critical warning 0x2?
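
One possible reading of that value, offered as an assumption rather than a confirmed diagnosis: if the 0x2 in the log is the Critical Warning byte from the drive's NVMe SMART / Health Information log, it is a bit field and can be decoded as in the Python sketch below. The bit meanings are paraphrased from the NVMe specification; the function name decode_critical_warning is hypothetical, and whether the ESXi log line really reports this field is not confirmed by the post.

```python
# Bit meanings of the NVMe SMART / Health "Critical Warning" byte (paraphrased).
CRITICAL_WARNING_BITS = {
    0: "available spare capacity below threshold",
    1: "temperature outside threshold (over or under)",
    2: "NVM subsystem reliability degraded",
    3: "media placed in read-only mode",
    4: "volatile memory backup device failed",
}

def decode_critical_warning(value):
    """Return the descriptions of all warning bits set in the given byte."""
    return [
        text
        for bit, text in CRITICAL_WARNING_BITS.items()
        if value & (1 << bit)
    ]

# Value taken from the kernel log mentioned in the post.
print(decode_critical_warning(0x2))
```

Under that assumption, 0x2 sets only bit 1, the temperature warning, which would fit a drive that fails sooner under heavier sequential writes. Checking the drive's reported temperature during the test would confirm or rule this out.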
