Is same as Pionieer Edition or x86-p5, but probably you're right, besides I think it's not a BIOS problem but a construction problem, it's broken.Not sure which CWWK product is supposed to be similar to the Odruid h4, but no. It will have a lower power consumption though... being bricked and all.
thought it was a given that people should use their own higher quality fans. iv got a old scythe fdb fan on my one doing very well.Just fyi, if anyone is thinking of picking up one of these with active cooling, the fan on mine lasted about 9 months of continuous use before the bearings started dying, causing it to overheat and services to crash. $30 for them to send me a replacement fan (though I care more about the waste of my time to tear it down and replace).
Kind of annoying for a router. I don't think I've ever seen a fan die that fast. I won't buy one of these with active cooling again unless the fan is a standard size I can easily replace.
I do not have the same Topton unit but my Topton came with the cable as show in the picture. It worked with an old sata3 OCZ Vertex SSD without any changes to the bios for a bare metal install of opnsense.Hi all, I have four of the Topton N305 variation A, and I am struggling to figure out how to enable the use of a SATA 2.5 SSD.
I have the right SATA data cable paired with the 4pin connector for power, but the bios does not seem to recognize the SSDs out of the box. Also I don’t see the he when I try to install Ubuntu.
Are there specific bios settings I would need to enable? Anyone with the same router that successfully had it running with an attached SSD?
Thank you in advance.
If you can connect a mechanical notebook drive and feel it spinning, at least you know the power part of the cable is probably good and it's the SATA part that doesn't seem to be working.Thank you for the answer!
Interesting, in my case I had to buy cables and found them in Ali express , but now I a doubting my purchase. Is there anyone on this king thread (which I tried to parse all) that has success with a cable they bought and could recommend a link?
If you have a spare SATA3 SSD, use that to determine if the performance is suitable. If it isn't, then look into getting a NVMe drive.I just ordered a CWWK (no SSD no RAM, N100, type C I think) minipc for use as a proxmox/Opnsense machine.
I am wondering if, from a heat perspective, would an SSD make more sense than a NVME? Since I imagine the HDD usage would be pretty low, maybe I'm just over thinking things. But my googling hasn't yielded any answers to this..
Posting this here for future reference, it turns out that, 1/ my cable for the SATA data+pwr was the right one, but 2/ the SATA port on this model (Topton N305 variation A) only activates if I remove the NVME 2230 to 2280 adapter. This means that I can only have a total of either 2 NVME 2280 ssd drives or 1 NVME SSD and 1 SATA drive. I was hoping to be able to use all the 3 of them. Also there seems to be no option in the BIOS to enable all of them at the same time.Not sure I have one of those drives around, but I might! I’ll look around.
i did not reach out to Topton after I read terrible experiences on this forum from people trying to connect with them.
truenas_admin@truenas[~]$ sudo dmesg | grep nvme
[ 0.000000] Command line: BOOT_IMAGE=/ROOT/24.10.0.2@/boot/vmlinuz-6.6.44-production+truenas root=ZFS=boot-pool/ROOT/24.10.0.2 ro libata.allow_tpm=1 amd_iommu=on iommu=pt kvm_amd.npt=1 kvm_amd.avic=1 intel_iommu=on zfsforce=1 nvme_core.multipath=N
[ 0.020309] Kernel command line: BOOT_IMAGE=/ROOT/24.10.0.2@/boot/vmlinuz-6.6.44-production+truenas root=ZFS=boot-pool/ROOT/24.10.0.2 ro libata.allow_tpm=1 amd_iommu=on iommu=pt kvm_amd.npt=1 kvm_amd.avic=1 intel_iommu=on zfsforce=1 nvme_core.multipath=N
[ 3.596977] nvme 0000:04:00.0: platform quirk: setting simple suspend
[ 3.596979] nvme 0000:05:00.0: platform quirk: setting simple suspend
[ 3.597152] nvme nvme0: pci function 0000:04:00.0
[ 3.597581] nvme nvme1: pci function 0000:05:00.0
[ 3.602281] nvme nvme0: Shutdown timeout set to 10 seconds
[ 3.602328] nvme nvme1: Shutdown timeout set to 10 seconds
[ 3.605981] nvme nvme1: 4/0/0 default/read/poll queues
[ 3.606233] nvme nvme0: 4/0/0 default/read/poll queues
[ 3.608396] nvme1n1: p1 p2 p3
truenas_admin@truenas[~]$ sudo journalctl -k
Jun 16 02:44:31 truenas kernel: microcode: updated early: 0x11 -> 0x17, date = 2023-12-07
Jun 16 02:44:31 truenas kernel: Linux version 6.6.44-production+truenas (root@tnsbuilds01.tn.ixsystems.net) (gcc (Debian 12.2.0-14) 12.2.0, GNU ld (GNU Binutils for Debian) 2.40) #1 SMP PREEMPT_DYNAMIC Fri Nov 8 18:37:36 UTC 2024
Jun 16 02:44:31 truenas kernel: Command line: BOOT_IMAGE=/ROOT/24.10.0.2@/boot/vmlinuz-6.6.44-production+truenas root=ZFS=boot-pool/ROOT/24.10.0.2 ro libata.allow_tpm=1 amd_iommu=on iommu=pt kvm_amd.npt=1 kvm_amd.avic=1 intel_iommu=on zfsforce=1 nvme_core.multipath=N
Jun 16 02:44:31 truenas kernel: x86/split lock detection: #AC: crashing the kernel on kernel split_locks and warning on user-space split_locks
Jun 16 02:44:31 truenas kernel: BIOS-provided physical RAM map:
Jun 16 02:44:31 truenas kernel: BIOS-e820: [mem 0x0000000000000000-0x000000000009dfff] usable
Jun 16 02:44:31 truenas kernel: BIOS-e820: [mem 0x000000000009e000-0x000000000009efff] reserved
Jun 16 02:44:31 truenas kernel: BIOS-e820: [mem 0x000000000009f000-0x000000000009ffff] usable
Jun 16 02:44:31 truenas kernel: BIOS-e820: [mem 0x00000000000a0000-0x00000000000fffff] reserved
Jun 16 02:44:31 truenas kernel: BIOS-e820: [mem 0x0000000000100000-0x000000006fe52fff] usable
Jun 16 02:44:31 truenas kernel: BIOS-e820: [mem 0x000000006fe53000-0x0000000072f52fff] reserved
Jun 16 02:44:31 truenas kernel: BIOS-e820: [mem 0x0000000072f53000-0x0000000073034fff] ACPI data
Jun 16 02:44:31 truenas kernel: BIOS-e820: [mem 0x0000000073035000-0x00000000730f4fff] ACPI NVS
Jun 16 02:44:31 truenas kernel: BIOS-e820: [mem 0x00000000730f5000-0x0000000073e65fff] reserved
Jun 16 02:44:31 truenas kernel: BIOS-e820: [mem 0x0000000073e66000-0x0000000073efefff] type 20
Jun 16 02:44:31 truenas kernel: BIOS-e820: [mem 0x0000000073eff000-0x0000000073efffff] usable
Jun 16 02:44:31 truenas kernel: BIOS-e820: [mem 0x0000000073f00000-0x0000000079ffffff] reserved
Jun 16 02:44:31 truenas kernel: BIOS-e820: [mem 0x000000007aa00000-0x000000007abfffff] reserved
Jun 16 02:44:31 truenas kernel: BIOS-e820: [mem 0x000000007b000000-0x00000000803fffff] reserved
Jun 16 02:44:31 truenas kernel: BIOS-e820: [mem 0x00000000c0000000-0x00000000cfffffff] reserved
Jun 16 02:44:31 truenas kernel: BIOS-e820: [mem 0x00000000fe000000-0x00000000fe010fff] reserved
Jun 16 02:44:31 truenas kernel: BIOS-e820: [mem 0x00000000fec00000-0x00000000fec00fff] reserved
Jun 16 02:44:31 truenas kernel: BIOS-e820: [mem 0x00000000fed00000-0x00000000fed00fff] reserved
Jun 16 02:44:31 truenas kernel: BIOS-e820: [mem 0x00000000fed20000-0x00000000fed7ffff] reserved
Jun 16 02:44:31 truenas kernel: BIOS-e820: [mem 0x00000000fee00000-0x00000000fee00fff] reserved
Jun 16 02:44:31 truenas kernel: BIOS-e820: [mem 0x00000000ff000000-0x00000000ffffffff] reserved
Jun 16 02:44:31 truenas kernel: BIOS-e820: [mem 0x0000000100000000-0x000000087fbfffff] usable
Jun 16 02:44:31 truenas kernel: NX (Execute Disable) protection: active
Jun 16 02:44:31 truenas kernel: APIC: Static calls initialized
Jun 16 02:44:31 truenas kernel: efi: EFI v2.8 by American Megatrends
Jun 16 02:44:31 truenas kernel: efi: ACPI=0x730d1000 ACPI 2.0=0x730d1014 TPMFinalLog=0x730a0000 SMBIOS=0x73bb0000 SMBIOS 3.0=0x73baf000 MEMATTR=0x6cdc1018 ESRT=0x6ded4898 MOKvar=0x73b83000
Jun 16 02:44:31 truenas kernel: efi: Remove mem78: MMIO range=[0xc0000000-0xcfffffff] (256MB) from e820 map
Jun 16 02:44:31 truenas kernel: e820: remove [mem 0xc0000000-0xcfffffff] reserved
Jun 16 02:44:31 truenas kernel: efi: Not removing mem79: MMIO range=[0xfe000000-0xfe010fff] (68KB) from e820 map
Jun 16 02:44:31 truenas kernel: efi: Not removing mem80: MMIO range=[0xfec00000-0xfec00fff] (4KB) from e820 map
Jun 16 02:44:31 truenas kernel: efi: Not removing mem81: MMIO range=[0xfed00000-0xfed00fff] (4KB) from e820 map
Jun 16 02:44:31 truenas kernel: efi: Not removing mem83: MMIO range=[0xfee00000-0xfee00fff] (4KB) from e820 map
Jun 16 02:44:31 truenas kernel: efi: Remove mem84: MMIO range=[0xff000000-0xffffffff] (16MB) from e820 map
Jun 16 02:44:31 truenas kernel: e820: remove [mem 0xff000000-0xffffffff] reserved
Jun 16 02:44:31 truenas kernel: SMBIOS 3.6.0 present.
Jun 16 02:44:31 truenas kernel: DMI: Default string Default string/Default string, BIOS 5.27 06/26/2024
Jun 16 02:44:31 truenas kernel: tsc: Detected 800.000 MHz processor
Jun 16 02:44:31 truenas kernel: tsc: Detected 806.400 MHz TSC
Jun 16 02:44:31 truenas kernel: e820: update [mem 0x00000000-0x00000fff] usable ==> reserved
Jun 16 02:44:31 truenas kernel: e820: remove [mem 0x000a0000-0x000fffff] usable
Jun 16 02:44:31 truenas kernel: last_pfn = 0x87fc00 max_arch_pfn = 0x400000000
Jun 16 02:44:31 truenas kernel: MTRR map: 6 entries (3 fixed + 3 variable; max 23), built from 10 variable MTRRs
Jun 16 02:44:31 truenas kernel: x86/PAT: Configuration [0-7]: WB WC UC- UC WB WP UC- WT
Jun 16 02:44:31 truenas kernel: last_pfn = 0x73f00 max_arch_pfn = 0x400000000
Jun 16 02:44:31 truenas kernel: esrt: Reserving ESRT space from 0x000000006ded4898 to 0x000000006ded4948.
Jun 16 02:44:31 truenas kernel: e820: update [mem 0x6ded4000-0x6ded4fff] usable ==> reserved
Jun 16 02:44:31 truenas kernel: Using GB pages for direct mapping
Jun 16 02:44:31 truenas kernel: Secure boot disabled
Jun 16 02:44:31 truenas kernel: RAMDISK: [mem 0x2f1bb000-0x338d4fff]
Jun 16 02:44:31 truenas kernel: ACPI: Early table checksum verification disabled
Jun 16 02:44:31 truenas kernel: ACPI: RSDP 0x00000000730D1014 000024 (v02 ALASKA)
Jun 16 02:44:31 truenas kernel: ACPI: XSDT 0x00000000730D0728 00010C (v01 ALASKA A M I 01072009 AMI 01000013)
Jun 16 02:44:31 truenas kernel: ACPI: FACP 0x0000000073032000 000114 (v06 ALASKA A M I 01072009 AMI 01000013)
Jun 16 02:44:31 truenas kernel: ACPI: DSDT 0x0000000072FBC000 0752B7 (v02 ALASKA A M I 01072009 INTL 20200717)
Jun 16 02:44:31 truenas kernel: ACPI: FACS 0x00000000730F4000 000040
Jun 16 02:44:31 truenas kernel: ACPI: FIDT 0x0000000072FBB000 00009C (v01 ALASKA A M I 01072009 AMI 00010013)
Jun 16 02:44:31 truenas kernel: ACPI: SSDT 0x0000000073034000 00038C (v02 PmaxDv Pmax_Dev 00000001 INTL 20200717)
Jun 16 02:44:31 truenas kernel: ACPI: SSDT 0x0000000072FB5000 005D0B (v02 CpuRef CpuSsdt 00003000 INTL 20200717)
Jun 16 02:44:31 truenas kernel: ACPI: SSDT 0x0000000072FB2000 002935 (v02 SaSsdt SaSsdt 00003000 INTL 20200717)
Jun 16 02:44:31 truenas kernel: ACPI: SSDT 0x0000000072FAE000 0033D3 (v02 INTEL IgfxSsdt 00003000 INTL 20200717)
Jun 16 02:44:31 truenas kernel: ACPI: SSDT 0x0000000072FA0000 00D487 (v02 INTEL TcssSsdt 00001000 INTL 20200717)
Jun 16 02:44:31 truenas kernel: ACPI: HPET 0x0000000073033000 000038 (v01 ALASKA A M I 01072009 AMI 01000013)
Jun 16 02:44:31 truenas kernel: ACPI: APIC 0x0000000072F9F000 0001DC (v05 ALASKA A M I 01072009 AMI 01000013)
Jun 16 02:44:31 truenas kernel: ACPI: MCFG 0x0000000072F9E000 00003C (v01 ALASKA A M I 01072009 AMI 01000013)
Jun 16 02:44:31 truenas kernel: ACPI: SSDT 0x0000000072F9C000 001F1A (v02 ALASKA Ther_Rvp 00001000 INTL 20200717)
Jun 16 02:44:31 truenas kernel: ACPI: UEFI 0x0000000073087000 000048 (v01 ALASKA A M I 01072009 AMI 01000013)
Dec 8 10:03:44 truenas kernel: Command line: BOOT_IMAGE=/ROOT/24.10.0.2@/boot/vmlinuz-6.6.44-production+truenas root=ZFS=boot-pool/ROOT/24.10.0.2 ro libata.allow_tpm=1 amd_iommu=on iommu=pt kvm_amd.npt=1 kvm_amd.avic=1 intel_iommu=on zfsforce=1 nvme_core.multipath=N
Dec 8 10:03:44 truenas kernel: Kernel command line: BOOT_IMAGE=/ROOT/24.10.0.2@/boot/vmlinuz-6.6.44-production+truenas root=ZFS=boot-pool/ROOT/24.10.0.2 ro libata.allow_tpm=1 amd_iommu=on iommu=pt kvm_amd.npt=1 kvm_amd.avic=1 intel_iommu=on zfsforce=1 nvme_core.multipath=N
Dec 8 10:03:44 truenas kernel: nvme 0000:02:00.0: platform quirk: setting simple suspend
Dec 8 10:03:44 truenas kernel: nvme 0000:01:00.0: platform quirk: setting simple suspend
Dec 8 10:03:44 truenas kernel: nvme 0000:03:00.0: platform quirk: setting simple suspend
Dec 8 10:03:44 truenas kernel: nvme 0000:04:00.0: platform quirk: setting simple suspend
Dec 8 10:03:44 truenas kernel: nvme nvme1: pci function 0000:02:00.0
Dec 8 10:03:44 truenas kernel: nvme 0000:05:00.0: platform quirk: setting simple suspend
Dec 8 10:03:44 truenas kernel: nvme nvme3: pci function 0000:04:00.0
Dec 8 10:03:44 truenas kernel: nvme nvme0: pci function 0000:01:00.0
Dec 8 10:03:44 truenas kernel: nvme nvme4: pci function 0000:05:00.0
Dec 8 10:03:44 truenas kernel: nvme nvme2: pci function 0000:03:00.0
Dec 8 10:03:44 truenas kernel: nvme nvme2: Shutdown timeout set to 10 seconds
Dec 8 10:03:44 truenas kernel: nvme nvme0: Shutdown timeout set to 10 seconds
Dec 8 10:03:44 truenas kernel: nvme nvme3: Shutdown timeout set to 10 seconds
Dec 8 10:03:44 truenas kernel: nvme nvme1: Shutdown timeout set to 10 seconds
Dec 8 10:03:44 truenas kernel: nvme nvme4: Shutdown timeout set to 10 seconds
Dec 8 10:03:44 truenas kernel: nvme nvme3: 4/0/0 default/read/poll queues
Dec 8 10:03:44 truenas kernel: nvme nvme1: 4/0/0 default/read/poll queues
Dec 8 10:03:44 truenas kernel: nvme nvme2: 4/0/0 default/read/poll queues
Dec 8 10:03:44 truenas kernel: nvme nvme4: 4/0/0 default/read/poll queues
Dec 8 10:03:44 truenas kernel: nvme nvme0: 4/0/0 default/read/poll queues
Dec 8 10:03:44 truenas kernel: nvme3n1: p1
Dec 8 10:03:44 truenas kernel: nvme2n1: p1
Dec 8 10:03:44 truenas kernel: nvme1n1: p1
Dec 8 10:03:44 truenas kernel: nvme4n1: p1 p2 p3
Dec 8 10:03:44 truenas kernel: nvme0n1: p1
Dec 8 10:07:32 truenas systemd[1]: nvmefc-boot-connections.service - Auto-connect to subsystems on FC-NVME devices found during boot was skipped because of an unmet condition check (ConditionPathExists=/sys/class/fc/fc_udev_device/nvme_discovery).
Dec 8 10:07:32 truenas systemd[1]: Starting nvmf-autoconnect.service - Connect NVMe-oF subsystems automatically during boot...
Dec 8 10:07:32 truenas smartd[2842]: Device: /dev/nvme0n1, opened
Dec 8 10:07:32 truenas smartd[2842]: Device: /dev/nvme0n1, KINGSTON SKC3000D2048G, S/N:50026B7686EB36EF, FW:EIFK31.7, NSID:1, 2.04 TB
Dec 8 10:07:32 truenas smartd[2842]: Device: /dev/nvme0n1, is SMART capable. Adding to "monitor" list.
Dec 8 10:07:32 truenas smartd[2842]: Device: /dev/nvme1n1, opened
Dec 8 10:07:32 truenas smartd[2842]: Device: /dev/nvme1n1, KINGSTON SKC3000D2048G, S/N:50026B7686FBD3BB, FW:EIFK51.2, NSID:1, 2.04 TB
Dec 8 10:07:32 truenas smartd[2842]: Device: /dev/nvme1n1, is SMART capable. Adding to "monitor" list.
Dec 8 10:07:32 truenas smartd[2842]: Device: /dev/nvme2n1, opened
Dec 8 10:07:32 truenas smartd[2842]: Device: /dev/nvme2n1, KINGSTON SKC3000D2048G, S/N:50026B7686EB2B94, FW:EIFK31.7, NSID:1, 2.04 TB
Dec 8 10:07:32 truenas smartd[2842]: Device: /dev/nvme2n1, is SMART capable. Adding to "monitor" list.
Dec 8 10:07:32 truenas smartd[2842]: Device: /dev/nvme3n1, opened
Dec 8 10:07:32 truenas smartd[2842]: Device: /dev/nvme3n1, KINGSTON SKC3000D2048G, S/N:50026B7686FBD264, FW:EIFK51.2, NSID:1, 2.04 TB
Dec 8 10:07:32 truenas smartd[2842]: Device: /dev/nvme3n1, is SMART capable. Adding to "monitor" list.
Dec 8 10:07:32 truenas smartd[2842]: Device: /dev/nvme4n1, opened
Dec 8 10:07:32 truenas smartd[2842]: Device: /dev/nvme4n1, KINGSTON SKC3000S512G, S/N:50026B7686DFB7FF, FW:EIFK51.2, NSID:1, 512 GB
Dec 8 10:07:32 truenas smartd[2842]: Device: /dev/nvme4n1, is SMART capable. Adding to "monitor" list.
Dec 8 10:07:32 truenas smartd[2842]: Monitoring 0 ATA/SATA, 0 SCSI/SAS and 5 NVMe devices
Dec 8 10:07:32 truenas systemd[1]: Finished nvmf-autoconnect.service - Connect NVMe-oF subsystems automatically during boot.
Dec 8 10:07:32 truenas smartd[2842]: Device: /dev/nvme0n1, state written to /var/lib/smartmontools/smartd.KINGSTON_SKC3000D2048G-50026B7686EB36EF-n1.nvme.state
Dec 8 10:07:32 truenas smartd[2842]: Device: /dev/nvme1n1, state written to /var/lib/smartmontools/smartd.KINGSTON_SKC3000D2048G-50026B7686FBD3BB-n1.nvme.state
Dec 8 10:07:32 truenas smartd[2842]: Device: /dev/nvme2n1, state written to /var/lib/smartmontools/smartd.KINGSTON_SKC3000D2048G-50026B7686EB2B94-n1.nvme.state
Dec 8 10:07:32 truenas smartd[2842]: Device: /dev/nvme3n1, state written to /var/lib/smartmontools/smartd.KINGSTON_SKC3000D2048G-50026B7686FBD264-n1.nvme.state
Dec 8 10:07:32 truenas smartd[2842]: Device: /dev/nvme4n1, state written to /var/lib/smartmontools/smartd.KINGSTON_SKC3000S512G-50026B7686DFB7FF-n1.nvme.state
Dec 8 10:07:32 truenas sensors[2861]: nvme-pci-0100
Dec 8 10:07:32 truenas sensors[2861]: nvme-pci-0200
Dec 8 10:07:32 truenas sensors[2861]: nvme-pci-0300
Dec 8 10:07:32 truenas sensors[2861]: nvme-pci-0500
Dec 8 10:07:32 truenas sensors[2861]: nvme-pci-0400
Dec 8 10:08:53 truenas kernel: nvme nvme1: controller is down; will reset: CSTS=0xffffffff, PCI_STATUS=0xffff
Dec 8 10:08:53 truenas kernel: nvme nvme0: controller is down; will reset: CSTS=0xffffffff, PCI_STATUS=0xffff
Dec 8 10:08:53 truenas kernel: nvme nvme0: Does your device have a faulty power saving mode enabled?
Dec 8 10:08:53 truenas kernel: nvme nvme0: Try "nvme_core.default_ps_max_latency_us=0 pcie_aspm=off pcie_port_pm=off" and report a bug
Dec 8 10:08:53 truenas kernel: nvme nvme1: Does your device have a faulty power saving mode enabled?
Dec 8 10:08:53 truenas kernel: nvme nvme1: Try "nvme_core.default_ps_max_latency_us=0 pcie_aspm=off pcie_port_pm=off" and report a bug
Dec 8 10:08:53 truenas kernel: nvme nvme2: controller is down; will reset: CSTS=0xffffffff, PCI_STATUS=0xffff
Dec 8 10:08:53 truenas kernel: nvme nvme2: Does your device have a faulty power saving mode enabled?
Dec 8 10:08:53 truenas kernel: nvme nvme2: Try "nvme_core.default_ps_max_latency_us=0 pcie_aspm=off pcie_port_pm=off" and report a bug
Dec 8 10:08:53 truenas kernel: nvme 0000:02:00.0: Unable to change power state from D3cold to D0, device inaccessible
Dec 8 10:08:53 truenas kernel: nvme 0000:03:00.0: Unable to change power state from D3cold to D0, device inaccessible
Dec 8 10:08:53 truenas kernel: nvme nvme1: Disabling device after reset failure: -19
Dec 8 10:08:53 truenas kernel: nvme nvme2: Disabling device after reset failure: -19
Dec 8 10:08:53 truenas kernel: nvme 0000:01:00.0: Unable to change power state from D3cold to D0, device inaccessible
Dec 8 10:08:53 truenas kernel: nvme nvme0: Disabling device after reset failure: -19
Dec 8 10:09:44 truenas smartd[2842]: Device: /dev/nvme0n1, state written to /var/lib/smartmontools/smartd.KINGSTON_SKC3000D2048G-50026B7686EB36EF-n1.nvme.state
Dec 8 10:09:44 truenas smartd[2842]: Device: /dev/nvme1n1, state written to /var/lib/smartmontools/smartd.KINGSTON_SKC3000D2048G-50026B7686FBD3BB-n1.nvme.state
Dec 8 10:09:44 truenas smartd[2842]: Device: /dev/nvme2n1, state written to /var/lib/smartmontools/smartd.KINGSTON_SKC3000D2048G-50026B7686EB2B94-n1.nvme.state
Dec 8 10:09:44 truenas smartd[2842]: Device: /dev/nvme3n1, state written to /var/lib/smartmontools/smartd.KINGSTON_SKC3000D2048G-50026B7686FBD264-n1.nvme.state
Dec 8 10:09:44 truenas smartd[2842]: Device: /dev/nvme4n1, state written to /var/lib/smartmontools/smartd.KINGSTON_SKC3000S512G-50026B7686DFB7FF-n1.nvme.state
Dec 8 10:09:44 truenas smartd[6041]: Device: /dev/nvme0n1, opened
Dec 8 10:09:44 truenas smartd[6041]: Device: /dev/nvme0n1, NVMe Identify Controller failed
Dec 8 10:09:44 truenas smartd[6041]: Unable to register NVMe device /dev/nvme0n1 at line 1 of file /etc/smartd.conf
Dec 8 10:09:44 truenas smartd[6041]: Device: /dev/nvme0n1, not available
Dec 8 10:09:44 truenas smartd[6041]: Device: /dev/nvme1n1, opened
Dec 8 10:09:44 truenas smartd[6041]: Device: /dev/nvme1n1, NVMe Identify Controller failed
Dec 8 10:09:44 truenas smartd[6041]: Unable to register NVMe device /dev/nvme1n1 at line 2 of file /etc/smartd.conf
Dec 8 10:09:44 truenas smartd[6041]: Device: /dev/nvme1n1, not available
Dec 8 10:09:44 truenas smartd[6041]: Device: /dev/nvme2n1, opened
Dec 8 10:09:44 truenas smartd[6041]: Device: /dev/nvme2n1, NVMe Identify Controller failed
Dec 8 10:09:44 truenas smartd[6041]: Unable to register NVMe device /dev/nvme2n1 at line 3 of file /etc/smartd.conf
Dec 8 10:09:44 truenas smartd[6041]: Device: /dev/nvme2n1, not available
Dec 8 10:09:44 truenas smartd[6041]: Device: /dev/nvme3n1, opened
Dec 8 10:09:44 truenas smartd[6041]: Device: /dev/nvme3n1, KINGSTON SKC3000D2048G, S/N:50026B7686FBD264, FW:EIFK51.2, NSID:1, 2.04 TB
Dec 8 10:09:44 truenas smartd[6041]: Device: /dev/nvme3n1, is SMART capable. Adding to "monitor" list.
Dec 8 10:09:44 truenas smartd[6041]: Device: /dev/nvme3n1, state read from /var/lib/smartmontools/smartd.KINGSTON_SKC3000D2048G-50026B7686FBD264-n1.nvme.state
Dec 8 10:09:44 truenas smartd[6041]: Device: /dev/nvme4n1, opened
Dec 8 10:09:44 truenas smartd[6041]: Device: /dev/nvme4n1, KINGSTON SKC3000S512G, S/N:50026B7686DFB7FF, FW:EIFK51.2, NSID:1, 512 GB
Dec 8 10:09:44 truenas smartd[6041]: Device: /dev/nvme4n1, is SMART capable. Adding to "monitor" list.
Dec 8 10:09:44 truenas smartd[6041]: Device: /dev/nvme4n1, state read from /var/lib/smartmontools/smartd.KINGSTON_SKC3000S512G-50026B7686DFB7FF-n1.nvme.state
Dec 8 10:09:44 truenas smartd[6041]: Monitoring 0 ATA/SATA, 0 SCSI/SAS and 2 NVMe devices
Dec 8 10:09:44 truenas smartd[6041]: Device: /dev/nvme3n1, NVMe error count increased from 76 to 80 (0 new, 4 ignored, 0 unknown)
Dec 8 10:09:44 truenas smartd[6041]: Device: /dev/nvme4n1, NVMe error count increased from 86 to 90 (0 new, 4 ignored, 0 unknown)
Dec 8 10:09:44 truenas smartd[6041]: Device: /dev/nvme3n1, state written to /var/lib/smartmontools/smartd.KINGSTON_SKC3000D2048G-50026B7686FBD264-n1.nvme.state
Dec 8 10:09:44 truenas smartd[6041]: Device: /dev/nvme4n1, state written to /var/lib/smartmontools/smartd.KINGSTON_SKC3000S512G-50026B7686DFB7FF-n1.nvme.state
Dec 8 10:10:33 truenas smartd[6041]: Device: /dev/nvme3n1, state written to /var/lib/smartmontools/smartd.KINGSTON_SKC3000D2048G-50026B7686FBD264-n1.nvme.state
Dec 8 10:10:33 truenas smartd[6041]: Device: /dev/nvme4n1, state written to /var/lib/smartmontools/smartd.KINGSTON_SKC3000S512G-50026B7686DFB7FF-n1.nvme.state
midclt call system.advanced.update '{"kernel_extra_options": "nvme_core.default_ps_max_latency_us=0 pcie_aspm=off pcie_port_pm=off"}'
In my case I believe that updating the firmware of each of my drives or trying to press the board enough so it fits correctly so far has solved for me. But I'm scared of opening the device again and the disconnections returning.I have this N100 X86 P5 development board with 4*M2 NVMe expansion board: Link to CWWK product page
I'm having issues with the NVME expansion board. Sometimes 2-3 out of 4 attached NVME drives are not be detected. Reboot / attaching drives (same positions) and expansion board again helped for a while, but now only one of expansion board nvme drives is recognized. At one point I was able to create pool from 4 x NVMe drives in truenas.
BIOS:
RAM:
- project version: cw-adlntbx-1c2l
- build date: 06/26/2024
NVME drives:
- Kingston KC3000 2TB x 4
- Kingston KC3000 512GB ( in 2230 A+E M.2 slot with adapter board )
I have thought this as a software issue, but now I am starting to think that this might be hardware related. Maybe the expansion board's 2-pin power cable could be the issue? @rogervn did you find solution to your nvme drops?
dmesg from truenas:
journalctl -kCode:truenas_admin@truenas[~]$ sudo dmesg | grep nvme [ 0.000000] Command line: BOOT_IMAGE=/ROOT/24.10.0.2@/boot/vmlinuz-6.6.44-production+truenas root=ZFS=boot-pool/ROOT/24.10.0.2 ro libata.allow_tpm=1 amd_iommu=on iommu=pt kvm_amd.npt=1 kvm_amd.avic=1 intel_iommu=on zfsforce=1 nvme_core.multipath=N [ 0.020309] Kernel command line: BOOT_IMAGE=/ROOT/24.10.0.2@/boot/vmlinuz-6.6.44-production+truenas root=ZFS=boot-pool/ROOT/24.10.0.2 ro libata.allow_tpm=1 amd_iommu=on iommu=pt kvm_amd.npt=1 kvm_amd.avic=1 intel_iommu=on zfsforce=1 nvme_core.multipath=N [ 3.596977] nvme 0000:04:00.0: platform quirk: setting simple suspend [ 3.596979] nvme 0000:05:00.0: platform quirk: setting simple suspend [ 3.597152] nvme nvme0: pci function 0000:04:00.0 [ 3.597581] nvme nvme1: pci function 0000:05:00.0 [ 3.602281] nvme nvme0: Shutdown timeout set to 10 seconds [ 3.602328] nvme nvme1: Shutdown timeout set to 10 seconds [ 3.605981] nvme nvme1: 4/0/0 default/read/poll queues [ 3.606233] nvme nvme0: 4/0/0 default/read/poll queues [ 3.608396] nvme1n1: p1 p2 p3
UPDATE: went through syslog, had the same suggestion as @rogervnCode:truenas_admin@truenas[~]$ sudo journalctl -k Jun 16 02:44:31 truenas kernel: microcode: updated early: 0x11 -> 0x17, date = 2023-12-07 Jun 16 02:44:31 truenas kernel: Linux version 6.6.44-production+truenas (root@tnsbuilds01.tn.ixsystems.net) (gcc (Debian 12.2.0-14) 12.2.0, GNU ld (GNU Binutils for Debian) 2.40) #1 SMP PREEMPT_DYNAMIC Fri Nov 8 18:37:36 UTC 2024 Jun 16 02:44:31 truenas kernel: Command line: BOOT_IMAGE=/ROOT/24.10.0.2@/boot/vmlinuz-6.6.44-production+truenas root=ZFS=boot-pool/ROOT/24.10.0.2 ro libata.allow_tpm=1 amd_iommu=on iommu=pt kvm_amd.npt=1 kvm_amd.avic=1 intel_iommu=on zfsforce=1 nvme_core.multipath=N Jun 16 02:44:31 truenas kernel: x86/split lock detection: #AC: crashing the kernel on kernel split_locks and warning on user-space split_locks Jun 16 02:44:31 truenas kernel: BIOS-provided physical RAM map: Jun 16 02:44:31 truenas kernel: BIOS-e820: [mem 0x0000000000000000-0x000000000009dfff] usable Jun 16 02:44:31 truenas kernel: BIOS-e820: [mem 0x000000000009e000-0x000000000009efff] reserved Jun 16 02:44:31 truenas kernel: BIOS-e820: [mem 0x000000000009f000-0x000000000009ffff] usable Jun 16 02:44:31 truenas kernel: BIOS-e820: [mem 0x00000000000a0000-0x00000000000fffff] reserved Jun 16 02:44:31 truenas kernel: BIOS-e820: [mem 0x0000000000100000-0x000000006fe52fff] usable Jun 16 02:44:31 truenas kernel: BIOS-e820: [mem 0x000000006fe53000-0x0000000072f52fff] reserved Jun 16 02:44:31 truenas kernel: BIOS-e820: [mem 0x0000000072f53000-0x0000000073034fff] ACPI data Jun 16 02:44:31 truenas kernel: BIOS-e820: [mem 0x0000000073035000-0x00000000730f4fff] ACPI NVS Jun 16 02:44:31 truenas kernel: BIOS-e820: [mem 0x00000000730f5000-0x0000000073e65fff] reserved Jun 16 02:44:31 truenas kernel: BIOS-e820: [mem 0x0000000073e66000-0x0000000073efefff] type 20 Jun 16 02:44:31 truenas kernel: BIOS-e820: [mem 0x0000000073eff000-0x0000000073efffff] usable Jun 16 02:44:31 truenas kernel: BIOS-e820: [mem 0x0000000073f00000-0x0000000079ffffff] reserved Jun 16 02:44:31 truenas kernel: BIOS-e820: [mem 0x000000007aa00000-0x000000007abfffff] reserved Jun 16 02:44:31 truenas kernel: BIOS-e820: [mem 0x000000007b000000-0x00000000803fffff] reserved Jun 16 02:44:31 truenas kernel: BIOS-e820: [mem 0x00000000c0000000-0x00000000cfffffff] reserved Jun 16 02:44:31 truenas kernel: BIOS-e820: [mem 0x00000000fe000000-0x00000000fe010fff] reserved Jun 16 02:44:31 truenas kernel: BIOS-e820: [mem 0x00000000fec00000-0x00000000fec00fff] reserved Jun 16 02:44:31 truenas kernel: BIOS-e820: [mem 0x00000000fed00000-0x00000000fed00fff] reserved Jun 16 02:44:31 truenas kernel: BIOS-e820: [mem 0x00000000fed20000-0x00000000fed7ffff] reserved Jun 16 02:44:31 truenas kernel: BIOS-e820: [mem 0x00000000fee00000-0x00000000fee00fff] reserved Jun 16 02:44:31 truenas kernel: BIOS-e820: [mem 0x00000000ff000000-0x00000000ffffffff] reserved Jun 16 02:44:31 truenas kernel: BIOS-e820: [mem 0x0000000100000000-0x000000087fbfffff] usable Jun 16 02:44:31 truenas kernel: NX (Execute Disable) protection: active Jun 16 02:44:31 truenas kernel: APIC: Static calls initialized Jun 16 02:44:31 truenas kernel: efi: EFI v2.8 by American Megatrends Jun 16 02:44:31 truenas kernel: efi: ACPI=0x730d1000 ACPI 2.0=0x730d1014 TPMFinalLog=0x730a0000 SMBIOS=0x73bb0000 SMBIOS 3.0=0x73baf000 MEMATTR=0x6cdc1018 ESRT=0x6ded4898 MOKvar=0x73b83000 Jun 16 02:44:31 truenas kernel: efi: Remove mem78: MMIO range=[0xc0000000-0xcfffffff] (256MB) from e820 map Jun 16 02:44:31 truenas kernel: e820: remove [mem 0xc0000000-0xcfffffff] reserved Jun 16 02:44:31 truenas kernel: efi: Not removing mem79: MMIO range=[0xfe000000-0xfe010fff] (68KB) from e820 map Jun 16 02:44:31 truenas kernel: efi: Not removing mem80: MMIO range=[0xfec00000-0xfec00fff] (4KB) from e820 map Jun 16 02:44:31 truenas kernel: efi: Not removing mem81: MMIO range=[0xfed00000-0xfed00fff] (4KB) from e820 map Jun 16 02:44:31 truenas kernel: efi: Not removing mem83: MMIO range=[0xfee00000-0xfee00fff] (4KB) from e820 map Jun 16 02:44:31 truenas kernel: efi: Remove mem84: MMIO range=[0xff000000-0xffffffff] (16MB) from e820 map Jun 16 02:44:31 truenas kernel: e820: remove [mem 0xff000000-0xffffffff] reserved Jun 16 02:44:31 truenas kernel: SMBIOS 3.6.0 present. Jun 16 02:44:31 truenas kernel: DMI: Default string Default string/Default string, BIOS 5.27 06/26/2024 Jun 16 02:44:31 truenas kernel: tsc: Detected 800.000 MHz processor Jun 16 02:44:31 truenas kernel: tsc: Detected 806.400 MHz TSC Jun 16 02:44:31 truenas kernel: e820: update [mem 0x00000000-0x00000fff] usable ==> reserved Jun 16 02:44:31 truenas kernel: e820: remove [mem 0x000a0000-0x000fffff] usable Jun 16 02:44:31 truenas kernel: last_pfn = 0x87fc00 max_arch_pfn = 0x400000000 Jun 16 02:44:31 truenas kernel: MTRR map: 6 entries (3 fixed + 3 variable; max 23), built from 10 variable MTRRs Jun 16 02:44:31 truenas kernel: x86/PAT: Configuration [0-7]: WB WC UC- UC WB WP UC- WT Jun 16 02:44:31 truenas kernel: last_pfn = 0x73f00 max_arch_pfn = 0x400000000 Jun 16 02:44:31 truenas kernel: esrt: Reserving ESRT space from 0x000000006ded4898 to 0x000000006ded4948. Jun 16 02:44:31 truenas kernel: e820: update [mem 0x6ded4000-0x6ded4fff] usable ==> reserved Jun 16 02:44:31 truenas kernel: Using GB pages for direct mapping Jun 16 02:44:31 truenas kernel: Secure boot disabled Jun 16 02:44:31 truenas kernel: RAMDISK: [mem 0x2f1bb000-0x338d4fff] Jun 16 02:44:31 truenas kernel: ACPI: Early table checksum verification disabled Jun 16 02:44:31 truenas kernel: ACPI: RSDP 0x00000000730D1014 000024 (v02 ALASKA) Jun 16 02:44:31 truenas kernel: ACPI: XSDT 0x00000000730D0728 00010C (v01 ALASKA A M I 01072009 AMI 01000013) Jun 16 02:44:31 truenas kernel: ACPI: FACP 0x0000000073032000 000114 (v06 ALASKA A M I 01072009 AMI 01000013) Jun 16 02:44:31 truenas kernel: ACPI: DSDT 0x0000000072FBC000 0752B7 (v02 ALASKA A M I 01072009 INTL 20200717) Jun 16 02:44:31 truenas kernel: ACPI: FACS 0x00000000730F4000 000040 Jun 16 02:44:31 truenas kernel: ACPI: FIDT 0x0000000072FBB000 00009C (v01 ALASKA A M I 01072009 AMI 00010013) Jun 16 02:44:31 truenas kernel: ACPI: SSDT 0x0000000073034000 00038C (v02 PmaxDv Pmax_Dev 00000001 INTL 20200717) Jun 16 02:44:31 truenas kernel: ACPI: SSDT 0x0000000072FB5000 005D0B (v02 CpuRef CpuSsdt 00003000 INTL 20200717) Jun 16 02:44:31 truenas kernel: ACPI: SSDT 0x0000000072FB2000 002935 (v02 SaSsdt SaSsdt 00003000 INTL 20200717) Jun 16 02:44:31 truenas kernel: ACPI: SSDT 0x0000000072FAE000 0033D3 (v02 INTEL IgfxSsdt 00003000 INTL 20200717) Jun 16 02:44:31 truenas kernel: ACPI: SSDT 0x0000000072FA0000 00D487 (v02 INTEL TcssSsdt 00001000 INTL 20200717) Jun 16 02:44:31 truenas kernel: ACPI: HPET 0x0000000073033000 000038 (v01 ALASKA A M I 01072009 AMI 01000013) Jun 16 02:44:31 truenas kernel: ACPI: APIC 0x0000000072F9F000 0001DC (v05 ALASKA A M I 01072009 AMI 01000013) Jun 16 02:44:31 truenas kernel: ACPI: MCFG 0x0000000072F9E000 00003C (v01 ALASKA A M I 01072009 AMI 01000013) Jun 16 02:44:31 truenas kernel: ACPI: SSDT 0x0000000072F9C000 001F1A (v02 ALASKA Ther_Rvp 00001000 INTL 20200717) Jun 16 02:44:31 truenas kernel: ACPI: UEFI 0x0000000073087000 000048 (v01 ALASKA A M I 01072009 AMI 01000013)
("Try "nvme_core.default_ps_max_latency_us=0 pcie_aspm=off pcie_port_pm=off"") :
Tried to set :Code:Dec 8 10:03:44 truenas kernel: Command line: BOOT_IMAGE=/ROOT/24.10.0.2@/boot/vmlinuz-6.6.44-production+truenas root=ZFS=boot-pool/ROOT/24.10.0.2 ro libata.allow_tpm=1 amd_iommu=on iommu=pt kvm_amd.npt=1 kvm_amd.avic=1 intel_iommu=on zfsforce=1 nvme_core.multipath=N Dec 8 10:03:44 truenas kernel: Kernel command line: BOOT_IMAGE=/ROOT/24.10.0.2@/boot/vmlinuz-6.6.44-production+truenas root=ZFS=boot-pool/ROOT/24.10.0.2 ro libata.allow_tpm=1 amd_iommu=on iommu=pt kvm_amd.npt=1 kvm_amd.avic=1 intel_iommu=on zfsforce=1 nvme_core.multipath=N Dec 8 10:03:44 truenas kernel: nvme 0000:02:00.0: platform quirk: setting simple suspend Dec 8 10:03:44 truenas kernel: nvme 0000:01:00.0: platform quirk: setting simple suspend Dec 8 10:03:44 truenas kernel: nvme 0000:03:00.0: platform quirk: setting simple suspend Dec 8 10:03:44 truenas kernel: nvme 0000:04:00.0: platform quirk: setting simple suspend Dec 8 10:03:44 truenas kernel: nvme nvme1: pci function 0000:02:00.0 Dec 8 10:03:44 truenas kernel: nvme 0000:05:00.0: platform quirk: setting simple suspend Dec 8 10:03:44 truenas kernel: nvme nvme3: pci function 0000:04:00.0 Dec 8 10:03:44 truenas kernel: nvme nvme0: pci function 0000:01:00.0 Dec 8 10:03:44 truenas kernel: nvme nvme4: pci function 0000:05:00.0 Dec 8 10:03:44 truenas kernel: nvme nvme2: pci function 0000:03:00.0 Dec 8 10:03:44 truenas kernel: nvme nvme2: Shutdown timeout set to 10 seconds Dec 8 10:03:44 truenas kernel: nvme nvme0: Shutdown timeout set to 10 seconds Dec 8 10:03:44 truenas kernel: nvme nvme3: Shutdown timeout set to 10 seconds Dec 8 10:03:44 truenas kernel: nvme nvme1: Shutdown timeout set to 10 seconds Dec 8 10:03:44 truenas kernel: nvme nvme4: Shutdown timeout set to 10 seconds Dec 8 10:03:44 truenas kernel: nvme nvme3: 4/0/0 default/read/poll queues Dec 8 10:03:44 truenas kernel: nvme nvme1: 4/0/0 default/read/poll queues Dec 8 10:03:44 truenas kernel: nvme nvme2: 4/0/0 default/read/poll queues Dec 8 10:03:44 truenas kernel: nvme nvme4: 4/0/0 default/read/poll queues Dec 8 10:03:44 truenas kernel: nvme nvme0: 4/0/0 default/read/poll queues Dec 8 10:03:44 truenas kernel: nvme3n1: p1 Dec 8 10:03:44 truenas kernel: nvme2n1: p1 Dec 8 10:03:44 truenas kernel: nvme1n1: p1 Dec 8 10:03:44 truenas kernel: nvme4n1: p1 p2 p3 Dec 8 10:03:44 truenas kernel: nvme0n1: p1 Dec 8 10:07:32 truenas systemd[1]: nvmefc-boot-connections.service - Auto-connect to subsystems on FC-NVME devices found during boot was skipped because of an unmet condition check (ConditionPathExists=/sys/class/fc/fc_udev_device/nvme_discovery). Dec 8 10:07:32 truenas systemd[1]: Starting nvmf-autoconnect.service - Connect NVMe-oF subsystems automatically during boot... Dec 8 10:07:32 truenas smartd[2842]: Device: /dev/nvme0n1, opened Dec 8 10:07:32 truenas smartd[2842]: Device: /dev/nvme0n1, KINGSTON SKC3000D2048G, S/N:50026B7686EB36EF, FW:EIFK31.7, NSID:1, 2.04 TB Dec 8 10:07:32 truenas smartd[2842]: Device: /dev/nvme0n1, is SMART capable. Adding to "monitor" list. Dec 8 10:07:32 truenas smartd[2842]: Device: /dev/nvme1n1, opened Dec 8 10:07:32 truenas smartd[2842]: Device: /dev/nvme1n1, KINGSTON SKC3000D2048G, S/N:50026B7686FBD3BB, FW:EIFK51.2, NSID:1, 2.04 TB Dec 8 10:07:32 truenas smartd[2842]: Device: /dev/nvme1n1, is SMART capable. Adding to "monitor" list. Dec 8 10:07:32 truenas smartd[2842]: Device: /dev/nvme2n1, opened Dec 8 10:07:32 truenas smartd[2842]: Device: /dev/nvme2n1, KINGSTON SKC3000D2048G, S/N:50026B7686EB2B94, FW:EIFK31.7, NSID:1, 2.04 TB Dec 8 10:07:32 truenas smartd[2842]: Device: /dev/nvme2n1, is SMART capable. Adding to "monitor" list. Dec 8 10:07:32 truenas smartd[2842]: Device: /dev/nvme3n1, opened Dec 8 10:07:32 truenas smartd[2842]: Device: /dev/nvme3n1, KINGSTON SKC3000D2048G, S/N:50026B7686FBD264, FW:EIFK51.2, NSID:1, 2.04 TB Dec 8 10:07:32 truenas smartd[2842]: Device: /dev/nvme3n1, is SMART capable. Adding to "monitor" list. Dec 8 10:07:32 truenas smartd[2842]: Device: /dev/nvme4n1, opened Dec 8 10:07:32 truenas smartd[2842]: Device: /dev/nvme4n1, KINGSTON SKC3000S512G, S/N:50026B7686DFB7FF, FW:EIFK51.2, NSID:1, 512 GB Dec 8 10:07:32 truenas smartd[2842]: Device: /dev/nvme4n1, is SMART capable. Adding to "monitor" list. Dec 8 10:07:32 truenas smartd[2842]: Monitoring 0 ATA/SATA, 0 SCSI/SAS and 5 NVMe devices Dec 8 10:07:32 truenas systemd[1]: Finished nvmf-autoconnect.service - Connect NVMe-oF subsystems automatically during boot. Dec 8 10:07:32 truenas smartd[2842]: Device: /dev/nvme0n1, state written to /var/lib/smartmontools/smartd.KINGSTON_SKC3000D2048G-50026B7686EB36EF-n1.nvme.state Dec 8 10:07:32 truenas smartd[2842]: Device: /dev/nvme1n1, state written to /var/lib/smartmontools/smartd.KINGSTON_SKC3000D2048G-50026B7686FBD3BB-n1.nvme.state Dec 8 10:07:32 truenas smartd[2842]: Device: /dev/nvme2n1, state written to /var/lib/smartmontools/smartd.KINGSTON_SKC3000D2048G-50026B7686EB2B94-n1.nvme.state Dec 8 10:07:32 truenas smartd[2842]: Device: /dev/nvme3n1, state written to /var/lib/smartmontools/smartd.KINGSTON_SKC3000D2048G-50026B7686FBD264-n1.nvme.state Dec 8 10:07:32 truenas smartd[2842]: Device: /dev/nvme4n1, state written to /var/lib/smartmontools/smartd.KINGSTON_SKC3000S512G-50026B7686DFB7FF-n1.nvme.state Dec 8 10:07:32 truenas sensors[2861]: nvme-pci-0100 Dec 8 10:07:32 truenas sensors[2861]: nvme-pci-0200 Dec 8 10:07:32 truenas sensors[2861]: nvme-pci-0300 Dec 8 10:07:32 truenas sensors[2861]: nvme-pci-0500 Dec 8 10:07:32 truenas sensors[2861]: nvme-pci-0400 Dec 8 10:08:53 truenas kernel: nvme nvme1: controller is down; will reset: CSTS=0xffffffff, PCI_STATUS=0xffff Dec 8 10:08:53 truenas kernel: nvme nvme0: controller is down; will reset: CSTS=0xffffffff, PCI_STATUS=0xffff Dec 8 10:08:53 truenas kernel: nvme nvme0: Does your device have a faulty power saving mode enabled? Dec 8 10:08:53 truenas kernel: nvme nvme0: Try "nvme_core.default_ps_max_latency_us=0 pcie_aspm=off pcie_port_pm=off" and report a bug Dec 8 10:08:53 truenas kernel: nvme nvme1: Does your device have a faulty power saving mode enabled? Dec 8 10:08:53 truenas kernel: nvme nvme1: Try "nvme_core.default_ps_max_latency_us=0 pcie_aspm=off pcie_port_pm=off" and report a bug Dec 8 10:08:53 truenas kernel: nvme nvme2: controller is down; will reset: CSTS=0xffffffff, PCI_STATUS=0xffff Dec 8 10:08:53 truenas kernel: nvme nvme2: Does your device have a faulty power saving mode enabled? Dec 8 10:08:53 truenas kernel: nvme nvme2: Try "nvme_core.default_ps_max_latency_us=0 pcie_aspm=off pcie_port_pm=off" and report a bug Dec 8 10:08:53 truenas kernel: nvme 0000:02:00.0: Unable to change power state from D3cold to D0, device inaccessible Dec 8 10:08:53 truenas kernel: nvme 0000:03:00.0: Unable to change power state from D3cold to D0, device inaccessible Dec 8 10:08:53 truenas kernel: nvme nvme1: Disabling device after reset failure: -19 Dec 8 10:08:53 truenas kernel: nvme nvme2: Disabling device after reset failure: -19 Dec 8 10:08:53 truenas kernel: nvme 0000:01:00.0: Unable to change power state from D3cold to D0, device inaccessible Dec 8 10:08:53 truenas kernel: nvme nvme0: Disabling device after reset failure: -19 Dec 8 10:09:44 truenas smartd[2842]: Device: /dev/nvme0n1, state written to /var/lib/smartmontools/smartd.KINGSTON_SKC3000D2048G-50026B7686EB36EF-n1.nvme.state Dec 8 10:09:44 truenas smartd[2842]: Device: /dev/nvme1n1, state written to /var/lib/smartmontools/smartd.KINGSTON_SKC3000D2048G-50026B7686FBD3BB-n1.nvme.state Dec 8 10:09:44 truenas smartd[2842]: Device: /dev/nvme2n1, state written to /var/lib/smartmontools/smartd.KINGSTON_SKC3000D2048G-50026B7686EB2B94-n1.nvme.state Dec 8 10:09:44 truenas smartd[2842]: Device: /dev/nvme3n1, state written to /var/lib/smartmontools/smartd.KINGSTON_SKC3000D2048G-50026B7686FBD264-n1.nvme.state Dec 8 10:09:44 truenas smartd[2842]: Device: /dev/nvme4n1, state written to /var/lib/smartmontools/smartd.KINGSTON_SKC3000S512G-50026B7686DFB7FF-n1.nvme.state Dec 8 10:09:44 truenas smartd[6041]: Device: /dev/nvme0n1, opened Dec 8 10:09:44 truenas smartd[6041]: Device: /dev/nvme0n1, NVMe Identify Controller failed Dec 8 10:09:44 truenas smartd[6041]: Unable to register NVMe device /dev/nvme0n1 at line 1 of file /etc/smartd.conf Dec 8 10:09:44 truenas smartd[6041]: Device: /dev/nvme0n1, not available Dec 8 10:09:44 truenas smartd[6041]: Device: /dev/nvme1n1, opened Dec 8 10:09:44 truenas smartd[6041]: Device: /dev/nvme1n1, NVMe Identify Controller failed Dec 8 10:09:44 truenas smartd[6041]: Unable to register NVMe device /dev/nvme1n1 at line 2 of file /etc/smartd.conf Dec 8 10:09:44 truenas smartd[6041]: Device: /dev/nvme1n1, not available Dec 8 10:09:44 truenas smartd[6041]: Device: /dev/nvme2n1, opened Dec 8 10:09:44 truenas smartd[6041]: Device: /dev/nvme2n1, NVMe Identify Controller failed Dec 8 10:09:44 truenas smartd[6041]: Unable to register NVMe device /dev/nvme2n1 at line 3 of file /etc/smartd.conf Dec 8 10:09:44 truenas smartd[6041]: Device: /dev/nvme2n1, not available Dec 8 10:09:44 truenas smartd[6041]: Device: /dev/nvme3n1, opened Dec 8 10:09:44 truenas smartd[6041]: Device: /dev/nvme3n1, KINGSTON SKC3000D2048G, S/N:50026B7686FBD264, FW:EIFK51.2, NSID:1, 2.04 TB Dec 8 10:09:44 truenas smartd[6041]: Device: /dev/nvme3n1, is SMART capable. Adding to "monitor" list. Dec 8 10:09:44 truenas smartd[6041]: Device: /dev/nvme3n1, state read from /var/lib/smartmontools/smartd.KINGSTON_SKC3000D2048G-50026B7686FBD264-n1.nvme.state Dec 8 10:09:44 truenas smartd[6041]: Device: /dev/nvme4n1, opened Dec 8 10:09:44 truenas smartd[6041]: Device: /dev/nvme4n1, KINGSTON SKC3000S512G, S/N:50026B7686DFB7FF, FW:EIFK51.2, NSID:1, 512 GB Dec 8 10:09:44 truenas smartd[6041]: Device: /dev/nvme4n1, is SMART capable. Adding to "monitor" list. Dec 8 10:09:44 truenas smartd[6041]: Device: /dev/nvme4n1, state read from /var/lib/smartmontools/smartd.KINGSTON_SKC3000S512G-50026B7686DFB7FF-n1.nvme.state Dec 8 10:09:44 truenas smartd[6041]: Monitoring 0 ATA/SATA, 0 SCSI/SAS and 2 NVMe devices Dec 8 10:09:44 truenas smartd[6041]: Device: /dev/nvme3n1, NVMe error count increased from 76 to 80 (0 new, 4 ignored, 0 unknown) Dec 8 10:09:44 truenas smartd[6041]: Device: /dev/nvme4n1, NVMe error count increased from 86 to 90 (0 new, 4 ignored, 0 unknown) Dec 8 10:09:44 truenas smartd[6041]: Device: /dev/nvme3n1, state written to /var/lib/smartmontools/smartd.KINGSTON_SKC3000D2048G-50026B7686FBD264-n1.nvme.state Dec 8 10:09:44 truenas smartd[6041]: Device: /dev/nvme4n1, state written to /var/lib/smartmontools/smartd.KINGSTON_SKC3000S512G-50026B7686DFB7FF-n1.nvme.state Dec 8 10:10:33 truenas smartd[6041]: Device: /dev/nvme3n1, state written to /var/lib/smartmontools/smartd.KINGSTON_SKC3000D2048G-50026B7686FBD264-n1.nvme.state Dec 8 10:10:33 truenas smartd[6041]: Device: /dev/nvme4n1, state written to /var/lib/smartmontools/smartd.KINGSTON_SKC3000S512G-50026B7686DFB7FF-n1.nvme.state
did not make a difference.Code:midclt call system.advanced.update '{"kernel_extra_options": "nvme_core.default_ps_max_latency_us=0 pcie_aspm=off pcie_port_pm=off"}'
Same feeling, lost the pool after one day of uptime. Couple of drives unrecognized again. Next I'll try to replace the expansion board power cable. Updating drive firmware did not cross my mind, that might be the next step.In my case I believe that updating the firmware of each of my drives or trying to press the board enough so it fits correctly so far has solved for me. But I'm scared of opening the device again and the disconnections returning.