-
Notifications
You must be signed in to change notification settings - Fork 12
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Karm - network firmware - Warning firmware error detected FWSM: 0x8118801B / 0x8118801F #997
Comments
Supermicro support previously supplied:
|
Supermicro have come back and recommended RMA'ing the |
Replacement riser is NOT cheap at ~£500, and no guarantee it is running the corrected firmware: https://www.ebay.co.uk/itm/225350050919 |
USA is cheaper, but no shipping to UK / EU. https://www.ebay.com/itm/363404625954 |
I mean, we could ship via someone in the US, but that's a lot of effort for something that might not even fix the problem. Can we cross-ship with supermicro? If we can't reasonably fix it, can we get a separate PCIe network card and either remove or disable the AOC network card? |
For now I have asked if I can sign their magic NDA for them to release the firmware update tool to me. If that is a no-go I will find out what RMA options there are.
Yes, this is an option. |
Supermicro now report the firmware we have is not field upgradeable and have offered an advance swap-out RMA (receive, before send). |
Ops to decide if to RMA now or to wait until next site visit. No site visits planned at the moment. |
Proceeding with RMA now. Will remote hands the swap-out work. |
RMA submitted. Waiting for approval. |
Supermicro approved the RMA. Non-advance. |
I have booked smart hands for karm and arranged DHL collection on Monday. |
Karm has been powered down in prep. |
Remote hands have removed the card and boxes it for collection on Monday by DHL. |
The card has been shipped. Marking as blocked until we get the card returned. |
The card arrived at RMA centre and is being processed. |
Supermicro have confirmed receipt of the RMA. They will update the firmware and confirm when ready for return. |
Supermicro have repaired the riser. The card should be returned in the next few days. I have created a combined inbound equinix / smart-hands ticket. |
Server is back online with updated riser/nic. All good, no more kernel errors. |
Sorry to resurrect this but I was wondering if they disclosed what they did to the riser to bring it back to working order. I have seen many servers with this riser come through and fill dmesg, but am unaware of any fix. Here is another user experiencing what appears to be the same in the FAQ: https://www.supermicro.com/support/faqs/faq.cfm?faq=38678 |
@WarmWelcome Prior to the RMA they sent me a variety of firmware updates. None worked, regardless of install method (Linux, DOS, UEFI). They said something to the effect that the NIC updates were locked as per the Support FAQ you linked. The RMA, I believe they must have used an external device programmer to update the firmware chip. |
There is an unknown issue with the AOC-2UR6N4-i4XT network card riser in karm. The kernel logs are being flooded with the following kernel error. The issue is not new.
We have previously tried to get a firmware update from Supermicro to fix the issue, but both of the updates they supplied would not load and returned errors.
I have reached out to Supermicro support again.
The text was updated successfully, but these errors were encountered: