Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Software RAID install fails (7.6.) - many existing errors in disk layout #208

Closed
msgerbs opened this issue Jun 26, 2019 · 12 comments
Closed

Comments

@msgerbs
Copy link

msgerbs commented Jun 26, 2019

I am trying to do a Software RAID install of 7.6.0 on 2 128GB SATA SSDs connected to an LSI SAS9220-8i in JBOD mode, and when I select the drives it just goes right back to the select installation device screen. I also saw the same behavior using my server's built-in HP SAS controller. I don't know how to get the proper logs for this but if somebody can point me in the right direction I am happy to grab them.

For what it's worth I tried the 7.5.0 installer and had the same issue.

@msgerbs msgerbs changed the title Software RAID install fails Software RAID install fails (7.6.) Jun 26, 2019
@stormi
Copy link
Member

stormi commented Jun 26, 2019

Hi. You can switch to logs view or to a console with ALT + right arrow during the installation.

@stormi
Copy link
Member

stormi commented Jun 26, 2019

You may also want to try XCP-ng 8.0 beta to check if the problem is still present.

@msgerbs
Copy link
Author

msgerbs commented Jun 27, 2019

I tried XCP-ng 8.0 a few days ago and ran into the same issue.

After removing all other drives and attempting the install again I was able to do so without issues. I had previously put 4 of the drives in an mdadm array and I've seen some other issues posted about this causing problems, I wonder if that's what happened here?

I will try to reproduce and grab logs for you.

@msgerbs
Copy link
Author

msgerbs commented Jun 27, 2019

Sorry, but what commands do I actually need to run to get the relevant logs here? I can see the logs on screen and I can get to a terminal, but I'm not quite sure where to go from there. I checked every file and folder in /var/log but don't see anything that resembles the output I see during the install, and the logs during the disk partitioning go by too fast to see what the issue is.

Right now I am at the "select device" screen immediately after a failed attempt to install to software RAID.

@nagilum99
Copy link

I installed XCP-ng 7.6 with software RAID (mirror via setup) and upgraded later with 8 - it worked without any problems. Controller was an AMD A320 chipset S-ATA controller.

Under most distros you find helpful infos under /var/log/messages or /var/log/syslog
But I'm not sure about the install media from XCP-ng.

@msgerbs
Copy link
Author

msgerbs commented Jun 27, 2019

Unfortunately /var/log/messages only has the startup message from rsyslogd.

I'm guessing when you installed it you did not have any other existing mdadm arrays? I believe that's the issue here, and there are other issues along the same lines: #107, #75

@stormi
Copy link
Member

stormi commented Jun 27, 2019

During installation the logs are in /tmp/install-log. After installation, they are in /var/log/installer.

@msgerbs
Copy link
Author

msgerbs commented Jun 28, 2019

Here is the log: https://pastebin.com/RAcZ58Di

@stormi
Copy link
Member

stormi commented Nov 25, 2019

Sorry, I overlooked your answer. Looking at the logs, I see lots of complaints from partitioning tools regarding the existing disk layout:

Caution! After loading partitions, the CRC doesn't check out!
****************************************************************************
Caution: Found protective or hybrid MBR and corrupt GPT. Using GPT, but disk
verification and recovery are STRONGLY recommended.
****************************************************************************
Warning! Main partition table overlaps the first partition by 33 blocks!
You will need to delete this partition or resize it in another utility.
 
Warning! Secondary partition table overlaps the last partition by
18446744071756026481 blocks!
You will need to delete this partition or resize it in another utility.
 
STANDARD ERROR:
Caution: invalid backup GPT header, but valid main header; regenerating
backup header from main header.
 
Warning! Main and backup partition tables differ! Use the 'c' and 'e' options
on the recovery & transformation menu to examine the two tables.
 
Warning! One or more CRCs don't match. You should repair the disk!

I think our installer is not robust enough to cope with all those errors and give you a proper error message in the UI.

@stormi stormi changed the title Software RAID install fails (7.6.) Software RAID install fails (7.6.) - many existing errors in disk layout Nov 25, 2019
@rjt
Copy link

rjt commented Nov 25, 2019 via email

@stormi
Copy link
Member

stormi commented Nov 25, 2019

Thanks for the input. I think this issue is converging towards #107

@stormi
Copy link
Member

stormi commented Jan 19, 2021

Closing as duplicate for #107

@stormi stormi closed this as completed Jan 19, 2021
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

4 participants