-
Notifications
You must be signed in to change notification settings - Fork 28
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
KeyError: 66 (in used += self.cluster.osd_transfer_remainings[osdid]) #35
Comments
I have successfully worked around the crash by adding |
Thanks for the report and file - i can probably figure this out from the data but you may know directly: |
No idea. Apparently they just set one OSD to this class and later created a pool that uses it. Today they set more OSDs to this class (see the dump from #36). |
sounds wild :D let's hope they know what they're doing (but then again they seem to have asked you for help :) |
I think I should have expressed myself better. At this point, I think that the fact that I managed to notice a state with only one OSD with the "hdd_test" CRUSH device class is purely due to the miscoordination of my work with their work. The end result (two hosts full of hdd_test OSDs, which makes more sense but still triggers the issue unless |
While trying to rebalance an especially broken cluster, my colleague found this exception:
Note that osd.66 is the only OSD which has the hdd_test class:
As we are not permitted to publicly post anything containing UUIDs that can be used to identify the customer's cluster, I am going to submit the debug info via private email.
The text was updated successfully, but these errors were encountered: