Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Refactor data structure representing account candidates for cleaning #2296

Merged
merged 32 commits into from
Aug 8, 2024

Conversation

dmakarov
Copy link

@dmakarov dmakarov commented Jul 26, 2024

Problem

AccountsDB::clean_accounts makes unnecessary copies of large number of
pubkeys and accompanying information to find and operate on the
accounts that can be deleted from the accounts index.

Summary of Changes

With this change the candidates for deletion are organized in a single data structure
with necessary information being updated in-place, thus reducing
memory requirements of the cleaning procedure.

Fixes #

@dmakarov dmakarov force-pushed the oom branch 4 times, most recently from 9f4a90e to 7c5cd45 Compare July 26, 2024 12:47
@dmakarov dmakarov changed the title Fix OOM Refactor data structure representing account candidates for cleaning Jul 26, 2024
@dmakarov dmakarov force-pushed the oom branch 9 times, most recently from 8a14193 to 77b9935 Compare July 26, 2024 19:42
accounts-db/src/accounts_db.rs Outdated Show resolved Hide resolved
accounts-db/src/accounts_db.rs Outdated Show resolved Hide resolved
Comment on lines 3055 to 3058
store.accounts.scan_pubkeys(|key| {
let index = self.accounts_index.bin_calculator.bin_from_pubkey(key);
let mut pubkeys_bin = pubkeys[index].write().unwrap();
pubkeys_bin.insert(*key, CleaningInfo::default());

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

We could also create per-thread binned pubkeys and then reduce that at the end. It may reduce lock contention.

I think what we have here is likely good as a v0. The map-reduce suggestion could be done in a subsequent PR.

Comment on lines 3104 to 3106
let index = self.accounts_index.bin_calculator.bin_from_pubkey(key);
let mut pubkeys_bin = pubkeys[index].write().unwrap();
pubkeys_bin.insert(*key, CleaningInfo::default());

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

And the same idea here too.

accounts-db/src/accounts_db.rs Show resolved Hide resolved
accounts-db/src/accounts_db.rs Outdated Show resolved Hide resolved
accounts-db/src/accounts_db.rs Outdated Show resolved Hide resolved
accounts-db/src/accounts_db.rs Outdated Show resolved Hide resolved
accounts-db/src/accounts_db.rs Outdated Show resolved Hide resolved
accounts-db/src/accounts_db.rs Outdated Show resolved Hide resolved
@@ -2742,86 +2755,95 @@ impl AccountsDb {
/// 1. one of the pubkeys in the store has account info to a store whose store count is not going to zero
/// 2. a pubkey we were planning to remove is not removing all stores that contain the account
fn calc_delete_dependencies(
purges: &HashMap<Pubkey, (SlotList<AccountInfo>, RefCount)>,
purges: &Vec<RwLock<HashMap<Pubkey, CleaningInfo>>>,

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

we should drop the RwLock at some point when it is no longer necessary, probably:
let purges = purges.into_iter().map(|x| x.into_inner().unwrap()).collect::<Vec<_>>()
This simplifies fns like this one.

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

we could also look for places where we are not executing in parallel where perhaps we should be. and maybe the rwlocks help us, but probably not.

Copy link
Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Maybe if we separate (key, value) into separate vectors as Haoran suggested we don't need RwLock at all.

Copy link
Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

we should drop the RwLock at some point when it is no longer necessary, probably: let purges = purges.into_iter().map(|x| x.into_inner().unwrap()).collect::<Vec<_>>() This simplifies fns like this one.

I think it only works like this

purges.into_iter().map(|x| {
    let y = x.read().unwrap();
    y.clone()
}).collect::<Vec<_>>()

because x is a shared reference and RwLock::into_inner cannot move out of a shared reference.

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

this works and is reasonable (doesn't copy the hashmaps):

 let purges = candidates.into_iter().map(|x| std::mem::take(&mut *x.write().unwrap())).collect::<Vec<_>>();

I tried Box::into_inner(candidates), but it complained about not knowing the size at compile time. I don't have a lot of experience with Box. I'm only saying that at some point we can get rid of the RwLock in the candidates. They were convenient early but no downstream code uses them at the moment. It only adds complexity and the possibility of deadlocks and poisoned rwlocks...

This isn't necessary for this pr, I don't think. And, I think the rwlocks could enable better threading/higher performance in the future.

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

fwiw, the separating keys/values is an interesting idea. I'd have to think more. I think what we have is fine for now.

@jeffwashington
Copy link

looks like the right direction to me! I see my comments overlapped brooks. I knew he would have a lot to say.

@dmakarov
Copy link
Author

I really appreciate the feedback, and will make the necessary changes. Thank you!

@dmakarov dmakarov force-pushed the oom branch 2 times, most recently from bf23120 to cb51c76 Compare July 27, 2024 12:11
@jeffwashington
Copy link

jeffwashington commented Aug 6, 2024

fyi, I pulled this into the editor and worked through it. There was 1 more place we needed to filter. Then, there were several places with code duplication and several places where cargo fmt is apparently not correct. I made all these changes in individual commits and pushed them.

@dmakarov
Copy link
Author

dmakarov commented Aug 6, 2024

What should I do with this PR? Is more work required within this PR or other issues can be addressed in follow-up work?

accounts-db/src/accounts_db.rs Show resolved Hide resolved
Comment on lines +3419 to 3421
if slot_list.is_empty() {
continue; // seems simpler than filtering. `candidates` contains all the pubkeys we original started with
}

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Is this a correctness or a performance change?

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

correctness. we previously would have not gotten in this loop for any pubkey that was not a zero lamport account. I don't know what happens if we have no slot list here. I don't want to find out.

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

but please verify my assertion!

Copy link

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Yeah. Nice catch!

@jeffwashington
Copy link

What should I do with this PR? Is more work required within this PR or other issues can be addressed in follow-up work?

once we test it once more, I think this pr is good. and once brooks/haoran review (and you look at it again). Bugs here can be very tricky, so this is a great exercise in really digging into this code.

@brooksprumo
Copy link

I'm concerned we're changing behavior in addition to the refactoring/reshaping of the data types. If possible, I'd find it preferable to have separate PRs for changing behavior. Maybe these changes are due to the refactoring/reshaping. I'll do another more substantive review.

@brooksprumo brooksprumo self-requested a review August 6, 2024 21:53
HaoranYi
HaoranYi previously approved these changes Aug 7, 2024
Copy link

@HaoranYi HaoranYi left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I went over the PR again. It looks good to me.
Just one nit for clean up, which can be done either in this PR or in a follow-up PR.

@@ -3130,10 +3129,6 @@ impl AccountsDb {

timings.delta_key_count = Self::count_pubkeys(&candidates);

let mut hashset_to_vec = Measure::start("flat_map");
hashset_to_vec.stop();
timings.hashset_to_vec_us += hashset_to_vec.as_us();

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

ha. it was never dumped out anyway.

@jeffwashington
Copy link

This morning, i started 2 mnb and 1 testnet validator with this change rebased to master.

accounts-db/src/accounts_db.rs Outdated Show resolved Hide resolved
accounts-db/src/accounts_db.rs Outdated Show resolved Hide resolved
accounts-db/src/accounts_db.rs Outdated Show resolved Hide resolved
accounts-db/src/accounts_db.rs Show resolved Hide resolved
accounts-db/src/accounts_db.rs Outdated Show resolved Hide resolved
accounts-db/src/accounts_db.rs Outdated Show resolved Hide resolved
@dmakarov
Copy link
Author

dmakarov commented Aug 7, 2024

I tried to address all recent comments.

Copy link

@brooksprumo brooksprumo left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

:shipit:

Thank you for answering all my questions and addressing all my comments!

Copy link

@jeffwashington jeffwashington left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

lgtm. Thanks for working through all the issues. We want future debugging to be easier, not harder. I think we've all learned new things about clean while also making it use less memory and run faster.
I pulled down the full change again and verified every line.
I have 2 mnb and 1 testnet validator running on this. All are running fine. We verified this on a ledger that was ooming at startup. No longer ooming.

@dmakarov dmakarov merged commit e8dfc9a into anza-xyz:master Aug 8, 2024
42 checks passed
@dmakarov dmakarov deleted the oom branch August 8, 2024 14:47
ray-kast pushed a commit to abklabs/agave that referenced this pull request Nov 27, 2024
…nza-xyz#2296)

* Refactor data structure representing account candidates for cleaning

AccountsDB::clean_accounts makes unnecessary copies of large number of
pubkeys and accompanying information to find and operate on the
accounts that can be deleted from the accounts index. With this change
the candidates for deletion are organized in a single data structure
with necessary information being updated in-place, thus reducing
memory requirements of the cleaning procedure.

* Remove redundant method of CleaningInfo and make its fields private

* Rework pubkey_to_slot_set construction

* Use slice instead of Vec in parameter types

* Remove redundant duplicate expression

* Fix tests build

* Replace Vec by Box

* Fix clippy error

* Add missing expect() message

* Update expect message

* Remove unnecessary check for empty bins

* Remove redundant type specification

* Formatting fixes

* Exclude accounts with empty slot list

* Fix name in an info message

* Call scan once per pubkey

* Fix deadlock in test

* Remove redundant variable

* Find affected pubkey in all bins

* Undo BINS_FOR_TESTING change

* Avoid acquiring read lock twice in same thread

* comments

* comments

* replace if-then-else by then_some

* Don't retain entries with empty slot list

* add filtering on store counts loop

* fix fmting on scan_accounts

* extract out count_pubkeys

* extract out `insert_pubkey`

* fix fmt on `retain`

* Remove hashset_to_vec timer

* comments

---------

Co-authored-by: jeff washington <[email protected]>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

4 participants