Skip to content

Reducing HoeffdingTree Time&Memory Consumption #1074

Answered by smastelini
ZahirBilal asked this question in Q&A
Discussion options

You must be logged in to vote

Hi @ZahirBilal, nice strategy to compose trees, btw! I recently published a paper where random sampling is applied for regression, and the results are promising indeed :)

The choice of these parameters is based on my modest understanding of Hoeffding tree algorithms. What do you guys think? Did I choose a wrong approach?

If you want to control memory and time, your best bet is to start with max_depth. This is a straightforward way to constrain the trees and reduce memory usage, though it might lead to underfitting if the tree depth is too shallow. Next, I would try grace_period to avoid attempting to split too often. Increasing this parameter will give you some speed-ups at the cost of …

Replies: 1 comment 2 replies

Comment options

You must be logged in to vote
2 replies
@ZahirBilal
Comment options

@smastelini
Comment options

Answer selected by ZahirBilal
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
2 participants