Faster hDAG construction #8

willdumm · 2022-03-11T17:30:48Z

The method used to build the history DAG is slow: each time a history is added, the entire DAG is searched for duplicate nodes.

It certainly doesn't have to be this way. Let's think about this with the __getstate__ and __setstate__ implementations as inspiration.


willdumm · 2022-03-15T20:15:41Z

All DAG construction functions should be modified to expect a generator of trees rather than an entire list, and to iterate over that generator one tree at a time so that memory use stays low.
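A minimal sketch of what consuming a generator could look like. The names `lazy_trees`, `build_incrementally`, and `add_tree` are hypothetical and not part of the existing code; the tuples stand in for real trees. The point is only that the builder touches each tree once, as it is produced, and never materializes the full list.

```python
def lazy_trees(n, log):
    """Hypothetical tree source: yields one placeholder tree at a time."""
    for i in range(n):
        log.append(f"made {i}")
        yield ("root", i)  # stand-in for a parsed tree


def build_incrementally(trees, add_tree):
    """Consume any iterable of trees in a single pass.

    `add_tree` is an assumed callback that merges one tree into the DAG.
    Only the tree currently being merged needs to be held in memory.
    """
    count = 0
    for tree in trees:
        add_tree(tree)
        count += 1
    return count


log = []
n_added = build_incrementally(
    lazy_trees(2, log), lambda tree: log.append(f"added {tree[1]}")
)
```

In this sketch `log` records creation and merging interleaved, tree by tree, which is the behavior a list argument would not give.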

Current situation:

Goal: build each node only once.
Input: list of ete trees?

Build a node dictionary node_dict (node -> node) for the existing DAG.
For each ete tree:
- Traverse in postorder to compute the child clades at every node.
- Traverse in preorder, adding each edge as follows: look up the parent node in node_dict (preorder guarantees it is already there), look up the child node in node_dict and add it if it is missing, then add an edge from parent to child.
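The plan above could be sketched roughly as follows. This is an illustration under assumptions, not the library's implementation: `Tree` is a hypothetical stand-in for an ete tree, `DagNode` is a hypothetical node type identified by its label and child clades, and the root of each tree is inserted explicitly before the preorder pass so that the parent lookup always succeeds.

```python
class Tree:
    """Hypothetical stand-in for an ete tree node (label plus children)."""

    def __init__(self, label, children=()):
        self.label = label
        self.children = list(children)

    def postorder(self):
        for child in self.children:
            yield from child.postorder()
        yield self

    def preorder(self):
        yield self
        for child in self.children:
            yield from child.preorder()


class DagNode:
    """Hypothetical DAG node, hashed and compared by (label, child clades)."""

    def __init__(self, label, clades):
        self.label = label
        self.clades = clades  # frozenset of frozensets of leaf labels
        self.children = set()

    def _key(self):
        return (self.label, self.clades)

    def __hash__(self):
        return hash(self._key())

    def __eq__(self, other):
        return isinstance(other, DagNode) and self._key() == other._key()


def candidate_nodes(tree):
    """Postorder pass: build a candidate DagNode for every tree node."""
    leaves_below, candidates = {}, {}
    for n in tree.postorder():
        if not n.children:
            leaves_below[id(n)] = frozenset([n.label])
            clades = frozenset()
        else:
            clades = frozenset(leaves_below[id(c)] for c in n.children)
            leaves_below[id(n)] = frozenset().union(*clades)
        candidates[id(n)] = DagNode(n.label, clades)
    return candidates


def add_trees(node_dict, trees):
    """Merge each tree into node_dict with O(1) lookups per node."""
    for tree in trees:
        candidates = candidate_nodes(tree)
        root = candidates[id(tree)]
        node_dict.setdefault(root, root)  # assumption: root added up front
        for n in tree.preorder():
            parent = node_dict[candidates[id(n)]]  # present by preorder
            for c in n.children:
                cand = candidates[id(c)]
                child = node_dict.setdefault(cand, cand)  # add if missing
                parent.children.add(child)
    return node_dict


def example(internal_label):
    return Tree("r", [Tree(internal_label, [Tree("a"), Tree("b")]), Tree("c")])


node_dict = add_trees({}, (example(lbl) for lbl in ["x", "x", "y"]))
```

Because each lookup is a dictionary access rather than a scan of the whole DAG, the cost per tree is proportional to the size of that tree. In the usage example, the repeated "x" tree adds no new nodes, and the "y" tree adds only the one node that differs.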

willdumm added the enhancement something to improve label Mar 11, 2022
