feat(matching): use the Hungarian Algorithm for unordered matching #50

jpedroh · 2024-05-01T20:47:16Z

Our current approach to unordered node matching relies on a naive assumption: that every node has an identifier. This holds for most nodes we have encountered so far, such as method and property declarations within a Java class, but it breaks down for nodes without a label, like static blocks in Java. In such cases the computed matchings may be incorrect, which in turn leads to erroneous merges.

This pull request models unordered node matching as an Assignment Problem and solves it with the Hungarian Algorithm. This mirrors the approach used in jDime.

Since the Hungarian Algorithm is well known and widely implemented, we rely on the implementation provided by the pathfinding crate. This keeps our side simple: we only need to provide the weight matrix and extract the matching information from the solution.
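To make the two responsibilities concrete, here is a minimal sketch of building a weight matrix and reading matches back from a solver result. It is not the code in this PR: `build_weights`, `extract_matches`, and the `score` closure are hypothetical names, and the sketch assumes the solver returns one column index per row.

```rust
/// Builds a weight matrix where `weights[i][j]` scores how well left child `i`
/// matches right child `j`. `score` is a hypothetical similarity function.
fn build_weights<T>(left: &[T], right: &[T], score: impl Fn(&T, &T) -> i32) -> Vec<Vec<i32>> {
    left.iter()
        .map(|l| right.iter().map(|r| score(l, r)).collect())
        .collect()
}

/// Turns a solver result into (left index, right index) pairs.
/// Assumption: `assignment[i]` holds the column chosen for row `i`.
/// Pairs whose weight is zero are dropped as non-matches.
fn extract_matches(weights: &[Vec<i32>], assignment: &[usize]) -> Vec<(usize, usize)> {
    assignment
        .iter()
        .enumerate()
        .filter(|&(row, &col)| weights[row][col] > 0)
        .map(|(row, &col)| (row, col))
        .collect()
}
```

Dropping zero-weight pairs is a design choice of this sketch: the solver always assigns every row, so something has to decide which assignments count as real matches.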

A workaround was needed because pathfinding expects the input weight matrix to have the same number of rows and columns, which is not always the case for us since the nodes being matched can have different numbers of children. The workaround fills the missing rows or columns with 0.
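The padding step can be sketched as follows. This is an illustration under assumptions rather than the PR's code: `pad_to_square` is a hypothetical helper, and the matrix is represented as a plain `Vec<Vec<i32>>` instead of the crate's own matrix type.

```rust
/// Pads a rectangular weight matrix with zeros until it is square.
/// Rows and columns that did not exist in the input carry weight 0,
/// so they never add to the total weight of an assignment.
fn pad_to_square(weights: &[Vec<i32>]) -> Vec<Vec<i32>> {
    let rows = weights.len();
    let cols = weights.iter().map(Vec::len).max().unwrap_or(0);
    let size = rows.max(cols);

    // Start from an all-zero square matrix and copy the real weights in.
    let mut square = vec![vec![0; size]; size];
    for (i, row) in weights.iter().enumerate() {
        square[i][..row.len()].copy_from_slice(row);
    }
    square
}
```

Any assignment that lands on a padded row or column refers to a child that does not exist, so it should be discarded when the matches are read back.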

For now, our naive label-based implementation is bypassed and not used. In a future pull request, the idea is to resort to the Hungarian Algorithm only when nodes are unlabeled, since it is significantly more complex than simply matching identifiers.
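One possible shape for that follow-up is sketched below. It is purely hypothetical: the `Node` struct, the `Strategy` enum, and `choose_strategy` are made up for illustration and do not exist in this PR.

```rust
/// Hypothetical node shape: `identifier` is `None` for unlabeled nodes,
/// such as Java static blocks.
struct Node {
    identifier: Option<String>,
}

#[derive(Debug, PartialEq)]
enum Strategy {
    /// Cheap path: pair nodes whose identifiers are equal.
    ByIdentifier,
    /// Costlier path: solve the assignment problem over all children.
    Assignment,
}

/// Picks identifier matching only when every child on both sides is labeled.
fn choose_strategy(left: &[Node], right: &[Node]) -> Strategy {
    let all_labeled = left.iter().chain(right).all(|n| n.identifier.is_some());
    if all_labeled {
        Strategy::ByIdentifier
    } else {
        Strategy::Assignment
    }
}
```

This sketch switches strategy per parent node; a finer-grained variant could match labeled children by identifier first and run the assignment only on the remainder.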

coveralls · 2024-05-01T20:49:59Z

coverage: 86.126% (-0.3%) from 86.418% when pulling eb9fe81 on feat-assignment-problem into 31ea3c8 on main.

jpedroh added 6 commits May 1, 2024 17:12

2a1a72c feat(matching): initial implementation of assignment problem
02e197c refactor: use return from pathfinding instead of calculating it
1de8663 refactor: rename function
de42f36 refactor: use unreachable for unreachable case
9505799 refactor: rename and remove strict type alias
082b46e feat: early return and tweak test scenario
eb9fe81 refactor: move file to unordered mod for later improvements in design

jpedroh marked this pull request as ready for review May 1, 2024 21:01

jpedroh merged commit fd5d943 into main May 1, 2024 (8 checks passed)

jpedroh deleted the feat-assignment-problem branch May 1, 2024 21:02
