Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

First Implementation of a Simplex Trie #220

Draft
wants to merge 1 commit into
base: main
Choose a base branch
from
Draft

Conversation

ffl096
Copy link
Member

@ffl096 ffl096 commented Aug 18, 2023

This implements a simplex trie as presented in [1] as backend data structure for the SimplicialComplex class. This is also used in gudhi's SC implementation. However, they do not expose all functionality we need and the data structure is implemented in native code, so we cannot interact with it directly either.

Using a simplex tree should bring some nice performance improvements over the previous approach and fixes some bugs along the way as well. I will add some comparisons later.

[1] Jean-Daniel Boissonnat and Clément Maria. The Simplex Tree: An Efficient Data Structure for General Simplicial Complexes. Algorithmica, pages 1–22, 2014

@ffl096
Copy link
Member Author

ffl096 commented Aug 18, 2023

@mhajij The tests fail because coseg loads a pickled state of SimplicialComplex with internal properties. This is (unrelated to this pull request) a bad idea, as any change of the data structure may lead to errors, or worse undetected inconsistencies.

@mhajij
Copy link
Member

mhajij commented Aug 18, 2023

@mhajij The tests fail because coseg loads a pickled state of SimplicialComplex with internal properties. This is (unrelated to this pull request) a bad idea, as any change of the data structure may lead to errors, or worse undetected inconsistencies.

@ffl096
I am not sure we should merge this pull request now because the ICML challenge participants might have used that dataset and I think we need to merge the pull request they have their first before we merge this particular pull request. What do you think?

@mhajij mhajij self-requested a review August 18, 2023 21:34
@ffl096
Copy link
Member Author

ffl096 commented Aug 19, 2023

This is a draft pull request, it is not to be merged right now regardless :)

However, just to clarify: I do not propose to remove the coseg dataset. We have to think about a reasonable data format to deliver the dataset that does not rely on pickle. Ideally, the return value of the coseg function should stay exactly the same.
SimplicialComplex objects in this pr are compatible to the previous implementation as long as the user does not access internal state. The ICML submissions should all be fine.

@mhajij
Copy link
Member

mhajij commented Aug 19, 2023

This is a draft pull request, it is not to be merged right now regardless :)

However, just to clarify: I do not propose to remove the coseg dataset. We have to think about a reasonable data format to deliver the dataset that does not rely on pickle. Ideally, the return value of the coseg function should stay exactly the same. SimplicialComplex objects in this pr are compatible to the previous implementation as long as the user does not access internal state. The ICML submissions should all be fine.

we need to create a Data object to be utilized in the higher order context. I think the one available in torch is good enough.

This is an example on how it can be used in a higher order DL model https://github.com/pyt-team/TopoModelX/blob/569bd193f81d47e04891376676c034e90cc07554/tutorials/combinatorial/hmc_train.ipynb

@mhajij
Copy link
Member

mhajij commented Sep 14, 2023

@ffl096 I think we can merge this now, testing is failing however, can you please take care of it so we can merge ? also lint.

@ffl096
Copy link
Member Author

ffl096 commented Sep 14, 2023

The dataset issue still stands and is outside of the scope to be fixed here. We cannot reliably use pickled objects as data objects.

@mhajij
Copy link
Member

mhajij commented Sep 14, 2023

The dataset issue still stands and is outside of the scope to be fixed here. We cannot reliably use pickled objects as data objects.

I cannot merge wihout passing the tests, what do you think we should do? should we fix the dataset issues first?

@ffl096
Copy link
Member Author

ffl096 commented Sep 14, 2023

According to git blase, the coseg dataset downloaded from here was preprocessed by you, right? This repo does not contain this preprocessing script, can you provide that to me? Same for shrec_16.

@ffl096 ffl096 added enhancement New feature or request refactor labels Sep 14, 2023
@ffl096 ffl096 force-pushed the frantzen-simplex-trie branch 3 times, most recently from c38136b to 7357c14 Compare September 20, 2023 07:14
@USFCA-MSDS
Copy link
Contributor

@ffl096 What do you want to do with this PR ? I think we need to have SC faster and implemented correctly but many code relies on the datasets-- what do you suggest?

@ffl096
Copy link
Member Author

ffl096 commented Feb 9, 2024

As outlined above, the dataset structure has to be overhauled completely. This is outside of the scope of this pull request though, and needs to be done regardless. The current system is highly unstable. Once that is done, this pull request is good to be merged.

@ffl096 ffl096 force-pushed the frantzen-simplex-trie branch 5 times, most recently from 41939ff to 5a1a11a Compare February 9, 2024 14:54
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request refactor
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants