[ENH; WIP] Implementation of FCI algorithm that leverages the base classes used in PC #32

adam2392 · 2022-08-30T16:31:14Z

Changes proposed in this pull request:

implements the FCI algorithm
implements a subclass of LearnSkeleton for learning skeleton graphs using the idea of "possibly-d-separating" sets and "PDS-Path" sets (see RFCI paper)
implements unit tests for skeleton and FCI procedure

Next PR:

implements an example demonstrating how FCI is different

This relies on pywhy-graphs changes in: py-why/pywhy-graphs#10
Also, this will be downstream of #30.

Order of merging:

Before submitting

I've read and followed all steps in the Making a pull request
section of the CONTRIBUTING docs.
I've updated or added any relevant docstrings following the syntax described in the
Writing docstrings section of the CONTRIBUTING docs.
If this PR fixes a bug, I've added a test that will fail without my fix.
If this PR adds a new feature, I've added tests that sufficiently cover my new functionality.

After submitting

All GitHub Actions jobs for my pull request have passed.

Signed-off-by: Adam Li <[email protected]>

bloebp · 2022-08-31T19:32:22Z

Just want to mention that if we start supporting causal-learn (which we plan to), we should probably not have another implementation of the most common algorithms. However, I see the benefit here to demonstrate the API and graph usages.

That being said, maybe we should look into having wrappers to support calling the causal-learn functions. This could be as simple as:

def pc(data):
  # pseudocode: causal_learn.pc_algorithm stands in for the external library call
  adjacency_matrix = causal_learn.pc_algorithm(data)

  cpdag = OurCPDAGImplementation(adjacency_matrix)

  return cpdag

There is no need to have the graph definitions deeply integrated in the algorithms.
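To make the wrapper idea concrete, here is a minimal sketch of what the pseudocode above could look like. Everything in it is an assumption for illustration: `SimpleCPDAG` is a stand-in for our own CPDAG class, `external_pc` is any callable that returns an adjacency matrix, and the matrix convention (1 in one direction means a directed edge, 1 in both means undirected) may not match what causal-learn actually returns.

```python
from dataclasses import dataclass, field
from typing import Callable, FrozenSet, List, Sequence, Set, Tuple

AdjacencyMatrix = List[List[int]]


@dataclass
class SimpleCPDAG:
    """Hypothetical stand-in for the project's own CPDAG class."""

    nodes: List[str]
    directed: Set[Tuple[str, str]] = field(default_factory=set)
    undirected: Set[FrozenSet[str]] = field(default_factory=set)


def cpdag_from_adjacency(adj: AdjacencyMatrix, names: Sequence[str]) -> SimpleCPDAG:
    """Convert an adjacency matrix into a graph object.

    Assumed convention: adj[i][j] == 1 and adj[j][i] == 0 means i -> j;
    a 1 in both positions means an undirected edge i - j.
    """
    graph = SimpleCPDAG(nodes=list(names))
    for i in range(len(names)):
        for j in range(i + 1, len(names)):
            forward, backward = adj[i][j], adj[j][i]
            if forward and backward:
                graph.undirected.add(frozenset((names[i], names[j])))
            elif forward:
                graph.directed.add((names[i], names[j]))
            elif backward:
                graph.directed.add((names[j], names[i]))
    return graph


def pc(data, names: Sequence[str],
       external_pc: Callable[[object], AdjacencyMatrix]) -> SimpleCPDAG:
    """Run an external PC implementation and wrap its result in our graph type."""
    return cpdag_from_adjacency(external_pc(data), names)


def stub_external_pc(data) -> AdjacencyMatrix:
    # Placeholder for the third-party call; always reports X -> Z <- Y.
    return [[0, 0, 1], [0, 0, 1], [0, 0, 0]]


cpdag = pc(data=None, names=["X", "Y", "Z"], external_pc=stub_external_pc)
print(sorted(cpdag.directed))  # [('X', 'Z'), ('Y', 'Z')]
```

The conversion lives in one function, so the algorithm itself never needs to know about the graph class.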

adam2392 · 2022-08-31T20:08:00Z

Just want to mention that if we start supporting causal-learn (which we plan to), we should probably not have another implementation of the most common algorithms. However, I see the benefit here to demonstrate the API and graph usages.

That sounds good. I'm motivated to implement at least the core constraint-based algorithms in this repo and codebase for a few reasons:

i) extensibility: right now I am trying to get an implementation of rFCI working without having to use the R package pcalg.
ii) maintainability: I don't see rules 5-10 of Zhang 2008 in the FCI function. I found it easier to just draft an implementation that teases apart the semantics.

In the proposed constraint-based API, I think it's important to modularize and pull apart each aspect of the core algorithms to enable future improvements. For example, rFCI, or even conservativeFCI or maxvoteFCI, would be trivially implementable by subclassing the FCI class here.
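As a rough illustration of that modularity argument, the sketch below splits a toy constraint-based procedure into overridable phases and derives a conservative variant by overriding a single hook. The class and method names are hypothetical and are not the classes in this PR; the skeleton search is a brute-force search over conditioning sets rather than PC's adjacency-restricted search, and only collider orientation is shown (no Meek or Zhang rules).

```python
from itertools import combinations
from typing import Callable, Dict, FrozenSet, List, Optional, Set, Tuple

Node = str
Edge = FrozenSet[Node]
CITest = Callable[[Node, Node, Set[Node]], bool]  # True means "independent"


class SkeletonThenOrient:
    """Hypothetical base class with one overridable method per phase."""

    def __init__(self, ci_test: CITest, max_cond_size: int = 2) -> None:
        self.ci_test = ci_test
        self.max_cond_size = max_cond_size

    def _find_sepset(self, x: Node, y: Node, candidates: List[Node]) -> Optional[Set[Node]]:
        # Try conditioning sets of increasing size; return the first that separates.
        for size in range(self.max_cond_size + 1):
            for subset in combinations(candidates, size):
                if self.ci_test(x, y, set(subset)):
                    return set(subset)
        return None

    def learn_skeleton(self, nodes: List[Node]) -> Tuple[Set[Edge], Dict[Edge, Set[Node]]]:
        edges: Set[Edge] = set()
        sepsets: Dict[Edge, Set[Node]] = {}
        for x, y in combinations(nodes, 2):
            others = [n for n in nodes if n not in (x, y)]
            sepset = self._find_sepset(x, y, others)
            if sepset is None:
                edges.add(frozenset((x, y)))  # never separated: keep the edge
            else:
                sepsets[frozenset((x, y))] = sepset
        return edges, sepsets

    def is_collider(self, x, z, y, edges, sepsets) -> bool:
        # Standard check: z is a collider if it is absent from the stored sepset.
        return z not in sepsets[frozenset((x, y))]

    def orient_colliders(self, nodes, edges, sepsets) -> Set[Tuple[Node, Node]]:
        arrows: Set[Tuple[Node, Node]] = set()  # (a, b) = arrowhead at b
        for x, y in combinations(nodes, 2):
            if frozenset((x, y)) in edges:
                continue  # shielded pair, not an unshielded triple
            for z in nodes:
                if z in (x, y):
                    continue
                if (frozenset((x, z)) in edges and frozenset((y, z)) in edges
                        and self.is_collider(x, z, y, edges, sepsets)):
                    arrows.update({(x, z), (y, z)})
        return arrows

    def fit(self, nodes: List[Node]):
        edges, sepsets = self.learn_skeleton(nodes)
        return edges, self.orient_colliders(nodes, edges, sepsets)


class ConservativeVariant(SkeletonThenOrient):
    """Orients a collider only if no separating set of neighbours contains z."""

    def is_collider(self, x, z, y, edges, sepsets) -> bool:
        for anchor in (x, y):
            nbrs = [n for e in edges if anchor in e for n in e if n != anchor]
            for size in range(self.max_cond_size + 1):
                for subset in combinations(nbrs, size):
                    if z in subset and self.ci_test(x, y, set(subset)):
                        return False
        return True


def collider_oracle(a: Node, b: Node, cond: Set[Node]) -> bool:
    # Oracle for X -> Z <- Y: X and Y are independent unless Z is conditioned on.
    return {a, b} == {"X", "Y"} and "Z" not in cond


edges, arrows = ConservativeVariant(collider_oracle).fit(["X", "Y", "Z"])
print(sorted(arrows))  # [('X', 'Z'), ('Y', 'Z')]
```

The variant reuses the skeleton phase untouched, which is the kind of reuse the subclassing argument is about.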

Of course... this all relies on me being invested in constraint-based learning :p. I think if there is developer consensus on the API, then re-implementing will be helpful in the long run to consolidate the API across all of pywhy. That said, I am less motivated to do the score-based algorithms, so for those, implementing a wrapper sounds like a great way of getting the functionality out right away. E.g. #29

adam2392 · 2022-08-31T20:08:03Z

def pc(data):
  adjacency_matrix = causal_learn.pc_algorithm(data)
  
  cpdag = OurCPDAGImplementation(adjacency_matrix)

  return cpdag
There is no need to have the graph definitions deeply integrated in the algorithms.

For this comment: are you referring to the lines here

dodiscover/dodiscover/constraint/_classes.py

Lines 119 to 123 in 3c4f8d2

    def convert_skeleton_graph(self, graph: nx.Graph) -> EquivalenceClassProtocol:
        raise NotImplementedError(
            "All constraint discovery algorithms need to implement a function to convert "
            "the skeleton graph to a causal graph."
        )

or

dodiscover/dodiscover/constraint/pcalg.py

Lines 197 to 200 in ab79e40

    if graph.has_edge(v_i, u, graph.undirected_edge_name):
        graph.orient_uncertain_edge(v_i, u)
    if graph.has_edge(v_j, u, graph.undirected_edge_name):
        graph.orient_uncertain_edge(v_j, u)

?

If the former, then agreed, it's quite hidden. Right now I convert to the class, but that's because I need the explicit object to "orient edges" in the PC/FCI orientation phase. I think we've discussed the issue of assuming a numpy array vs. an actual object in the internals of the algorithm. Converting a graph object to a numpy array raises the same issues as converting a numpy array to a graph object before running the algorithms. But if we can eventually push MixedEdgeGraph into networkx, why not go with an explicit graph object instead of a numpy array? Perhaps something to discuss concretely at the next meeting(?)

I think one way to make this more transparent and modular is to let the user optionally pass in a graph function, so that def convert_skeleton_graph(self, graph: nx.Graph) -> EquivalenceClassProtocol: can return any graph object that meets the defined Protocol.
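A rough sketch of that idea follows, under several assumptions: `EquivalenceClassLike` is a reduced, hypothetical stand-in for `EquivalenceClassProtocol` that only covers the calls visible in the snippets above, `TinyMixedGraph` and `Discovery` are invented for illustration, and the skeleton is passed as a plain edge list instead of an `nx.Graph` so the example stays dependency-free.

```python
from typing import Callable, Iterable, Protocol, Tuple, runtime_checkable

Edge = Tuple[str, str]


@runtime_checkable
class EquivalenceClassLike(Protocol):
    """Hypothetical, reduced stand-in for EquivalenceClassProtocol."""

    undirected_edge_name: str

    def has_edge(self, u: str, v: str, edge_type: str) -> bool: ...

    def orient_uncertain_edge(self, u: str, v: str) -> None: ...


class TinyMixedGraph:
    """Toy graph that satisfies the protocol above."""

    undirected_edge_name = "undirected"
    directed_edge_name = "directed"

    def __init__(self, undirected_edges: Iterable[Edge]) -> None:
        self._undirected = {frozenset(e) for e in undirected_edges}
        self._directed = set()

    def has_edge(self, u: str, v: str, edge_type: str) -> bool:
        if edge_type == self.undirected_edge_name:
            return frozenset((u, v)) in self._undirected
        return (u, v) in self._directed

    def orient_uncertain_edge(self, u: str, v: str) -> None:
        # Replace the undirected edge u - v with the directed edge u -> v.
        self._undirected.discard(frozenset((u, v)))
        self._directed.add((u, v))


GraphFactory = Callable[[Iterable[Edge]], EquivalenceClassLike]


class Discovery:
    """Hypothetical algorithm class that accepts a user-supplied graph factory."""

    def __init__(self, graph_factory: GraphFactory = TinyMixedGraph) -> None:
        self.graph_factory = graph_factory

    def convert_skeleton_graph(self, skeleton_edges: Iterable[Edge]) -> EquivalenceClassLike:
        graph = self.graph_factory(skeleton_edges)
        # Structural check: the object only needs the protocol's members.
        if not isinstance(graph, EquivalenceClassLike):
            raise TypeError("graph_factory must return an object meeting the protocol")
        return graph


algo = Discovery()
graph = algo.convert_skeleton_graph([("x", "z"), ("y", "z")])
if graph.has_edge("x", "z", graph.undirected_edge_name):
    graph.orient_uncertain_edge("x", "z")
```

With this shape, swapping in a different graph class means passing a different factory, and the orientation code does not change.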

bloebp · 2022-08-31T22:10:22Z

I was rather referring to providing wrappers instead of re-implementing the algorithms (as discussed offline). The snippet was meant as pseudocode to show that we can simply call external libraries and convert their results into our objects.

Signed-off-by: Adam Li <[email protected]>

adam2392 added 6 commits August 26, 2022 13:19

Working version

49bdfe1

Signed-off-by: Adam Li <[email protected]>

Adding updated example

924d444

Signed-off-by: Adam Li <[email protected]>

Adding updated example

ebb11a8

Signed-off-by: Adam Li <[email protected]>

Adding updated example

ab79e40

Signed-off-by: Adam Li <[email protected]>

need to run tests

91b278d

Signed-off-by: Adam Li <[email protected]>

Almost WIP

fac3e84

Signed-off-by: Adam Li <[email protected]>

adam2392 changed the title ~~[ENH] Implementation of FCI algorithm that leverages the base classes used in PC~~ [ENH; WIP] Implementation of FCI algorithm that leverages the base classes used in PC Aug 30, 2022

adam2392 marked this pull request as draft August 30, 2022 16:31

adam2392 added 4 commits August 30, 2022 17:31

Fixed docs, unit tests and typing

0543c45

Signed-off-by: Adam Li <[email protected]>

Try again

efd85f3

Signed-off-by: Adam Li <[email protected]>

Try again

fce0550

Signed-off-by: Adam Li <[email protected]>

Try again

120ac45

Signed-off-by: Adam Li <[email protected]>

adam2392 added 4 commits September 21, 2022 20:23

Merging in main

0f0cbd4

Signed-off-by: Adam Li <[email protected]>

Adding counts

89886f7

Merging in main

c4ff9f3

Signed-off-by: Adam Li <[email protected]>

Fix unit tests

7112fd4

Signed-off-by: Adam Li <[email protected]>

adam2392 mentioned this pull request Sep 27, 2022

[ENH] Implementation of a "complete" FCI algorithm (augmented FCI) #52

Merged

6 tasks

adam2392 closed this in #52 Oct 1, 2022

adam2392 deleted the fci branch January 11, 2023 23:46
