Inaccurate gradient estimation for Gumbel-Sinkhorn #120

adamoyoung · 2021-12-28T22:12:39Z

I implemented a Gumbel-Sinkhorn layer as a CVXPY layer, and compared it with a baseline implementation using the Sinkhorn algorithm. While the forward passes of both layers seem similar, the backward passes differ. Specifically, it seems like the CVXPY layer is not able to provide accurate gradient esimates. I am looking for advice as to why it is not working properly.

I have detailed my experiments in this Google colab gist. Any feedback would be appreciated. Thanks!

AxelBreuer · 2023-05-09T09:14:36Z

Hi @adamoyoung ,
I am not familiar with Gumbel-Sinkhorn layers, but I quickly went through your Google colab and realized that this layer uses cp.log_sum_exp(x) and cp.exp(x).
I have recently fixed a bug in exponential cones (cvxgrp/diffcp#59); hence it could be woth to update to the lastest diffcp and relaunch you tests.
With a bit of luck, your code will work now.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Inaccurate gradient estimation for Gumbel-Sinkhorn #120

Inaccurate gradient estimation for Gumbel-Sinkhorn #120

adamoyoung commented Dec 28, 2021

AxelBreuer commented May 9, 2023 •

edited

Loading

Inaccurate gradient estimation for Gumbel-Sinkhorn #120

Inaccurate gradient estimation for Gumbel-Sinkhorn #120

Comments

adamoyoung commented Dec 28, 2021

AxelBreuer commented May 9, 2023 • edited Loading

AxelBreuer commented May 9, 2023 •

edited

Loading