Added ShrunkCovariance #309

norm4nn · 2024-11-08T17:59:09Z

What has been added:

ShrunkCovariance module (scikit-learn context)
Covariance.Utilites module with common functions
tests and docs for ShrunkCovariance

Following up on #304: I added the ? sign to the assume_centered option, but I'm unsure about replacing empirical_covariance/1 with Nx.covariance/3 because tests don’t pass when I make that change. I believe the tests should stay as they are, since I’ve already verified the results against scikit-learn.

lib/scholar/covariance/shrunk_covariance.ex

Co-authored-by: Mateusz Sluszniak <[email protected]>

Co-authored-by: José Valim <[email protected]>

lib/scholar/covariance/shrunk_covariance.ex

krstopro

Few nitpicks.

lib/scholar/covariance/utils.ex

krstopro · 2024-11-12T14:35:18Z

lib/scholar/covariance/utils.ex

+    {x - location, location}
+  end
+
+  defn empirical_covariance(x) do


You should be able to use Nx.covariance/2 instead.

I don't think so, look at the PR description

Right, I forgot to ask: @norm4nn did you try setting ddof: 0 when calling Nx.covariance/2?

Sorry, ddof: 0 is default. It's strange that the tests are failing, because Nx.covariance/2 does exactly what is implemented here. Could it be the case that the data in your test is not centered and you are using Nx.covariance/2?

Yeah, you are right! This is the case, I will fix this on Thursday.

Co-authored-by: Paulo Valente <[email protected]>

Co-authored-by: Krsto Proroković <[email protected]>

krstopro · 2024-11-14T22:02:33Z

lib/scholar/covariance/ledoit_wolf.ex

@@ -182,9 +149,9 @@ defmodule Scholar.Covariance.LedoitWolf do

  defnp ledoit_wolf_shrinkage_complex(x) do
    {num_samples, num_features} = Nx.shape(x)
-    emp_cov = empirical_covariance(x)
+    emp_cov = Nx.covariance(x)


Is x centered here? If yes, maybe reverting it to empirical_covariance might be better. 😅

Hmm, I don't think empirical_covariance/1 was centering x, if that's what you're asking about. However, I'm still a bit confused as to why, when I used empirical_covariance/1 instead of Nx.covariance/2, the Covariance.*.fit/2 functions produced the same output as the scikit-learn versions when I passed a non-centered x and set assume_centered? to true. While I feel this case is invalid and can't think of any practical use case for passing such arguments, it still feels odd that the output in this edge case differs when using Nx.covariance/2 compared to the scikit-learn version. It seems empirical_covariance might not operate exactly the same way as Nx.covariance/2 , so I also like the idea of reverting it to empirical_covariance.

Lemme have a look.

I think you need

Suggested change

emp_cov = Nx.covariance(x)

emp_cov = Nx.covariance(x, Nx.reshape(0, {1}))

for the code to be totally equivalent.

empirical_covariance was blindly assuming the mean is 0

empirical_covariance was blindly assuming the mean is 0

Yes, because the data should be centered before. That is why using empirical_covariance might be better idea. Like this the data won't be centered twice.

So you want me to change it back to empirical_covariance version?

Sorry for the delayed response! I am still checking why empirical_covariance and Nx.covariance are giving completely different results.

@norm4nn let's revert to empirical_covariance for now so we can merge this and get unstuck. :)

josevalim · 2024-11-19T18:40:21Z

💚 💙 💜 💛 ❤️

norm4nn added 6 commits November 2, 2024 15:37

added working shrunk covariance

4bf257b

Merge branch 'shrunk_cov' of https://github.com/norm4nn/scholar

238f181

added utilites for covariance, added tests for shrunk covariance

f605f27

Added docs

4b50ca6

Added docs

7e4378b

added '?' sign to assume_centered option

df80bc3

msluszniak reviewed Nov 10, 2024

View reviewed changes

lib/scholar/covariance/shrunk_covariance.ex Outdated Show resolved Hide resolved

josevalim reviewed Nov 11, 2024

View reviewed changes

lib/scholar/covariance/shrunk_covariance.ex Outdated Show resolved Hide resolved

norm4nn and others added 2 commits November 11, 2024 12:14

Update lib/scholar/covariance/shrunk_covariance.ex

4ec5bc5

Co-authored-by: Mateusz Sluszniak <[email protected]>

Update lib/scholar/covariance/shrunk_covariance.ex

3f38654

Co-authored-by: José Valim <[email protected]>

josevalim requested a review from polvalente November 11, 2024 11:31

polvalente reviewed Nov 12, 2024

View reviewed changes

lib/scholar/covariance/shrunk_covariance.ex Outdated Show resolved Hide resolved

polvalente approved these changes Nov 12, 2024

View reviewed changes

krstopro reviewed Nov 12, 2024

View reviewed changes

norm4nn and others added 2 commits November 12, 2024 17:58

Update lib/scholar/covariance/shrunk_covariance.ex

5323af1

Co-authored-by: Paulo Valente <[email protected]>

Update lib/scholar/covariance/utils.ex

34f8574

Co-authored-by: Krsto Proroković <[email protected]>

krstopro reviewed Nov 14, 2024

View reviewed changes

msluszniak approved these changes Nov 19, 2024

View reviewed changes

reverted to empirical_covariance

f72d08f

norm4nn force-pushed the shrunk_covariance branch from 4297069 to f72d08f Compare November 19, 2024 17:52

mix format

0f7ab42

josevalim merged commit f84177f into elixir-nx:main Nov 19, 2024
2 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Added ShrunkCovariance #309

Added ShrunkCovariance #309

norm4nn commented Nov 8, 2024 •

edited

Loading

krstopro left a comment

krstopro Nov 12, 2024

msluszniak Nov 12, 2024

krstopro Nov 12, 2024

krstopro Nov 12, 2024

norm4nn Nov 12, 2024

krstopro Nov 14, 2024

norm4nn Nov 15, 2024

krstopro Nov 15, 2024

polvalente Nov 15, 2024

krstopro Nov 15, 2024 •

edited

Loading

norm4nn Nov 16, 2024

krstopro Nov 18, 2024

josevalim Nov 19, 2024

norm4nn Nov 19, 2024

josevalim commented Nov 19, 2024

	emp_cov = Nx.covariance(x)
	emp_cov = Nx.covariance(x, Nx.reshape(0, {1}))

Added ShrunkCovariance #309

Added ShrunkCovariance #309

Conversation

norm4nn commented Nov 8, 2024 • edited Loading

krstopro left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

krstopro Nov 15, 2024 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

josevalim commented Nov 19, 2024

norm4nn commented Nov 8, 2024 •

edited

Loading

krstopro Nov 15, 2024 •

edited

Loading