Conversation
Thanks a lot! I think it would be better if you also created a small test case to check the FDP and power of the method on simulated data (ideally FDP < threshold and power greater than 0); you can check examples in the test folder. A related point is that we may need a general procedure for creating e-values, since as far as I'm aware there is no such implementation in Python. Note that there are many unorganized things in the knockoff module, so I plan to do a refactoring soon, but I have not found the time yet. That being said, I will do it after the merge of this PR.
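A minimal sketch of such a test, just to give the idea: here knockoff_aggregation is assumed to return the indices of the selected variables, and the import path and signature below are guesses that would need adapting to the actual API.

import numpy as np

# Hypothetical import path and signature; adjust to the real module layout.
from hidimstat.knockoff_aggregation import knockoff_aggregation


def test_knockoff_aggregation_evalues():
    n, p, n_signal, fdr = 300, 100, 10, 0.1
    rng = np.random.RandomState(42)
    X = rng.randn(n, p)
    beta = np.zeros(p)
    beta[:n_signal] = 2.0
    y = X @ beta + rng.randn(n)

    # assumed to return the indices of the selected variables
    selected = knockoff_aggregation(X, y, fdr=fdr)

    true_support = np.arange(n_signal)
    n_false = np.sum(~np.isin(selected, true_support))
    fdp = n_false / max(len(selected), 1)
    power = np.sum(np.isin(true_support, selected)) / n_signal

    assert fdp <= fdr   # empirical FDP below the nominal level on this seed
    assert power > 0.0  # at least one true variable recovered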
Indeed, this PR obviously needs tests and an example.
Also note that Zhimei Ren has an R implementation here: https://github.com/zhimeir/derandomized_knockoffs_fdr |
bthirion left a comment:
Sounds promising, thx!
@@ -11,6 +11,8 @@
To reduce the script runtime it is desirable to increase n_jobs parameter.
What is the status of this 'examples_not_exhibited' directory? If these are non-working examples, it should be removed ;-)
This is one of my TODOs for the knockoff module: after the refactoring, move knockoffs to the main branch with a different example for a fast build (the current one uses n=500, p=1000 and 2500 simulations, hence not really the friendliest example to run).
    print('Done!')


main()
We avoid this structure in examples to improve readability.
Yes, all the examples need rework IMO.
    return np.array(pvals)


def _empirical_eval(test_score, fdr=0.1, offset=1):
Why do you expose the offset parameter? I think it should be fixed.
It is exposed in the rest of the code (i.e. it is a parameter of knockoff_aggregation, etc.); should we remove it altogether?
I would, as I don't see any use case for changing it. @tbng, any opinion on this?
Yes, this was decided long before; anyway I think it's OK if we remove the offset and simply use the default value.
evals_sorted = -np.sort(-evals)  # sort in descending order
selected_index = 2 * n_features
for i in range(n_features - 1, -1, -1):
    if evals_sorted[i] >= n_features / (fdr * (i + 1)):
I think that you can avoid the for loop.
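For instance, something along these lines could work; this is only a sketch, reusing the same evals, n_features and fdr variables, and assuming the loop is looking for the largest index i satisfying the e-BH condition, with 2 * n_features as the "nothing found" sentinel:

# Loop-free equivalent (sketch): keep the largest k such that the
# k-th largest e-value is at least n_features / (fdr * k).
evals_sorted = -np.sort(-evals)  # sort in descending order
ks = np.arange(1, n_features + 1)
passing = np.nonzero(evals_sorted >= n_features / (fdr * ks))[0]
selected_index = passing[-1] if passing.size > 0 else 2 * n_features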
    else:
        return -1.0


def _ebh_threshold(evals, fdr=0.1):
This function should have a test.
Should I add a test_utils file to the test folder? The utils file is not tested as of now.
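A rough sketch of what such a test could contain, assuming _ebh_threshold returns the e-value cutoff and -1.0 when nothing can be selected (the import path is a guess):

import numpy as np

# Hypothetical import path; adjust to wherever _ebh_threshold actually lives.
from hidimstat.utils import _ebh_threshold


def test_ebh_threshold_selects_strong_evalues():
    n_features = 100
    evals = np.ones(n_features)
    evals[:5] = 10.0 * n_features  # five very large e-values
    threshold = _ebh_threshold(evals, fdr=0.1)
    # the five large e-values should clear the e-BH cutoff
    assert np.sum(evals >= threshold) == 5


def test_ebh_threshold_no_selection():
    # all e-values equal to 1: nothing should be selected at fdr=0.1
    evals = np.ones(100)
    assert _ebh_threshold(evals, fdr=0.1) == -1.0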
Resurrecting this PR as I'm working to finish it right now. Edit: with @alexblnn |
As discussed with @tbng, @bthirion and @pneuvial, this PR aims at adding knockoff aggregation using e-values, as described in Ren and Barber 2022 (https://arxiv.org/pdf/2205.15461.pdf).
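For context, a rough sketch of the per-run e-value construction from that paper: the knockoff statistics are thresholded with the knockoff+ rule at an intermediate level, and each feature receives an e-value proportional to the indicator that its statistic clears the threshold. This is only an illustration of the idea, not the code added in this PR; the function name and defaults are made up.

import numpy as np


def evalues_from_knockoff_stats(W, fdr_kn=0.1):
    """Illustrative per-run e-value construction from a vector of knockoff
    statistics W (shape (n_features,)); names and defaults are made up."""
    n_features = W.shape[0]
    # candidate thresholds: non-zero magnitudes of the statistics
    candidates = np.sort(np.abs(W[W != 0]))
    T = np.inf
    for t in candidates:
        # knockoff+ ratio (offset 1) at an intermediate level fdr_kn
        if (1 + np.sum(W <= -t)) / max(1, np.sum(W >= t)) <= fdr_kn:
            T = t
            break
    # e_j is large when W_j clears the threshold T; the normalization by the
    # number of large negative statistics is what makes the e-values valid
    return n_features * (W >= T) / (1 + np.sum(W <= -T))

Averaging these e-values over several knockoff draws and passing the average to an e-BH step (such as _ebh_threshold above) then gives the derandomized selection at the target FDR.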