Replace NNI with auto-sklearn, and add some testing #4

benclifford · 2021-02-26T14:21:06Z

This PR:

uses auto-sklearn instead of NNI (although it does not remove the NNI code)
adds a testing script
makes the code run from the root of a blindml checkout, rather than /Users/maksim

This probably means that the code will only work when run from the blindml source directory, but it's better than not running.
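One way to lift the "only works from the source directory" restriction is to resolve paths against the location of a module instead of the current directory. This is a minimal sketch, not what the PR does: the helper names and the assumed layout (a module sitting at `<checkout>/blindml/<module>.py`) are hypothetical.

```python
from pathlib import Path


def repo_root(module_file: str) -> Path:
    """Return the checkout root for a module assumed to live at <root>/blindml/<module>.py."""
    # parent -> <root>/blindml, parent.parent -> <root>
    return Path(module_file).resolve().parent.parent


def data_path(module_file: str, *parts: str) -> Path:
    """Build a path under the checkout root, independent of the current directory."""
    return repo_root(module_file).joinpath(*parts)
```

Inside a module, `data_path(__file__, "tests", "data.csv")` would then point at the same file regardless of where the process was started.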

… least tooling as installed by this is broken right now for me in a clean venv:

- pip-compile is broken because pip-tools breaks with recent pip, and apparently the requirements.txt doesn't pin that stuff hard enough.
- the breakage only happens when I try to add in auto-sktools to requirements.txt and rebuild...

This is an upgrade of pip-tools and a rerun of the regenerate/reinstall until a fixpoint is reached in my env.

First, manually edit requirements.txt (because pip-compile can't do this itself) to constrain pip-tools>=5.5.

Then run this over and over until there is no more git diff (looks like one iteration is ok for me actually):

    rm -rfv ~/cqx/c/datastation/virtualenv/ && virtualenv /home/benc/cqx/c/datastation/virtualenv/ --python=python3 && pip install -r requirements.txt --extra-index-url https://pypi.anaconda.org/scipy-wheels-nightly/simple && pip-compile --extra-index-url https://pypi.anaconda.org/scipy-wheels-nightly/simple
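The "run it over and over until no more git diff" step could be automated roughly as follows. This is only a sketch: `regenerate_until_stable` and `file_digest` are hypothetical helpers, and the regenerate step is passed in as a callable so the loop itself stays independent of pip, virtualenv, or network access.

```python
import hashlib
from pathlib import Path
from typing import Callable


def file_digest(path: Path) -> str:
    """Hash the file contents so two versions can be compared cheaply."""
    return hashlib.sha256(path.read_bytes()).hexdigest()


def regenerate_until_stable(
    path: Path, regenerate: Callable[[], None], max_iters: int = 10
) -> int:
    """Call regenerate() until `path` stops changing; return the number of calls made."""
    before = file_digest(path)
    for iteration in range(1, max_iters + 1):
        regenerate()
        after = file_digest(path)
        if after == before:
            # Fixpoint: this run left the file untouched.
            return iteration
        before = after
    raise RuntimeError(f"{path} still changing after {max_iters} iterations")
```

In practice `regenerate` would wrap the reinstall and `pip-compile` commands quoted in the commit message, and `path` would be `requirements.txt`.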


makslevental · 2021-02-26T16:31:53Z

blindml/frontend/config/task/task.py

+                # ^ should this drop the y_col even when there isn't a drop_cols
+                # configuration?


makslevental · Feb 26, 2021

yes because otherwise we're training on the y_col to predict y_col (i.e. poisoning the training)

benclifford · Mar 1, 2021

i guess if the user specifies the ycol in their explicitly specified X_cols, that's their own fault, though?

makslevental · Mar 1, 2021

yup major party foul (but there should also probably be a check)

benclifford · Apr 9, 2021

it looks like the explicit X_cols code path was broken anyway - I've put in a fix and added a test case.

I've also added in an assert in the explicit X_cols path for "no ycol in xcols"
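The behaviour agreed in this thread can be sketched as below. This is not the code in `task.py`: the function name and signature are hypothetical, and only the two rules come from the discussion (always drop `y_col` on the implicit path, assert that it is absent on the explicit `X_cols` path).

```python
from typing import List, Optional, Sequence


def select_feature_columns(
    all_cols: Sequence[str],
    y_col: str,
    x_cols: Optional[Sequence[str]] = None,
    drop_cols: Optional[Sequence[str]] = None,
) -> List[str]:
    """Pick the training feature columns without leaking the target column."""
    if x_cols is not None:
        # Explicit path: the user chose the features, so refuse a leaked target.
        assert y_col not in x_cols, f"y_col {y_col!r} must not appear in X_cols"
        return list(x_cols)
    # Implicit path: always drop y_col, whether or not drop_cols is configured.
    excluded = set(drop_cols or ()) | {y_col}
    return [c for c in all_cols if c not in excluded]
```

Since `assert` statements are skipped under `python -O`, raising `ValueError` instead would be a sturdier choice if the check has to hold in optimized runs.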

blindml/frontend/config/task/task.py

blindml/runner.py

requirements.in

makslevental · 2021-02-26T16:39:25Z

Should've put this in the review whoops: please remove references to nni (including helper/util code)

Prior to this, auto-sklearn was only called in regression mode

can i get a classification test/demo?

does the previous NNI implementation in blindml support classification, to test my changes against? I couldn't get it to work on a simple csv that i made up
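For the regression versus classification question, a dispatch along these lines is one option. It is a sketch under assumptions: the `task_type` strings and both helper names are hypothetical, not blindml's configuration. The auto-sklearn classes `AutoSklearnRegressor` and `AutoSklearnClassifier`, and their `time_left_for_this_task` argument, are real, but the import is deferred so the mapping can be checked without auto-sklearn installed.

```python
import importlib

# Hypothetical task-type names mapped to the matching auto-sklearn estimator class.
_ESTIMATORS = {
    "regression": "AutoSklearnRegressor",
    "classification": "AutoSklearnClassifier",
}


def estimator_name(task_type: str) -> str:
    """Return the auto-sklearn class name for a task type, or raise ValueError."""
    try:
        return _ESTIMATORS[task_type]
    except KeyError:
        raise ValueError(f"unsupported task type: {task_type!r}") from None


def make_automl(task_type: str, time_budget_s: int):
    """Instantiate the estimator; auto-sklearn is only imported at this point."""
    name = estimator_name(task_type)  # validate before importing anything
    module = importlib.import_module(f"autosklearn.{task_type}")
    return getattr(module, name)(time_left_for_this_task=time_budget_s)
```

A classification demo would then differ from the regression one only in the `task_type` value and the dataset.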


benclifford added 5 commits February 26, 2021 14:16

Replace hardcoded ~maksim paths with .

2ae7dea

This probably means that the code will only work when run from the blindml source directory, but it's better than not running.

Add flake8 dep and config, with disables to disable what is already b…

9456b14

…roken

This gives basic linting for style errors and some basic static semantic checking. Future commits could tidy up the broken flake8 tests and re-enable them.

Run with: $ flake8

Add statistical test for basic sanity of behaviour

c3e9e2b

Remove existing test that fails for me.

Adjust path of other existing test to run from blindml repo root for when it is invoked by pytest there.

Add auto-sklearn requirement

fb17c36

benclifford requested a review from makslevental February 26, 2021 14:21

benclifford added 2 commits February 26, 2021 14:47

Replace NNI calls with auto-sklearn calls

1bb0a4b

fix flake8

f8aebb1

makslevental suggested changes Feb 26, 2021


benclifford added 5 commits March 1, 2021 10:44

Remove TODO/question/comment that is answered

4a4dc9c

remove some commented out code

5b30a11

add missing dot

7c840fa

Remove nni prereq package, and everything that breaks in quick-test.s…

d6ae253

…h because of that

benclifford force-pushed the benc-replace-nni-autosklearn branch from 92bada1 to d6ae253 Compare March 1, 2021 14:08

benclifford added 3 commits April 9, 2021 08:49

Fix broken explicit-X_cols codepath and add a test

c7b8b9d

check that y col is not in x cols when explicitly specified

e96a0d9

fix flake8 for quick test

68939be

makslevental approved these changes Apr 9, 2021


3 participants
