Hyperparameter mutation probabilities and gradual changes #103
Conversation
One idea could be to merge the Trial class and the NodeLabel class. That may simplify the code, and it would be consistent with the plan of the GraphIndividual graph holding objects that each have their own mutation/crossover methods.
node.hyperparameters = self.select_config_dict(node)[node.method_class](config.hyperparametersuggestor)

if not completed_one:
Would the else part of this if ever get hit, given that completed_one is always set to False at the start? Maybe just use the hyper_node_probability?
Good catch, just fixed it.
The completed_one flag is there to guarantee that at least one node has its hyperparameters mutated.
The else would still not get hit because of the placement of the return True statement, right? If the plan is just to mutate one node then this works fine, but if you want each node to have a probability of being mutated, this won't accomplish that.
_mutate_hyperparameters should return True only if hyperparameters were actually changed. If the first node happens to be one with a fixed set of hyperparameters, it would return True without doing anything, leading to a duplicate individual. This is eventually caught later in the population class, which loops through mutations until the individual is unique, but it might be a good idea to catch it here too. If a user wants to allow repeat individuals, those should probably come from evolution finding the same solution a second time rather than from a mutation function that doesn't do anything.
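A minimal sketch of the behavior discussed above (hypothetical names and data layout, not the actual TPOT implementation): mutate each node's hyperparameters with probability hyper_node_probability, fall back to forcing one mutation attempt so at least one node is touched, and return True only when something actually changed.

```python
import random

def mutate_one(node, rng):
    """Resample one hyperparameter; return True only if its value changed."""
    old = dict(node["params"])
    key = rng.choice(list(node["params"]))
    node["params"][key] = rng.choice(node["space"][key])
    return node["params"] != old

def mutate_hyperparameters(nodes, hyper_node_probability, rng=random):
    """Mutate each node with probability hyper_node_probability.
    Guarantees at least one mutation attempt, and reports whether
    any hyperparameter actually changed."""
    candidates = [n for n in nodes if n.get("tunable", True)]
    if not candidates:
        # Nothing can change; the caller should not treat this as a mutation.
        return False
    changed = False
    for node in candidates:
        if rng.random() < hyper_node_probability:
            changed |= mutate_one(node, rng)
    if not changed:
        # Fallback: force one attempt on a random node. This can still
        # resample the same value, in which case we honestly return False
        # and let the caller retry, avoiding silent duplicate individuals.
        changed = mutate_one(rng.choice(candidates), rng)
    return changed
```

Returning the real "did anything change" status pushes the duplicate-individual check down to where the information exists, instead of relying solely on the uniqueness loop in the population class.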
Could we not just add a
That would work.
Some of the checks have failed. After looking through the tox logs, it looks like the error happened on
The following change to that line should fix those errors:
There is another failed check happening at
My intuition is telling me that we are passing in an old_params dictionary that is empty and the
What does this PR do?
Added three parameters to GraphIndividual (and to the Estimator) to better control the probabilities of hyperparameter mutations.
- hyperparameter_probability: float from 0 to 1. The fraction of hyperparameters that get mutated per node (at least one hyperparameter will be updated).
- hyper_node_probability: float from 0 to 1. The fraction of nodes that get their hyperparameters updated (at least one node will be updated).
- hyperparameter_alpha: float from 0 to 1. Used to calculate a weighted average between the new hyperparameter value and the old one: new x alpha + old x (1 - alpha). A value of 1 means the new value is selected.

The config.hyperparameter file used to have separate functions; these have been grouped into a Trial class, which makes the code easier to read. It also allows the features above to be implemented without changing the Optuna-compatible API. Furthermore, the individual nodes now store the Optuna-suggested hyperparameters in addition to the final hyperparameters returned by the param function (these are not necessarily identical).
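A minimal sketch of the hyperparameter_alpha blending described above (hypothetical helper name, not the actual Trial implementation), assuming numeric hyperparameters:

```python
def blend_hyperparameter(new_value, old_value, alpha):
    """Weighted average of the new suggestion and the old value:
    new * alpha + old * (1 - alpha). alpha = 1.0 selects the new
    value outright; alpha = 0.0 keeps the old value."""
    blended = new_value * alpha + old_value * (1.0 - alpha)
    # Preserve integer-typed hyperparameters (e.g. a tree count)
    # by rounding back to int when both endpoints are ints.
    if isinstance(old_value, int) and isinstance(new_value, int):
        return round(blended)
    return blended
```

With alpha below 1, repeated mutations take smaller steps toward newly suggested values, which is the "gradual changes" behavior this PR aims to enable.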
Any background context you want to provide?
This will make it easier to specify probabilities for hyperparameter changes. The inclusion of the alpha parameter allows for gradual changes in a hyperparameter's value, which may make good values easier to learn; this still needs to be investigated.
What are the relevant issues?
This may be helpful for #84