Pandas update #121

nickotto · 2024-03-21T20:56:58Z

[please review the Contribution Guidelines prior to submitting your pull request. go ahead and delete this line if you've already reviewed said guidelines.]

What does this PR do?

Creates a new column in evaluated individuals in "Eval Error". This is to keep the columns for scores as floats without strings. All evaluation errors would be in the "Eval Error" column which is an object dtype. This would resolve any incompatibilities with pandas 2.0+. Updated for both steady state and base estimator version of tpot2

Where should the reviewer start?

Simply run tpot2 with single and multiple objectives. There should be no warning about incompatibility as we saw before when pandas was above 2.0+.

How should this PR be tested?

Run across different python versions.

Co-authored-by: Pedro Ribeiro <[email protected]>

perib · 2024-03-23T00:57:17Z

While trying to print out some of the evaluated individual columns, I noticed a simple bug in how str was defined from the graph individuals. it normally works by exporting a pipeline then printing the string for that, but that can fail if the hyperparameters are invalid. Just added a try-except block to catch those cases. (This will be changed again in the next update, so I didn't want to make a whole new PR for that.)

There is an edge case where if an individual is created, but its evaluation is incomplete, the value in Eval Errors column in np.nan instead of None. This can happen if the global timeout is triggered (max_time_seconds). We don't want to label those as "timeout" since that should be reserved for going over max_eval_time_seconds.

But I'm not sure if we can change the default missing value in pandas to None, and it doesn't allow us to add nans at the same time we add the strings for the error. I think we just leave it as is for now?

jay-m-dev and others added 4 commits February 15, 2024 14:50

Merge pull request #116 from EpistasisLab/main (#118)

3c771d4

Co-authored-by: Pedro Ribeiro <[email protected]>

pandas update 2.2.0+

4143a6b

new column "Eval Error" and dtype adjustments

13385a5

multiobjective correction

ac61118

nickotto changed the base branch from main to dev March 22, 2024 23:23

removed unused var, quick fix for printing invalid inds

3a608c1

steady_state_fix

537908b

perib merged commit 14922f6 into dev Mar 27, 2024
2 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Pandas update #121

Pandas update #121

nickotto commented Mar 21, 2024

perib commented Mar 23, 2024

Pandas update #121

Pandas update #121

Conversation

nickotto commented Mar 21, 2024

What does this PR do?

Where should the reviewer start?

How should this PR be tested?

perib commented Mar 23, 2024