
[ENH] (WIP) Creating a new Bayesian Regressor with PyMC as a backend #358

Draft
wants to merge 44 commits into base: main

Conversation

meraldoantonio
Contributor

@meraldoantonio meraldoantonio commented May 23, 2024

Reference Issues/PRs

#7

What does this implement/fix? Explain your changes.

This WIP PR implements a Bayesian Linear Regressor with PyMC as a backend.

Does your contribution introduce a new dependency? If yes, which one?

Yes - it depends on the PyMC family of packages: PyMC itself, XArray, and ArviZ.

What should a reviewer concentrate their feedback on?

The design of the BayesianLinearRegressor, especially:

  1. The introduction of the priors. For now, the class hardcodes the priors; we need to decide how users should inject their own priors (one illustrative possibility is sketched below).
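
A purely illustrative sketch, not the PR's actual design: users could pass a mapping from parameter names to (distribution name, hyperparameters) pairs, with the currently hardcoded values kept as defaults. All names here (DEFAULT_PRIORS, _make_prior, self.priors) are hypothetical.

import pymc as pm

# hypothetical default prior specification, mirroring the hardcoded priors
DEFAULT_PRIORS = {
    "intercept": ("Normal", {"mu": 0.0, "sigma": 10.0}),
    "slopes": ("Normal", {"mu": 0.0, "sigma": 10.0}),
    "noise": ("HalfNormal", {"sigma": 1.0}),
}

def _make_prior(name, user_priors, **kwargs):
    """Instantiate the PyMC prior for `name`, preferring a user-supplied spec."""
    dist_name, params = user_priors.get(name, DEFAULT_PRIORS[name])
    return getattr(pm, dist_name)(name, **params, **kwargs)

# inside the model context in _fit, e.g.:
#     self.slopes = _make_prior("slopes", self.priors, shape=self._X.shape[1])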

Did you add any tests for the change?

Not yet

Any other comments?

N/A

PR checklist

For all contributions
  • I've added myself to the list of contributors with any new badges I've earned :-)
    How to: add yourself to the all-contributors file in the skpro root directory (not the CONTRIBUTORS.md). Common badges: code - fixing a bug, or adding code logic. doc - writing or improving documentation or docstrings. bug - reporting or diagnosing a bug (get this plus code if you also fixed the bug in the PR). maintenance - CI, test framework, release.
    See here for full badge reference
  • [x] The PR title starts with either [ENH], [MNT], [DOC], or [BUG]. [BUG] - bugfix, [MNT] - CI, test framework, [ENH] - adding or improving code, [DOC] - writing or improving documentation or docstrings.
For new estimators

(This is not yet done)

  • I've added the estimator to the API reference - in docs/source/api_reference/taskname.rst, follow the pattern.
  • I've added one or more illustrative usage examples to the docstring, in a pydocstyle compliant Examples section.
  • If the estimator relies on a soft dependency, I've set the python_dependencies tag and ensured dependency isolation, see the estimator dependencies guide.


@meraldoantonio meraldoantonio marked this pull request as draft May 23, 2024 17:11
fkiraly added a commit that referenced this pull request May 25, 2024
This PR removes the legacy base modules.

* base class: equivalent functionality is now contained in
`BaseDistribution`, `BaseProbaRegressor`, `_DelegatedProbaRegressor`
* pymc vendor interface: currently worked on in
#358
* density estimation: tracked via
#7
# Priors for unknown model parameters
self.intercept = pm.Normal("intercept", mu=self.intercept_mu, sigma=self.intercept_sigma)
self.slopes = pm.Normal("slopes", mu=self.slopes_mu, sigma=self.slopes_sigma, shape=self._X.shape[1], dims="pred_id")
self.noise = pm.HalfNormal("noise", sigma=self.noise_sigma)
Collaborator

would inverse gamma not be more standard here, as it is conjugate to the normal?
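
For reference, a minimal PyMC sketch of this suggestion, assuming the conjugate InverseGamma prior is placed on the noise variance rather than on the standard deviation; the hyperparameter values are purely illustrative.

import pymc as pm

with pm.Model():
    # conjugate choice: InverseGamma prior on the noise *variance*
    noise_var = pm.InverseGamma("noise_var", alpha=2.0, beta=1.0)
    # derived standard deviation, keeping the sigma parameterization used above
    noise = pm.Deterministic("noise", pm.math.sqrt(noise_var))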

@fkiraly fkiraly added enhancement module:regression probabilistic regression module labels Sep 7, 2024
Collaborator

@fkiraly fkiraly left a comment

Nice contribution!

Some high-level points:

  • could you split the notebook off into a chained PR, based on the estimator PR? The notebook may require some more review (time), and should not block the estimator.
  • in the estimator, I would kindly ask you to remove the visualization dependencies from python_dependencies, and instead introduce dependency checks in the methods that need them. This way, users do not need the visualization dependencies when using the model in a deployment pipeline (a rough sketch of such an in-method check follows below).
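
A rough sketch of the kind of in-method check meant here, assuming a hypothetical plotting method and a fitted-inference-data attribute named idata_; the names are illustrative, not skpro's actual API.

def plot_posterior(self):
    """Plot posterior marginals; requires the optional plotting dependencies."""
    try:
        import arviz as az  # soft dependency, imported only when plotting
        import matplotlib  # noqa: F401  # backend needed by arviz plotting
    except ImportError as err:
        raise ImportError(
            "plot_posterior requires the optional dependencies 'arviz' and "
            "'matplotlib'; please install them to use this method."
        ) from err
    return az.plot_posterior(self.idata_)  # idata_ is a hypothetical attribute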

@fkiraly
Collaborator

fkiraly commented Oct 4, 2024

Strange import error - is this related to an upper bound of any of the imports implied by 3.9, e.g., scipy?

@meraldoantonio
Contributor Author

Strange import error - is this related to an upper bound of any of the imports implied by 3.9, e.g., scipy?

Apparently there is a bug with Arviz 0.17 and scipy>=1.13 (source 1) (source 2).

The bug is no longer present in Arviz 0.18, but that version requires Python 3.10 and above.

As a temporary solution, I've locked the scipy version in all-extras in pyproject.toml.

@fkiraly
Collaborator

fkiraly commented Oct 5, 2024

Makes sense.

From a maintenance perspective, applying the version bound in the pyproject.toml is not a good solution, since the lock is implied only by a single estimator, and not by scipy itself.

Could you add the lock instead in the python_dependencies tag of the estimator, and revert the changes to pyproject?

@meraldoantonio
Contributor Author

meraldoantonio commented Oct 6, 2024

Makes sense.

From a maintenance perspective, applying the version bound in the pyproject.toml is not a good solution, since the lock is implied only by a single estimator, and not by scipy itself.

Could you add the lock instead in the python_dependencies tag of the estimator, and revert the changes to pyproject?

Makes sense! But I've tried this a couple of times and for some reason, without the pyproject.toml lock, the test framework keeps installing the "wrong" version of scipy (version 1.13.1), even after specifying "scipy<=1.12.0" in the python_dependencies tag...

It might be that other libraries are pulling in a conflicting version, but I haven't managed to find the exact cause.

Any ideas?

@fkiraly
Collaborator

fkiraly commented Oct 7, 2024

Any ideas?

Why are you trying to bound scipy instead of arviz? Based on your statements, I would simply bound arviz>=0.18, as well as python_version >= 3.10.
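
A minimal sketch of how such bounds could look in the estimator's tags, assuming the usual python_dependencies and python_version tags; exact tag names and placement should be checked against the skpro estimator dependencies guide.

# inside the BayesianLinearRegressor class body (sketch)
_tags = {
    # soft dependencies checked by the framework before the estimator is used
    "python_dependencies": ["pymc", "arviz>=0.18", "xarray"],
    # arviz 0.18 requires Python 3.10 or later
    "python_version": ">=3.10",
}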

@fkiraly
Collaborator

fkiraly commented Oct 7, 2024

PS: why did you close the notebook PR? That was a nice notebook, and indeed it would be nice as a separate PR.

Labels
enhancement module:regression probabilistic regression module
Projects
Status: PR in progress