reporters: report jobs separately #721

ryneeverett · 2022-09-26T19:02:31Z

Adapted from #305.

I made some adjustments per the review comment:

2. The `separate` key is nowhere documented or put into the example config, also please rethink where we place the separate key -- should it be per-reporter or a global config?

Is this what you had in mind?

thp

Looks clean and simple. Added some nitpick comments.

Please also add to /CHANGELOG.md in the ## UNRELEASED section in ### Added an item like this:

- New `separate` configuration option for reporters to split reports into one-per-job (contributed by Ryne Everett)

The more I think about it, the more I feel like it should be like details and footer (configurable in the same spots).

docs/source/configuration.rst

lib/urlwatch/reporters.py

docs/source/configuration.rst

ryneeverett · 2022-09-27T16:19:21Z

Moving the option into the format-specific configuration sections makes sense to me given the current configuration hierarchy. Wouldn't it also make sense for it to go into html?

thp · 2022-10-03T11:28:49Z

Moving the option into the format-specific configuration sections makes sense to me given the current configuration hierarchy. Wouldn't it also make sense for it to go into html?

Yes, of course, html also makes sense.

scottmac · 2022-11-08T19:00:16Z

I'm interested in this change, @ryneeverett are you ready for another review?

thp

The __base_kind__ is a good idea, we should use that in other places too by implementing a get_base_config() (see comments) plus making sure we don't fail if a custom ReporterBase subclass doesn't have __base_kind__.

thp · 2022-11-25T08:58:24Z

lib/urlwatch/reporters.py

@@ -134,7 +134,11 @@ def submit_all(cls, report, job_states, duration):
            if cfg['enabled']:
                any_enabled = True
                logger.info('Submitting with %s (%r)', name, subclass)
-                subclass(report, cfg, job_states, duration).submit()
+                if report.config['report'][subclass.__base_kind__].get('separate', False):


This probably won't work for fully custom reporters (out-of-tree) that just subclass from ReporterBase. Not sure if such scripts exist, but we don't currently disallow such reporters. For this reason, one option could be to use:

base_kind = subclass.get('__base_kind__', None) if base_kind in report.config['report']: ...

I do like the __base_kind__ classification, maybe you can also use it in the rest of the code? E.g. the HTML reporter has this currently at the beginning of def _parts(self)::

cfg = self.report.config['report']['html']

This could be:

cfg = self.report.config['report'][self.__base_kind__]

..and since this is probably used in other places as well, we could have (in ReporterBase):

@classmethod def get_base_config(cls, report): if not hasattr(cls, '__base_kind__'): return {} return report.config['report'].get(cls.__base_kind__, {})

Then the first line in HtmlReporter._parts() becomes:

cfg = self.get_base_config(self.report)

And the line this comment is attached to becomes:

if subclass.get_base_config(report).get('separate', False):

(all of this is untested, but assumed working)

Please also check if there are other occurrences where the string of __base_kind__ is used currently an could be replaced with either self.get_base_config() or self.__base_kind__ (doing a hasattr() check first if the check is in a base class).

I'm not necessarily opposed to going this route but this is the point where I would ask -- if we're going to allow __base_kind__ to permeate the code, would it be better to enforce it's existence by throwing an error? (Perhaps with an abstract base class?) If we want to maintain backwards compatibility with unknown third party reporters, perhaps at least a warning that it ought to be defined and may be an error in a future release? Even if returning an empty dictionary works fine now, I don't think there's any way to guarantee it will continue working for all usages in the future and it strikes me as likely to lead to confusing behavior.

The "cleanest" way would probably just to look at the MRO (method resolution order) for a class, this way finding the "base" kind automatically, and making it possible to get a "merged" dict of all config options (subclass configs overriding non-subclass configs).

The method resolution order can be found as __mro__ on the class object.

Like this:

% PYTHONPATH=lib python3 Python 3.10.8 (main, Oct 13 2022, 09:48:40) [Clang 14.0.0 (clang-1400.0.29.102)] on darwin Type "help", "copyright", "credits" or "license" for more information. >>> from urlwatch.reporters import ShellReporter >>> ShellReporter.__mro__ (<class 'urlwatch.reporters.ShellReporter'>, <class 'urlwatch.reporters.TextReporter'>, <class 'urlwatch.reporters.ReporterBase'>, <class 'object'>) >>>

Then when using just __kind__ on the base class:

diff --git a/lib/urlwatch/reporters.py b/lib/urlwatch/reporters.py index 39150bc..46456f7 100644 --- a/lib/urlwatch/reporters.py +++ b/lib/urlwatch/reporters.py @@ -262,6 +262,8 @@ class HtmlReporter(ReporterBase): class TextReporter(ReporterBase): + __kind__ = 'text' + def submit(self): cfg = self.report.config['report']['text'] line_length = cfg['line_length']

You can get all "kinds" like this:

>>> from urlwatch.reporters import ShellReporter >>> [getattr(cls, '__kind__', None) for cls in ShellReporter.__mro__] ['shell', 'text', None, None]

(note that the first None there is from ReporterBase, and the second None is from object)

Getting an "inherited config" could look like this:

>>> config = {} >>> for key in reversed([getattr(cls, '__kind__', None) for cls in ShellReporter.__mro__]): ... if key is not None: ... config.update(self.report.config['report'][key]) ... # here, you should be able to use "config"

I know this is a mouthful, but that's probably the nicest way to implement this.

(in theory we could also give ReporterBase a __kind__ of some kind (sic), e.g. all and then just make sure that the default config has all: separate: false by default, and then it would even magically work to:

Set separate: true for all, but not set it for anything else -> separate reports for everything

Set separate: true for all, and separate: false for text -> separate reports for all but text-based ones

Set separate: false for all, separate: truefortext, separate: falseforshell` -> separate reports only for text-based ones, except shell

I think the all configuration idea is a good one but I think improving the "Reporter" section of the Configuration docs is a higher priority at the moment. The docs hint that there is a configuration hierarchy but you have to read reporters.py to find out what it is. I say "hint" because as an end user approaching the docs it isn't clear that the reporters are even python classes. Seems like at the minimum we could provide a hyperlink to reporters.py on github but better would be to just outline the hierarchy in the docs. Ideally the outline would be generated by an RST macro but that might not be worth the effort.

Another thing I think would be really helpful in general is configuration validation. For example, I just found out the hard way that reporters' "enabled" key is required and if you omit it you get a KeyError traceback. Pydantic is my preferred solution for such problems because it encourages you to extract out any implementation details of the configuration and also makes documentation easier because all the information is in one place.

Feel free to do the validation thing as a separate PR. This PR is nearly ready (see the comment I just posted), once that is fixed, I think this can be merged -- what do you think?

Oh yeah, I wouldn't want to increase the scope of this PR.

lib/urlwatch/reporters.py

thp · 2022-12-12T16:24:03Z

Please rebase against master so that the CI tests can run though (see #733).

Adapted from thp#305.

lib/urlwatch/reporters.py

thp · 2022-12-18T08:11:57Z

Merged, thanks!

This PR should make the `--test-reporter` option respect `separate` flag too.. It's just applying the changes already made with thp#721 from `submit_all` method to the `submit_one` method as well. fixes thp#771 I hope this is ok so far, because my python is not the best 😉

thp requested changes Sep 27, 2022

View reviewed changes

docs/source/configuration.rst Outdated Show resolved Hide resolved

lib/urlwatch/reporters.py Outdated Show resolved Hide resolved

docs/source/configuration.rst Outdated Show resolved Hide resolved

ryneeverett force-pushed the separate-job-reports branch from cbe8dd7 to 8d13b48 Compare September 27, 2022 16:15

ryneeverett force-pushed the separate-job-reports branch 2 times, most recently from 218ca3f to 27ea055 Compare November 20, 2022 02:38

ryneeverett requested a review from thp November 20, 2022 02:39

thp requested changes Nov 25, 2022

View reviewed changes

ryneeverett force-pushed the separate-job-reports branch from 27ea055 to be55cf1 Compare December 5, 2022 03:46

ryneeverett force-pushed the separate-job-reports branch from be55cf1 to 9c6bb73 Compare December 14, 2022 17:38

reporters: report jobs separately

695e206

Adapted from thp#305.

ryneeverett force-pushed the separate-job-reports branch from 9c6bb73 to 695e206 Compare December 14, 2022 17:38

ryneeverett marked this pull request as draft December 14, 2022 18:26

ryneeverett marked this pull request as ready for review December 14, 2022 18:46

thp reviewed Dec 17, 2022

View reviewed changes

lib/urlwatch/reporters.py Show resolved Hide resolved

thp approved these changes Dec 18, 2022

View reviewed changes

thp merged commit 5385365 into thp:master Dec 18, 2022

marunjar mentioned this pull request Oct 24, 2023

--test-reporter option is ignoring separated flag #771

Closed

marunjar mentioned this pull request Oct 24, 2023

reporters: test report jobs separately #772

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

reporters: report jobs separately #721

reporters: report jobs separately #721

ryneeverett commented Sep 26, 2022

thp left a comment

ryneeverett commented Sep 27, 2022

thp commented Oct 3, 2022

scottmac commented Nov 8, 2022

thp left a comment

thp Nov 25, 2022

ryneeverett Dec 5, 2022

thp Dec 5, 2022

thp Dec 5, 2022

ryneeverett Dec 14, 2022

ryneeverett Dec 14, 2022

thp Dec 17, 2022

ryneeverett Dec 18, 2022

thp commented Dec 12, 2022

thp commented Dec 18, 2022

reporters: report jobs separately #721

reporters: report jobs separately #721

Conversation

ryneeverett commented Sep 26, 2022

thp left a comment

Choose a reason for hiding this comment

ryneeverett commented Sep 27, 2022

thp commented Oct 3, 2022

scottmac commented Nov 8, 2022

thp left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

thp commented Dec 12, 2022

thp commented Dec 18, 2022