-
-
Notifications
You must be signed in to change notification settings - Fork 310
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Performance suggestion: do not run unselected plugins/checks #751
Comments
In GitLab by @sigmavirus24 on Jun 5, 2020, 06:01 This would break our verbose output that tells people how many errors were |
In GitLab by @sigmavirus24 on Jun 5, 2020, 15:59 Perhaps the better way to do this is to have a |
In GitLab by @andersk on Feb 13, 2021, 12:57 pycodestyle can do this and save significant time. So surely Flake8 ought to be able to do it too, at least for some checks including the pycodestyle ones, when verbose output is not requested. $ git clone https://github.com/zulip/zulip.git
$ cd zulip; rm setup.cfg
$ time pycodestyle -qq --count .
15849
real 0m22.806s
user 0m22.759s
sys 0m0.020s
$ time pycodestyle -qq --select=E265 --count .
4
real 0m9.721s
user 0m9.680s
sys 0m0.030s
$ time flake8 -j1 -qq --count .
15831
real 0m50.552s
user 0m50.281s
sys 0m0.213s
$ time flake8 -j1 -qq --select=E265 --count .
4
real 0m50.434s
user 0m50.177s
sys 0m0.195s |
This is not only a performance optimization but also a stability improvement. What you don't run can't break. Flake8's plugin discovery can break a CI pipeline at any time when dependencies are updated, because some of the plugin libraries may change their behaviour or if something unexpected is in the importpath. If one can exactly specify which to run, this reduces the chance of such surprises.
There may be better examples, these might partially be debatable, but the problem class definitely exists in the deep fires of Python's dependency hell. |
In GitLab by @hugovk on Jun 5, 2020, 01:45
Please read this brief portion of documentation before going any further: http://flake8.pycqa.org/en/latest/internal/contributing.html#filing-a-bug
Please describe how you installed Flake8
Please provide the exact, unmodified output of
flake8 --bug-report
Please describe the problem or feature
I noticed that Flake8 takes the same time to run with
--select
as without. As shown using-vv
verbosity, it runs all the plugins and checks regardless of--select
, and only reports the selected ones afterwards.Flake8 can sometimes take a long time to run on large codebases, and if it was possible to only run the selected checks, that would save a lot of time, CPU and power.
Would it be possible to only run selected checks/plugins? Rather than running them anyway and discarding that work when reporting?
Docs
For reference, my emphasis.
flake8 --help
says--select
is for which ones to enable:The docs are a bit more explicit:
https://flake8.pycqa.org/en/latest/user/options.html#cmdoption-flake8-select
Example
An example running on the TensorFlow codebase:
Both about the same, around 5m20s.
With an ugly hack (I know this mixes plugin names with error codes, but it's just to get a rough idea, and there's other places to skip too):
About 4m30s, nearly a minute and ~13% faster.
The text was updated successfully, but these errors were encountered: