-
Notifications
You must be signed in to change notification settings - Fork 67
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Add window function #98
base: master
Are you sure you want to change the base?
Conversation
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thanks for the PR. My initial impression is that this is reasonable, but let me give it a some thought first.
when no collection is provided. For a sliding window containing each element | ||
and n-1 _following_ elements, use `clojure.core/partition` with a `step` size | ||
of 1." | ||
{:added "1.10.0"} |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Should be 1.9.0 I believe.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Yes, sorry I assumed 1.9 had been cut. Will correct.
(letfn [(part [part-n coll] | ||
(let [run (doall (take part-n coll))] | ||
(lazy-seq | ||
(when (== part-n (count run)) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Is using ==
over =
due to performance reasons? What do the benchmarks come out with when comparing the two?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Yeah, performance hunch. Not particularly thought out. Will run in through criterium, but not expecting a meaningful difference.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
(c/benchmark-round-robin [(dorun (window== 1000 (range 10000))) (dorun (window= 1000 (range 10000)))] c/*default-benchmark-opts*)
;; window== sample mean: 0.29017708853649266
;; window= sample mean: 0.3584622250301741
;; window== mean: 0.3225724956375
;; window= mean: 0.3290516003666667
my stats isn't strong enough to be sure sample mean is a better indicator than mean, so i'll leave both means for you to look at. either a ~20% difference or a 2% difference. but both in favour of ==
over =
so probably worth keeping, but I'm easy.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
from my reading, sample mean will tend to emphasise the impact of outliers, thereby highlighting variance in one direction or another. so in this case, larger sample-mean draws attention to the fact a few of the test runs show significantly slower runs for =
I've had the lazy-only version of this function in quite a few codebases over the years, and recently took the time to write the transducer version. The cljs support is particularly annoying to write because cljs's
array-list
type doesn't implement aremove
method.This is like a simple alternative to https://github.com/cgrand/xforms/ 's
window
transducer, which needsf
andinvf
functions to construct the windows. Plus this has a standard lazy-arity.An example use case: