Why choose to not use numpy? #4

jonasvdd · 2022-07-28T16:01:54Z

Hi, really cool repo 🚀

I am 100% sure that this code can be significantly sped up by using numpy vectorisation.

I was able to write a pure numpy-python implementation which could downsample 50,000,000 -> 2,000 data points in ~300ms (on a consumer PC).

If you are interested, I can quickly make a PR that puts the numpy-python version of the algorithm next to yours.

Cheers,
Jonas

sfc-gh-puneet-lakhanpal · 2023-08-11T23:36:33Z

Hi @jonasvdd, are you able to submit the PR for the numpy vectorized version?

Thank you.

jonasvdd · 2023-08-25T08:47:12Z

Sure!

jonasvdd · 2023-08-25T08:58:21Z

Just a few questions:

def largest_triangle_three_buckets(data, threshold):
    """
    Return a downsampled version of data.
    Parameters
    ----------
    data: list of lists/tuples
        data must be formated this way: [[x,y], [x,y], [x,y], ...]
                                    or: [(x,y), (x,y), (x,y), ...]
    threshold: int
        threshold must be >= 2 and <= to the len of data
    Returns
    -------
    data, but downsampled using threshold

Why did you opt for a list of tuples as the data type? It is not very performant when operating on large arrays. If you are considering numpy support, I would accept array-like arguments instead, which means splitting the data argument into x and y, e.g.:

def largest_triangle_three_buckets(x, y, threshold):
    """
    ....

    Parameters
    ----------
    x: list | np.ndarray
        ....

How should I add the numpy dependency? (via requirements.txt / pyproject.toml)
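For illustration, the `x`, `y` signature proposed above could look roughly like the sketch below. This is not the implementation mentioned earlier in the thread and makes no claim about its speed; the name `lttb_numpy` and the parameter `n_out` (playing the role of `threshold`) are hypothetical. It still loops over the buckets in Python and only vectorises the triangle-area computation inside each bucket, and it assumes `x` is sorted in ascending order.

```python
import numpy as np


def lttb_numpy(x, y, n_out):
    """Downsample (x, y) to n_out points with Largest-Triangle-Three-Buckets.

    Hypothetical sketch: x and y are array-likes of equal length and
    n_out must satisfy 2 <= n_out <= len(x).
    """
    x = np.asarray(x, dtype=float)
    y = np.asarray(y, dtype=float)
    n = x.shape[0]
    if y.shape[0] != n:
        raise ValueError("x and y must have the same length")
    if not 2 <= n_out <= n:
        raise ValueError("n_out must satisfy 2 <= n_out <= len(x)")
    if n_out == n:
        return x.copy(), y.copy()
    if n_out == 2:
        return x[[0, -1]], y[[0, -1]]

    # Bucket edges over the interior points; the first and last
    # points are always kept and belong to no bucket.
    edges = np.linspace(1, n - 1, n_out - 1).astype(int)

    selected = np.empty(n_out, dtype=np.int64)
    selected[0] = 0
    selected[-1] = n - 1

    a = 0  # index of the most recently selected point
    for i in range(n_out - 2):
        lo, hi = edges[i], edges[i + 1]

        # Third triangle vertex: mean of the next bucket, or the
        # last point when the current bucket is the final one.
        if i + 2 < n_out - 1:
            nlo, nhi = edges[i + 1], edges[i + 2]
            cx, cy = x[nlo:nhi].mean(), y[nlo:nhi].mean()
        else:
            cx, cy = x[-1], y[-1]

        # Twice the triangle area for every candidate in the bucket,
        # computed in one vectorised expression.
        areas = np.abs(
            (x[a] - cx) * (y[lo:hi] - y[a])
            - (x[a] - x[lo:hi]) * (cy - y[a])
        )
        a = lo + int(np.argmax(areas))
        selected[i + 1] = a

    return x[selected], y[selected]
```

A call such as `xs, ys = lttb_numpy(x, y, 2000)` returns two arrays of length 2000. Existing callers that pass `[[x, y], ...]` could be supported by a thin wrapper that splits the columns first.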
