Locate samples based on condition #11

AlePalu · 2023-06-24T06:47:08Z

Overview

Be able to locate samples in the domain based on a condition. Conditions might be, for instance,

presence of missing data points
high/low concentration of observations

Given a condition, we should obtain a subset of points, maybe clustered in subregions, on which we can perform different actions, for instance, randomly select a subsample.

Possible sub-problems

Filter points based on condition (locate all not-nan points)
Cluster filtered points in sub-regions (use proximity of the elements they belong to?)
Randomly sample points on a filtered group
Introduce a notion of proximity, i.e., given a point, locate the nearest point
Sample points at random on the neighborood of a clustered region
Allow to identify regions with highest/lowest points concentration

AlePalu added the enhancement New feature or request label Jun 24, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Locate samples based on condition #11

Locate samples based on condition #11

AlePalu commented Jun 24, 2023 •

edited

Loading

Locate samples based on condition #11

Locate samples based on condition #11

Comments

AlePalu commented Jun 24, 2023 • edited Loading

Overview

Possible sub-problems

AlePalu commented Jun 24, 2023 •

edited

Loading