Skip to content
This repository has been archived by the owner on Feb 8, 2024. It is now read-only.

Locate samples based on condition #11

Open
6 tasks
AlePalu opened this issue Jun 24, 2023 · 0 comments
Open
6 tasks

Locate samples based on condition #11

AlePalu opened this issue Jun 24, 2023 · 0 comments
Labels
enhancement New feature or request

Comments

@AlePalu
Copy link
Owner

AlePalu commented Jun 24, 2023

Overview

Be able to locate samples in the domain based on a condition. Conditions might be, for instance,

  • presence of missing data points
  • high/low concentration of observations

Given a condition, we should obtain a subset of points, maybe clustered in subregions, on which we can perform different actions, for instance, randomly select a subsample.

Possible sub-problems

  • Filter points based on condition (locate all not-nan points)
  • Cluster filtered points in sub-regions (use proximity of the elements they belong to?)
  • Randomly sample points on a filtered group
  • Introduce a notion of proximity, i.e., given a point, locate the nearest point
  • Sample points at random on the neighborood of a clustered region
  • Allow to identify regions with highest/lowest points concentration
@AlePalu AlePalu added the enhancement New feature or request label Jun 24, 2023
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

1 participant