[Selector module] Implementing D-optimal designs #71

FanwangM · 2022-05-13T22:46:35Z

Information related to determinantal point processes can be found at #4.

fast greedy algorithm
review
code
two algorithms here

Ali-Tehrani · 2022-06-17T15:15:32Z

D-optimal designs is the idea of finding the subset that maximizes the determinant of the overlap matrix X^T X, where X is the feature matrix. It's first used in QSAR in the early 1993. As noted in 1), it gives good minimal, diverse set however doesn't far sample well in the inner-region, this motivated them to use a onion design, where the dataset is split into groups and the process is repeated on each group.

Determinantal point-process (DPP) can only sample up to the rank of X^T X, and so I would favor to implement the D-optimal instead. A list of algorithms for D-optimal is included in [2]. A naive algorithm would be to sample using DPP (better than random sampling), and check if the determinant of the submatrix of including the sample increased and if so, add the new sample to the list of points and repeat.

[1] "D-optimal onion designs in statistical molecular design"
[2] R. Dennis Cook & Christopher J. Nachtrheim (1980) A Comparison of
Algorithms for Constructing Exact D-Optimal Designs, Technometrics, 22:3, 315-324, DOI:
10.1080/00401706.1980.10486162

PaulWAyers · 2022-06-17T20:04:37Z

OK, let's try the d-optimal set.

I think we can also report the determinant of the Gramian as a measure of diversity? Obviously it is zero when linear dependence arises...

We have (will have) the capacity to convert distance matrices to Gramians too, and I think(?) that any sort of symmetric-positive-definite matrix can be used for to initiate D-optimal sampling, yes?

PaulWAyers · 2023-05-15T20:48:28Z

It seems this is implemented at
https://basf.github.io/doe/

FanwangM assigned Ali-Tehrani May 13, 2022

FanwangM added the feature label May 14, 2022

FarnazH added this to the release milestone May 19, 2022

FanwangM added the low priority label May 15, 2023

FanwangM changed the title ~~Implementing determinantal point processes~~ Implementing D-optimal designs May 29, 2023

FanwangM changed the title ~~Implementing D-optimal designs~~ [Selector module] Implementing D-optimal designs May 29, 2023

FanwangM unassigned Ali-Tehrani May 29, 2023

FanwangM added the help wanted Extra attention is needed label May 29, 2023

FarnazH assigned Ali-Tehrani Jun 12, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Selector module] Implementing D-optimal designs #71

[Selector module] Implementing D-optimal designs #71

FanwangM commented May 13, 2022 •

edited

Loading

Ali-Tehrani commented Jun 17, 2022 •

edited

Loading

PaulWAyers commented Jun 17, 2022

PaulWAyers commented May 15, 2023

[Selector module] Implementing D-optimal designs #71

[Selector module] Implementing D-optimal designs #71

Comments

FanwangM commented May 13, 2022 • edited Loading

Ali-Tehrani commented Jun 17, 2022 • edited Loading

PaulWAyers commented Jun 17, 2022

PaulWAyers commented May 15, 2023

FanwangM commented May 13, 2022 •

edited

Loading

Ali-Tehrani commented Jun 17, 2022 •

edited

Loading