conditional gfn #188

josephdviviano · 2024-09-25T15:11:25Z

Supports conditioning on a tensor of shape=[n_trajectories, n_cond_dims]. This is passed by the user during a call to the sampler.

Implemented for all GFlowNets. Note that the current version expects a particular kind of estimator. I can imagine this will lead to future changes - e.g., we should have some Estimators which expect huggingface models, so we can use them to produce conditioning vectors / to initialize the policy (this will obviously be a future PR).

Note that the conditioning is useless in my example, we should have a better use-case envisioned for the demo. The demo currently is not complete for all GFlowNet types.

…ionally contains a tensor of conditioning vectors (one per trajectory)

…itioning into PB and PF computation

…ule can now accept raw tensors

…bute of the trajectory

josephdviviano · 2024-09-25T15:32:54Z

Don't worry about the tests - they should be easy to fix.

I can make the chances for DB, Sub-TB, and FM pretty easily if we agree this is a good approach, before a proper review.

saleml · 2024-09-25T17:50:21Z

src/gfn/modules.py

+
+    or
+
+    $s \mapsto (P_B(s' \mid s, c))_{s' \in Parents(s)}$.


might be worth mentioning that this is a s very specific conditioning use-case, where the condition is encoded separately, and embeddings are concatenated.

I don't think we can do a generic one, but this should be enough as an example !

What other conditioning approaches would be worth including? Cross attention?

In general I would think the conditioning should be embedded / encoded separately --- or would the conditioning just need to be concatenated to the state before input? I could add support for that.

I don't think there is an exhaustive list of ways we can process the condition. What you have is great as an example. I suggest you just add a comment or doc that the user might want to write their own module

saleml · 2024-09-25T17:51:04Z

src/gfn/samplers.py

@@ -68,7 +67,28 @@ def sample_actions(
                the sampled actions under the probability distribution of the given
                states.
        """
-        estimator_output = self.estimator(states)
+        # TODO: Should estimators instead ignore None for the conditioning vector?


wouldn't it be cleaner with fewer if else blocks ?

Yes there's a bit of cruft with all the if-else blocks, but as it stands an estimator can either accept one or two arguments and I think it's good if it fails noisily... what do you think?

Ok ! makes sense.

I added these exception_handlers to reduce the cruft.

saleml · 2024-09-25T20:04:28Z

LGTM! Looking forward to test this feature

…g conditioning

…tePolicyEstimator

josephdviviano added 9 commits September 25, 2024 10:56

example of conditional GFN computation with TB only (for now)

6e8dc4d

should be no change

39fb5ee

Trajectories objects now have an optional .conditonal field which opt…

2bc2263

…ionally contains a tensor of conditioning vectors (one per trajectory)

small changes to logz paramater handling, optionally incorporate cond…

99afaf3

…itioning into PB and PF computation

logZ is optionally computed using a conditioning vector

e6d25a0

NeuralNets now have input/output dims

2c72bf9

added a ConditionalDiscretePolicyEstimator, and the forward of GFNMod…

580c455

…ule can now accept raw tensors

added conditioning to sampler, which will save the tensor as an attri…

a74872f

…bute of the trajectory

black

056d935

josephdviviano added the enhancement New feature or request label Sep 25, 2024

josephdviviano requested a review from saleml September 25, 2024 15:11

josephdviviano self-assigned this Sep 25, 2024

josephdviviano mentioned this pull request Sep 25, 2024

Add conditional LogZ calculation #150

Open

saleml reviewed Sep 25, 2024

View reviewed changes

josephdviviano added 8 commits October 1, 2024 12:10

API changes adapted

96b725c

added conditioning to all gflownets

5cd32a7

both trajectories and transitions can now store a conditioning tensor

877c4a0

input_dim setting is now private

279a313

added exception handling for all estimator calls potentially involvin…

65135c1

…g conditioning

API change -- n vs. n_trajectories

b4c418c

change test_box target value

738b062

API changes

4434e5f

josephdviviano marked this pull request as ready for review October 1, 2024 16:34

josephdviviano added 4 commits October 1, 2024 13:16

hacky fix for problematic test (added TODO)

851e03e

working examples for all 4 major losses

5152295

added conditioning indexing for correct broadcasting

1d64b55

added a ConditionalScalarEstimator which subclasses ConditionalDiscre…

348ee82

…tePolicyEstimator

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

conditional gfn #188

conditional gfn #188

josephdviviano commented Sep 25, 2024 •

edited

Loading

josephdviviano commented Sep 25, 2024

saleml Sep 25, 2024

josephdviviano Sep 25, 2024

josephdviviano Sep 25, 2024

saleml Sep 25, 2024

saleml Sep 25, 2024

josephdviviano Sep 25, 2024

saleml Sep 25, 2024

josephdviviano Oct 1, 2024

saleml commented Sep 25, 2024

conditional gfn #188

Are you sure you want to change the base?

conditional gfn #188

Conversation

josephdviviano commented Sep 25, 2024 • edited Loading

josephdviviano commented Sep 25, 2024

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

saleml commented Sep 25, 2024

josephdviviano commented Sep 25, 2024 •

edited

Loading