
EXAMPLE: Interpreter grounded Neural Program Synthesis [WIP] #81

Merged · 15 commits · Nov 7, 2022

Conversation

reshinthadithyan
Contributor

This PR adds an example of using trlx to ground LMs in an interpreter, using a toy list-manipulation DSL, so that they generate more coherent programs that satisfy IO conditions.

Questions:

  1. I want to do extensive ablation studies on the impact of grounding models with an interpreter, so I would need a folder for plots and a README.md. Does the current directory structure look good?

Interpreter Grounded Program Synthesis

Program synthesis is the task of automatically generating programs that solve a given task by satisfying an IO condition. In neural program synthesis, the synthesizer is a neural network (here, a language model) that takes an input/output pair and tries to generate a program in the toy DSL's grammar.

Toy List Manipulation DSL Grammar

The DSL has the following grammar:

list_expr := list[int]
integer := -5 | -4 | -3 | -2 | -1 | 0 | 1 | 2 | 3 | 4 | 5
statement :=
          | take(list_expr,integer)
          | drop(list_expr,integer)
          | reverse(list_expr)
          | sort_asc(list_expr)
          | sort_des(list_expr)
          | add_n(list_expr,integer)
          | sub_n(list_expr,integer)
          | mul_n(list_expr,integer)
          | expand_copy(list_expr)
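The productions above can be read as plain Python functions. The sketch below is an illustrative interpretation of the grammar, not the actual interpreter from `lang.py`; in particular, the exact behavior of `expand_copy` is an assumption here.

```python
# Hypothetical Python semantics for the atomic DSL functions.
# Names mirror the grammar; the real interpreter in lang.py may differ.

def take(xs, n):      return xs[:n]             # first n elements
def drop(xs, n):      return xs[n:]             # all but the first n
def reverse(xs):      return xs[::-1]
def sort_asc(xs):     return sorted(xs)
def sort_des(xs):     return sorted(xs, reverse=True)
def add_n(xs, n):     return [x + n for x in xs]
def sub_n(xs, n):     return [x - n for x in xs]
def mul_n(xs, n):     return [x * n for x in xs]
def expand_copy(xs):  return xs + xs            # assumed: duplicate the list

# Nested statements compose in the obvious way:
print(sort_des(reverse(mul_n(sort_asc(sort_asc([4, -2, 0, 0, 5, 5])), 5))))
# -> [25, 25, 20, 0, 0, -10]
```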


To generate training/testing data, run python3 -m lang. The dataset is saved to ./dataset/train.json and ./dataset/test.json. To use the processed dataset, refer to this google drive link.
Each datapoint in the dataset looks like:

    {"input": "Input: [4, -2, 0, 0, 5, 5] Output: [25, 25, 20, 0, 0, -10] Function:",
    "output": "sort_des(reverse(mul_n(sort_asc(sort_asc([4, -2, 0, 0, 5, 5])),5)))"}
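For illustration, a datapoint of this shape could be produced by composing randomly chosen atomic functions over a random input list, recording both the expression string and its evaluated result. This sketch is hypothetical and covers only a subset of the grammar; the actual sampler lives in `lang.py`.

```python
import random

# Hypothetical datapoint sampler, assumed for illustration only.
ATOMS = {                      # unary functions: list -> list
    "reverse":  lambda xs: xs[::-1],
    "sort_asc": lambda xs: sorted(xs),
    "sort_des": lambda xs: sorted(xs, reverse=True),
}
INT_ATOMS = {                  # functions taking an integer argument
    "add_n": lambda xs, n: [x + n for x in xs],
    "mul_n": lambda xs, n: [x * n for x in xs],
}

def sample_datapoint(depth=3, length=6):
    xs = [random.randint(-5, 5) for _ in range(length)]
    expr, value = str(xs), xs
    for _ in range(depth):     # wrap the expression depth times
        if random.random() < 0.5:
            name = random.choice(list(ATOMS))
            expr, value = f"{name}({expr})", ATOMS[name](value)
        else:
            name = random.choice(list(INT_ATOMS))
            n = random.randint(-5, 5)
            expr, value = f"{name}({expr},{n})", INT_ATOMS[name](value, n)
    return {"input": f"Input: {xs} Output: {value} Function:", "output": expr}
```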

Training with TRLX

Run python3 train_trlx.py to run training with the grounded interpreter. The reward_fn returns -1 if a generated sample has invalid syntax, and 0.5 if the syntax is valid but the sample doesn't satisfy the IO condition.
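A minimal sketch of such a reward function, assuming generated samples are evaluated directly in a restricted namespace and that a program satisfying the IO condition scores 1.0 (the value for a fully correct program is not stated above, so it is an assumption):

```python
# Hedged sketch of the reward scheme described above; the actual
# reward_fn in train_trlx.py may parse and score samples differently.

DSL_FUNCS = {
    "take":        lambda xs, n: xs[:n],
    "drop":        lambda xs, n: xs[n:],
    "reverse":     lambda xs: xs[::-1],
    "sort_asc":    lambda xs: sorted(xs),
    "sort_des":    lambda xs: sorted(xs, reverse=True),
    "add_n":       lambda xs, n: [x + n for x in xs],
    "sub_n":       lambda xs, n: [x - n for x in xs],
    "mul_n":       lambda xs, n: [x * n for x in xs],
    "expand_copy": lambda xs: xs + xs,   # semantics assumed
}

def reward_fn(program: str, expected_output: list) -> float:
    try:
        # Evaluate the sample with only DSL functions in scope.
        result = eval(program, {"__builtins__": {}}, dict(DSL_FUNCS))
    except Exception:
        return -1.0                      # invalid syntax or runtime error
    return 1.0 if result == expected_output else 0.5
```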

@cat-state cat-state self-requested a review November 6, 2022 01:25
Collaborator

@cat-state cat-state left a comment


Thanks for this example @reshinthadithyan! It mostly looks good aside from a few things. Could you also format the Python files with black?

output.append(value_gen)
return output


Collaborator


Could you comment this file more? Maybe separate the sections and add an explanation at the top of the structure of the language/interpreter.

Contributor


Strong agree. The documentation here is not clear.

Contributor Author


Adding more documentation, thanks for the review.

Contributor Author


Added more documentation on the sampler.

*Program synthesis* is the task of automatically generating programs that solve a given task by satisfying an IO condition. In Neural Program Synthesis the synthesizer is a neural network which is a Language Model that takes in an input/output pair and tries to generate the program in the defined toy DSL's Grammar.

## Toy List Manipulation DSL Grammar
The DSL has the following grammar:
Collaborator


In addition to the DSL grammar, add some snippets

Contributor Author


Added example snippets to showcase the atomic functions.

@LouisCastricato
Contributor

This looks fantastic! I think we should upload a trained model to the Hugging Face Hub that goes along with this tutorial, and allow people to validate their model against our uploaded one. What do you think, Reshinth?

@reshinthadithyan
Contributor Author

@LouisCastricato Yes, the idea is to provide a fully reproducible script so that anyone can train it themselves and validate.

@LouisCastricato
Contributor

LGTM, ready to merge with your approval @cat-state

@cat-state
Collaborator

LGTM! thanks @reshinthadithyan !

@LouisCastricato
Contributor

Pre-commit is failing. Will merge after resolution.
