RASP validator fails for some programs #11

langosco · 2023-11-20T19:40:24Z

Issue #9 introduces a validator to check RASP programs that compile incorrectly.
Here's one case---a RASP program that computes the sum of all inputs up to the current index---in which I think the validator fails (or I've misunderstood how it works):

from tracr.rasp import rasp
from tracr.compiler import validating, compiling


def sum_of_inputs() -> rasp.SOp:
    before = rasp.Select(rasp.indices, rasp.indices, rasp.Comparison.LEQ)
    means = rasp.Aggregate(before, rasp.tokens)  # returns sequence s_i = mean_{j<=i} input_j
    sums = rasp.SequenceMap(lambda x, y: x*y, means, rasp.indices+1)
    return sums


sums = sum_of_inputs()

# The output of the RASP program sums is different that the output of the compiled model:
rasp_output = sums([3, 2, 1, 1])
compiled_model = compiling.compile_rasp_to_model(sums, vocab={1,2,3}, max_seq_len=5, compiler_bos="BOS")
compiled_output = compiled_model.apply(["BOS", 3, 2, 1, 1]).decoded

print(rasp_output)  # output: [3.0, 5.0, 6.0, 7.0]
print(compiled_output)  # output: ['BOS', 3, 4, 3, 4]

# However, it looks like the validator doesn't catch the error:
print(validating.validate(sums, [1, 2, 3]))  # returns an empty list

david-lindner · 2023-11-21T19:17:48Z

Thanks! Fixed by 001bdb3 -- but, feel free to reopen if you find other cases the validator doesn't catch

langosco · 2024-01-16T00:53:27Z

Came across another case that the validator doesn't catch:

from tracr.rasp import rasp
from tracr.compiler import compiling, validating

sel = rasp.Select(rasp.indices, rasp.tokens, rasp.Comparison.EQ)
sop = rasp.Aggregate(sel, rasp.indices)
program = rasp.Aggregate(sel, sop)


model = compiling.compile_rasp_to_model(program, vocab={1,2,3,4}, max_seq_len=5, compiler_bos="BOS")
compiled_output = model.apply(["BOS", 1, 2, 3, 4]).decoded
rasp_output = program([1, 2, 3, 4])


# The output of the compiled model does not match the output of the RASP program:
print(rasp_output)  # [2.0, 3.0, None, None]
print(compiled_output) # ['BOS', 2, 3, 0, 1]

# The validator doesn't catch the error:
print(validating.validate(program, [1, 2, 3, 4])) # []

langosco · 2024-01-16T00:56:10Z

Also seems worth linking the two other cases documented in pull requests #13 #14

david-lindner · 2024-01-22T12:26:18Z

For all of these cases, can you try increasing the mlp_exactness parameter, ie. add mlp_exactness=100 to the call to compile? I suspect that at least for #14 the issue is an approximation error in the MLP layer of the selector width

langosco · 2024-01-22T13:21:30Z

You're right, looks like that fixes #14! mlp_exactness=100 is the default already, but #14 compiles fine when using mlp_exactness=120.

It doesn't seem to fix the other cases unfortunately.

david-lindner closed this as completed Nov 21, 2023

david-lindner reopened this Jan 16, 2024

langosco mentioned this issue Jan 25, 2024

Invalid compilation caused by categorical Aggregate with default=None #25

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

RASP validator fails for some programs #11

RASP validator fails for some programs #11

langosco commented Nov 20, 2023 •

edited

Loading

david-lindner commented Nov 21, 2023

langosco commented Jan 16, 2024 •

edited

Loading

langosco commented Jan 16, 2024

david-lindner commented Jan 22, 2024

langosco commented Jan 22, 2024

RASP validator fails for some programs #11

RASP validator fails for some programs #11

Comments

langosco commented Nov 20, 2023 • edited Loading

david-lindner commented Nov 21, 2023

langosco commented Jan 16, 2024 • edited Loading

langosco commented Jan 16, 2024

david-lindner commented Jan 22, 2024

langosco commented Jan 22, 2024

langosco commented Nov 20, 2023 •

edited

Loading

langosco commented Jan 16, 2024 •

edited

Loading