Dead code in dash_pipeline.p4 PNA version #399

fruffy · 2023-07-11T14:25:10Z

I ran some coverage benchmarks on the dash_pipeline.p4 program using P4Testgen and I noticed that there is some dead code in the dash PNA pipeline.
In the nrve_encap action there is this check:
https://github.com/sonic-net/DASH/blob/main/dash-pipeline/bmv2/dash_nvgre.p4#L48

However, this code can never be executed. nrve_encap is only called when route_service_tunnel was previously executed. And route_service_tunnel always sets the IPv4 header invalid here when it calls service_tunnel_encode.

Is this because dash_pipeline.p4 is not complete yet? What is the intended behavior?

The text was updated successfully, but these errors were encountered:

chrispsommers · 2023-07-11T16:04:45Z

@marian-pritsak please take a look, thanks

chrispsommers · 2023-07-11T16:16:29Z

@fruffy Thanks for reporting this. I would like to understand how you ran this test and if it might be something we can incorporate into the DASH CI/CD pipeline? In DASH we use a p4c docker container which is a slimmed-down version of p4lang/p4c derived by taking only the one bmv2 backend (to keep the image smaller). It would not be too hard to add in the p4testgen backend and call it upon every build, if it would catch dead code like this. Can you provide instructions on how you ran p4testgen to expose this dead code? Thanks!

fruffy · 2023-07-11T16:49:02Z

@fruffy Thanks for reporting this. I would like to understand how you ran this test and if it might be something we can incorporate into the DASH CI/CD pipeline? In DASH we use a p4c docker container which is a slimmed-down version of p4lang/p4c derived by taking only the one bmv2 backend (to keep the image smaller). It would not be too hard to add in the p4testgen backend and call it upon every build, if it would catch dead code like this. Can you provide instructions on how you ran p4testgen to expose this dead code? Thanks!

P4Testgen supports monitoring of statement coverage (and other P4 nodes) when generating tests. With that we can also implement different types of test-case-generation strategies to produce more targeted tests. For example,

./p4testgen --target dpdk --arch pna --std p4-16 -I/p4/p4c/build/p4include --test-backend METADATA --seed 1 --max-tests 0 --out-dir p4c/build/results/dash-pipeline-pna --path-selection GREEDY_STATEMENT_SEARCH --stop-metric MAX_STATEMENT_COVERAGE --track-coverage STATEMENTS dash-pipeline/bmv2/dash_pipeline.p4  -DTARGET_DPDK_PNA --print-coverage

will try to cover all statements in the program and then stop once it has achieved that. For this particular program achieving full coverage was not possible and test-case-generation did not terminate.

We could use P4Testgen to check that we can cover everything but that is subject to a bunch of limitations:

The statement coverage algorithm is a heuristic and may not always find the path to cover a particular statement early.
If code is dead P4Testgen will simply not terminate. So we have to use some form of bound.

fruffy · 2023-07-11T16:51:06Z

In parallel, I have been working on a tool that can provably detect this type of dead-code (and other missing optimizations). https://github.com/fruffy/flay
This should work better for CI, but it is not ready yet.

chrispsommers · 2023-07-11T17:04:04Z

If code is dead P4Testgen will simply not terminate. So we have to use some form of bound.
Could you elaborate? Does the program run forever and manual intervention (e.g. a timeout in the calling process) need to be imposed? I guess this could be achieved in some Python test wrapper.

fruffy · 2023-07-11T17:08:59Z

Could you elaborate? Does the program run forever and manual intervention (e.g. a timeout in the calling process) need to be imposed? I guess this could be achieved in some Python test wrapper.

P4Testgen will generate tests until there are no more paths. For programs such as dash_pipeline this can mean upwards of 200-300k tests. Likely even many million if the program grows in complexity.

The best solution is probably to expect full statement coverage within a reasonable amount of tests (maybe a thousand?). But since the algorithm is still a heuristic that could also fail if the program changes. That is something we would have to try out.

chrispsommers · 2023-07-11T19:06:01Z

Thanks @fruffy Based on this, I think it's a bit early to add routine CI tests on DASH using P4testgen, but let's keep up the experimenting. Also I'll try to monitor progress on flay, thanks for pointing it out.

jnfoster · 2023-07-11T20:29:34Z

Just wanted to chime in to say that this is awesome!

KrisNey-MSFT · 2023-07-13T17:38:47Z

Any HW can use this tool to exercise the paths - thank you!

fruffy · 2023-07-14T00:48:59Z

Any HW can use this tool to exercise the paths - thank you!

Not sure if I understand correctly. Are you asking about ways to generate tests that could exercise the paths we try to cover? Unfortunately, the DPDK PNA target does not yet have a testing pipeline, but we are working on one.

KrisNey-MSFT · 2023-07-14T04:59:48Z

hi @fruffy - we were just looking and this (and grateful for the contribution) in the DASH Community Call :)
We weren't asking...I think we understand that this is is a work in progress for the future - and were making a statement that in the future the goal would be to use the tool to exercise the paths. And thank you!

fruffy · 2023-08-11T10:15:29Z

Has this been confirmed or is there an easy fix to this? I am maintaining a PR on P4C which contributes a snapshot of the dash program as a compiler test. I would like to contribute the version without dead code. Mostly for selfish reasons, we are planning to run testing benchmarks on this particular version.

chrispsommers · 2023-08-11T15:53:23Z

@marian-pritsak Have you had a chance to look at this? Thanks.

chrispsommers · 2023-08-11T18:20:20Z

@jfingerh I took a closer look at this and I have a suspicion this is actually an artifact of translating

DASH/dash-pipeline/bmv2/dash_nvgre.p4

Line 36 in 1756b49

    
           hdr.ipv4.total_len = hdr.inner_ipv4.total_len*(bit<16>)(bit<1>)hdr.inner_ipv4.isValid() + \

, which contains multiple conditional expressions, and https://github.com/sonic-net/DASH/blob/main/dash-pipeline/bmv2/dash_nvgre.p4#L48, which is the p4-dpdk compatible equivalent with the expression terms "unrolled" into individual ifs and expressions which are later added up in

DASH/dash-pipeline/bmv2/dash_nvgre.p4

Line 55 in 1756b49

hdr.ipv4.total_len = (ETHER_HDR_SIZE + IPV4_HDR_SIZE + UDP_HDR_SIZE +

. This then exposed the part of the original combined expression which is never valid and p4testgen found it!

So, we could change the code to remove all the IPv4 components because per @fruffy 's observations, this is never called for IPv4 case. I'd want to leave a comment in the code mentioning it would need to be revised to handle IPv4. Thoughts?

jafingerhut · 2023-08-23T16:25:25Z

So my attempt in writing the version of the code with if statements was to faithfully duplicate the behavior of the original code, which uses multiplications by (bit<1>) hdr.ipv4.isValid() and similar expressions for other header valid bits.

Any tool that checks "line coverage" will get 100% coverage for the original, but not my modified version. A tool that somehow decided to check that all possible values of sub-expressions like (bit<1>) hdr.ipv4.isValid() in the original code would have caught this in the original code, too. However, deciding which kinds of case coverage one wants in such expressions is a bit of a difficult thing to define, I believe.

If there is dead code in my translated version, I believe there are also "dead sub-expressions" in the original code that could be removed.

chrispsommers · 2023-08-23T16:28:15Z

If there is dead code in my translated version, I believe there are also "dead sub-expressions" in the original code that could be removed.

Agreed. @marian-pritsak could you please comment? Thanks.

jafingerhut · 2023-08-23T16:55:53Z

I would also guess that in future versions of the DASH P4 code, the branches that are currently dead code could easily become live code again, if additional packet handling cases are added in other parts of the program executed before this code is executed.

fruffy · 2023-09-13T15:09:57Z

So my attempt in writing the version of the code with if statements was to faithfully duplicate the behavior of the original code, which uses multiplications by (bit<1>) hdr.ipv4.isValid() and similar expressions for other header valid bits.

Any tool that checks "line coverage" will get 100% coverage for the original, but not my modified version. A tool that somehow decided to check that all possible values of sub-expressions like (bit<1>) hdr.ipv4.isValid() in the original code would have caught this in the original code, too. However, deciding which kinds of case coverage one wants in such expressions is a bit of a difficult thing to define, I believe.

If there is dead code in my translated version, I believe there are also "dead sub-expressions" in the original code that could be removed.

Is this version local or part of a PR?

jafingerhut · 2023-09-13T17:26:55Z

Is this version local or part of a PR?

It is a few lines earlier than the link to a line of code you gave in the original issue, here: https://github.com/sonic-net/DASH/blob/main/dash-pipeline/bmv2/dash_nvgre.p4#L35-L42

That code is only compiled if you #define the symbol TARGET_BMV2_V1MODEL, whereas the code with the if statement version is only compiled if you #define the symbol TARGET_DPDK_PNA.

The reason for the #ifdef is that as of 2023-Apr when I created the TARGET_DPDK_PNA version, p4c-dpdk did not support multiplication of 2 run-time variables (and probably still does not). The p4c BMv2 back end does not fully support if statements within action bodies.

If you want to experiment with the TARGET_BMV2_V1MODEL version of the code with p4testgen, I would guess that you need to define that symbol, and NOT define TARGET_DPDK_PNA, and then you will get a different version of the P4 program.

fruffy · 2023-09-13T17:44:13Z

Oh, I see what you mean. Yes, for the v1model version we cover all statements, but not for the pna version because of the way the code is structured. In some case that exposed a dormant issue.

jafingerhut · 2023-09-13T17:58:45Z

Right. There are I think Verilog coverage tools that go more detailed than line coverage, and check for something like "subexpression coverage", that might expose the "dead cases" in the TARGET_BMV2_V1MODEL version of the code. There is probably a widely-accepted name for what I am calling "subexpression coverage", but if so, I do not know that name.

I asked one hardware verification person for names of other kinds of code coverage, and he mentioned these:

branch
FSM
toggle
condition
and maybe expression depending on simulator

I do not have definitions of all of those, but the Wikipedia page on code coverage describes a few others besides function and line coverage with examples: https://en.wikipedia.org/wiki/Code_coverage

I mention these not because I think p4testgen ought to do all of these things, by the way. Mainly just for me looking for the right widely-used terms (if any) for the more detailed kinds of coverage that have been implemented for other programming languages.

chrispsommers · 2023-09-21T20:10:59Z

@fruffy We discussed at length in today's WG meeting (myself, @jafingerhut, @marian-pritsak) and the consensus was:

it is true that given today's P4 code and choice of tunneling options, the particular path of code you identified in Dead code in dash_pipeline.p4 PNA version #399 (comment) will never be executed
this may not hold true forever if other features/tunneling options are added in the future
It is not a "bug," it is future-proofing the code in case other options arise which would make this code path live.

All this means for you is you can decide how to handle the dead-code detection in your use of DASH code as a test case. Perhaps you want to "expect" this and not "hard-fail." We hope you do decide to use DASH code as a test case. In fact at some point we could add p4testgen to our CI pipeline. Let us know how the effort described in #399 (comment) proceeds.

We appreciate your bringing this to our attention, it resulted in better awareness and demonstrates the power of your tool. I am closing this issue, but feel free to add to the conversation thread here.

fruffy · 2023-10-03T08:57:16Z

Thanks for keeping me posted on this! I will merge a patched version to the p4c repository.

KrisNey-MSFT · 2023-10-25T16:13:22Z

@r12f for visibility

chrispsommers assigned marian-pritsak and jfingerh Jul 11, 2023

chrispsommers unassigned jfingerh Jul 11, 2023

chrispsommers closed this as completed Sep 21, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Dead code in dash_pipeline.p4 PNA version #399

Dead code in dash_pipeline.p4 PNA version #399

fruffy commented Jul 11, 2023

chrispsommers commented Jul 11, 2023

chrispsommers commented Jul 11, 2023

fruffy commented Jul 11, 2023 •

edited

Loading

fruffy commented Jul 11, 2023 •

edited

Loading

chrispsommers commented Jul 11, 2023

fruffy commented Jul 11, 2023

chrispsommers commented Jul 11, 2023

jnfoster commented Jul 11, 2023

KrisNey-MSFT commented Jul 13, 2023

fruffy commented Jul 14, 2023

KrisNey-MSFT commented Jul 14, 2023

fruffy commented Aug 11, 2023

chrispsommers commented Aug 11, 2023

chrispsommers commented Aug 11, 2023

jafingerhut commented Aug 23, 2023

chrispsommers commented Aug 23, 2023

jafingerhut commented Aug 23, 2023

fruffy commented Sep 13, 2023

jafingerhut commented Sep 13, 2023

fruffy commented Sep 13, 2023

jafingerhut commented Sep 13, 2023

chrispsommers commented Sep 21, 2023

fruffy commented Oct 3, 2023

KrisNey-MSFT commented Oct 25, 2023

Dead code in dash_pipeline.p4 PNA version #399

Dead code in dash_pipeline.p4 PNA version #399

Comments

fruffy commented Jul 11, 2023

chrispsommers commented Jul 11, 2023

chrispsommers commented Jul 11, 2023

fruffy commented Jul 11, 2023 • edited Loading

fruffy commented Jul 11, 2023 • edited Loading

chrispsommers commented Jul 11, 2023

fruffy commented Jul 11, 2023

chrispsommers commented Jul 11, 2023

jnfoster commented Jul 11, 2023

KrisNey-MSFT commented Jul 13, 2023

fruffy commented Jul 14, 2023

KrisNey-MSFT commented Jul 14, 2023

fruffy commented Aug 11, 2023

chrispsommers commented Aug 11, 2023

chrispsommers commented Aug 11, 2023

jafingerhut commented Aug 23, 2023

chrispsommers commented Aug 23, 2023

jafingerhut commented Aug 23, 2023

fruffy commented Sep 13, 2023

jafingerhut commented Sep 13, 2023

fruffy commented Sep 13, 2023

jafingerhut commented Sep 13, 2023

chrispsommers commented Sep 21, 2023

fruffy commented Oct 3, 2023

KrisNey-MSFT commented Oct 25, 2023

fruffy commented Jul 11, 2023 •

edited

Loading

fruffy commented Jul 11, 2023 •

edited

Loading