
Assert return codes specified in API in functional tests #331

Closed
mbenson1 opened this issue Dec 26, 2019 · 25 comments · Fixed by #980
Labels: community, enhancement, unit-test

Comments

@mbenson1

I noticed that osal-core-test.c only tests if the function returns OS_SUCCESS or does NOT return OS_SUCCESS. It does not actually test that the return code is correct. For example, the test to ensure that the OSAL does not create a queue with a name that already exists does not actually test for OS_ERR_NAME_TAKEN. It merely passes if the return is not OS_SUCCESS. A quick spot check indicates this design pattern is systemic.

These unit tests were used to certify (per NPR7150A) the ARINC 653 version of CFS. This requires that requirements trace back to tests. Testing the actual expected return code is a better pattern in general, and it is required when there is an actual requirement defining the return codes.
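
For illustration, a minimal sketch of the two assertion styles (not the actual osal-core-test.c code; the OS_QueueCreate signature and UtAssert macro usage follow current OSAL conventions and may differ by version):

```c
osal_id_t queue_id2;
int32     status;

/* attempt to create a queue whose name already exists */
status = OS_QueueCreate(&queue_id2, "SameName", 4, sizeof(uint32), 0);

/* current pattern: passes for ANY non-success code */
UtAssert_True(status != OS_SUCCESS, "duplicate queue name rejected, rc=%d", (int)status);

/* requested pattern: asserts the specific documented code */
UtAssert_True(status == OS_ERR_NAME_TAKEN,
              "duplicate queue name returns OS_ERR_NAME_TAKEN, rc=%d", (int)status);
```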

@skliper added this to the 5.1.0 milestone Dec 30, 2019
@skliper
Contributor

skliper commented Mar 26, 2020

Preference is to stress inputs from functional tests (vs mock/stub coverage tests) and verify handling/responses. @dmknutsen assigned to check compliance against the unit test standard and we'll evaluate approaches to resolve.

@CDKnightNASA
Contributor

Should we have a coding standard where all unit test assertions must check "return == something" (no use of "!=")?

@jphickey
Contributor

Should we have a coding standard where all unit test assertions must check "return == something" (no use of "!=")?

I don't think we can make this sort of generalization. It really depends on whether or not there is a specific requirement for a certain return code. If there is, then yes, a functional test should check for it. But in the majority of cases (particularly with PSP and OSAL) there are no documented requirements at that API level.

On the other hand, where there isn't a specific documented error code requirement, it is more future-proof to simply check that returncode != OS_SUCCESS.

The reality is that we should discourage applications from checking for only certain specific error return codes. By relying too heavily on a specific set of return codes, we open the possibility that app developers check only for those return codes, and then we are backed into an API corner (so to speak) if we need to expand those in the future as the libraries evolve.

The better pattern, and the one that we should encourage for real apps, is to check for OS_SUCCESS, and if that is not true, treat it as an error. The app can then also check for known (specific) error codes if they want to handle them, but there should be a catch-all error handling path. Apps should never treat an undocumented return code as "success".
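
As a sketch of that recommended application-side pattern (the handler names here are hypothetical, and the OS_QueueCreate signature may differ by OSAL version):

```c
osal_id_t queue_id;
int32     status;

status = OS_QueueCreate(&queue_id, "CMD_Q", 10, sizeof(uint32), 0);

if (status == OS_SUCCESS)
{
    /* nominal path */
}
else if (status == OS_ERR_NAME_TAKEN)
{
    /* optional: handle a known, documented error code specifically */
    HandleDuplicateName();   /* hypothetical handler */
}
else
{
    /* catch-all: never treat an unrecognized return code as success */
    ReportError(status);     /* hypothetical error reporting */
}
```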

@skliper
Contributor

skliper commented Mar 30, 2020

We don't have any OSAL or PSP requirements, so relying on requirements to verify against doesn't work. Historically and in this round, certification is based on the spec/APIs. Since the APIs do currently document at least some of the return codes, I'd expect those to be explicitly checked (if it's in the spec the code should match).

Then the question becomes: should we remove the documentation of specific error codes for each API (do we fix the document or the code)? We could generalize the documentation to "OS_SUCCESS or an appropriate error code" for everything that applies.

I think the developmental trade here is more about whether we want to support a concept where apps could handle certain errors in a unique manner, vs. just reporting the error code. If we want apps to be able to take a unique action based on the error, I'd think we'd need to provide consistent behavior across OSALs; basically, the functional test should enforce a specific return where one is defined in the API. On the other hand, if apps should only ever have at most two responses (OS_SUCCESS or any error case), then I'd think the specific error checks may not need to be enforced (it depends on the certification expectations, though).

These are just developer concerns though, maybe @tngo67 can help with certification expectations?

@acudmore - any inputs from your perspective? Enforce no specific error codes, a subset, or all possible?

*EDIT - tagged wrong person

@skliper added the CCB:Ready label (pull request is ready for discussion at the Configuration Control Board) Mar 30, 2020
@skliper
Contributor

skliper commented Mar 30, 2020

Tagged for discussion. In the short term @dmknutsen is auditing the current functional tests for compliance with our processes to get a sense of how well we currently comply with GSFC expectations.

@skliper
Contributor

skliper commented Mar 30, 2020

@ejtimmon do you know if any GSFC apps have unique responses to different error codes? Or is the error response typically just an event and reporting of error code?

@jphickey
Contributor

There are a few error conditions which applications do handle specially, mainly timeouts, possibly queue full/empty.

Most error codes actually deal with things that the application/caller could have and should have checked on their own (e.g. name too long, bad pointer, bad ID, etc.). It is highly unlikely that an application would handle these explicitly, because a properly written application would never generate them. These are purely tools/hints for the developer to aid in finding bugs in the code. Once debugged, they don't occur.

On the other hand, the timeouts and queue full errors deal with the state of the system at the time of the call and therefore could occur in a properly written and operating system. Applications that call functions that are subject to system state like this do need to check for these codes.
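
For example, a sketch of handling those state-dependent codes on a queue read (handler names are hypothetical; the OS_QueueGet signature and types vary across OSAL versions):

```c
uint32 msg;
size_t actual_size;
int32  status;

status = OS_QueueGet(queue_id, &msg, sizeof(msg), &actual_size, 100);

switch (status)
{
    case OS_SUCCESS:
        ProcessMessage(&msg);    /* hypothetical handler */
        break;
    case OS_QUEUE_TIMEOUT:       /* can occur in a properly operating system */
    case OS_QUEUE_EMPTY:         /* returned for non-blocking (OS_CHECK) reads */
        break;
    default:
        ReportError(status);     /* catch-all for developer-facing errors */
        break;
}
```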

@skliper
Contributor

skliper commented Mar 30, 2020

It might be worth a scrub of the API documentation relating to error codes from that context, as it's our contract with the user. It doesn't need to limit additional error codes for unique situations as development progresses, but it should explicitly list those guaranteed (or that should be enforced) by the functional or coverage tests.

Maybe something like:

\return Execution status. Success and error codes below are verified by test, but users should not assume the error code list is exhaustive. Precedence is not enforced; calls with multiple errors may return any one of the related error codes.

Then define the codes we want to enforce by test. I'd think most of the standard ones are probably a good idea, but that doesn't limit future development?
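
Applied to a function header, that could look something like the following doxygen sketch (the exact tags, wording, and list of codes are only illustrative and would be settled in the documentation update):

```c
/**
 * @brief Creates a message queue
 *
 * @return Execution status. The error codes below are verified by test, but
 *         the list is not exhaustive and precedence is not enforced; a call
 *         with multiple errors may return any one of the related error codes.
 * @retval #OS_SUCCESS
 * @retval #OS_INVALID_POINTER    if a required pointer argument is NULL
 * @retval #OS_ERR_NAME_TAKEN     if the name is already in use
 * @retval #OS_ERR_NAME_TOO_LONG  if the name exceeds the maximum length
 */
int32 OS_QueueCreate(osal_id_t *queue_id, const char *queue_name,
                     osal_blockcount_t queue_depth, size_t data_size, uint32 flags);
```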

@jphickey
Contributor

I like that wording - main thing is to be clear that the list of errors is not exhaustive/all-inclusive, and that precedence is not defined. That is, if you pass in both a long name and a bad pointer, all that the API guarantees is that the return code is not OS_SUCCESS, not a particular error code value.

Might even want to add a note in the list of error codes, that the specific error code definitions may be extended or refined in future versions of the software. Warn users/developers against only checking for specific error values aside from OS_SUCCESS.

@CDKnightNASA
Contributor

I don't know; to me, any return code should have an associated coverage (and probably functional) unit test case, so producing/maintaining an exhaustive list of return codes should be concurrent with the development of the unit tests.

@skliper
Contributor

skliper commented Mar 30, 2020

Coverage tests shall absolutely have explicit checks; they are white box and need to relate to the current exhaustive list for each implementation.

Functional tests are API based, and are what I have been referring to. They need to enforce the standard set, but I'm suggesting functional tests don't have to limit development or implementation-unique error codes where an explicit error code isn't required (per the API documentation).

@jphickey
Contributor

I second that -- Coverage tests are written against a specific/unique implementation, where functional tests are written against an API. We need to keep these tests separate and distinct.

The former (coverage) will be updated with each and every code change and there is no separate documentation for it (or in other words, the implementation itself is the documentation of what the coverage tests need to do).

The latter (functional) does have separate documentation, and should only need to change if/when there is an actual API change.

@CDKnightNASA
Contributor

Gotcha, and I think I understand more fully--coverage tests cover all error return values, and you're suggesting functional tests only check specific return values if necessary but generally can just check success or error and not worry about which error unless it's particularly important.

@skliper
Contributor

skliper commented Mar 30, 2020

See #399, which captures the documentation updates, to keep this issue focused on the tests.

@tngo67

tngo67 commented Mar 31, 2020

Should we have a coding standard where all unit test assertions must check "return == something" (no use of "!=")?

I second that. We do have such a coding standard that we have to comply with.

@tngo67

tngo67 commented Mar 31, 2020

I second that -- Coverage tests are written against a specific/unique implementation, where functional tests are written against an API. We need to keep these tests separate and distinct.

The former (coverage) will be updated with each and every code change and there is no separate documentation for it (or in other words, the implementation itself is the documentation of what the coverage tests need to do).

The latter (functional) does have separate documentation, and should only need to change if/when there is an actual API change.

I third that! :-)

@astrogeco
Contributor

CCB 2020-04-01 - Approved implementation

@astrogeco added the CCB - 20200401 label and removed the CCB:Ready label Apr 1, 2020
@skliper added the unit-test label Apr 10, 2020
@skliper modified the milestones: 5.1.0 → 5.2.0 Jun 5, 2020
@skliper modified the milestones: 5.2.0 → 6.0.0 Sep 1, 2020
@skliper changed the title from "Unit tests have insufficient assertions" to "Assert return codes specified in API in functional tests" Dec 8, 2020
@astrogeco
Contributor

Reopening since there are remaining items

@mbenson1
Author

Not sure if it matters, but I concur with the end result of this discussion; I just noticed the notification email. We maintain a single set of functional tests that tests our OSALs, since the functionality should be identical between all of the OSALs. We maintain a separate set of coverage tests for each of our OSALs, since the internal code is different and may require differing inputs to fully exercise statement and branch coverage for each OSAL. When there is a requirement that drives a specific return code, our functional tests are designed to verify it. When there is a requirement that drives a success or non-success return code, our functional tests are required to verify only success or non-success. In the latter case, the specific non-success return code is not required to be tested, but coverage tests may still exercise it in the course of capturing required statement and branch coverage.

@jphickey
Contributor

Here is an updated report of where there are differences between the doxygen documentation and the return values validated by the unit-tests (Formatted as a "to do" list):

https://gist.github.com/jphickey/2797f70ab0465a7d088e3281c17f2138

@skliper
Contributor

skliper commented May 13, 2021

From the API/test traceability audit, the following "rules" stood out as not being consistently followed (a sketch of such checks follows at the end of this comment):

Needed "Standard" tests based on input parameter type:

  • osal_id_t: OS_OBJECT_ID_UNDEFINED returns OS_ERR_INVALID_ID
  • any pointer that can't be NULL: OS_INVALID_POINTER
  • paths: OS_FS_ERR_PATH_INVALID, OS_FS_ERR_PATH_TOO_LONG
  • absolute file name: OS_FS_ERR_PATH_INVALID, OS_FS_ERR_PATH_TOO_LONG, OS_FS_ERR_NAME_TOO_LONG
  • names (without paths): OS_FS_ERR_NAME_TOO_LONG
  • any size that can't be zero: OS_ERR_INVALID_SIZE

Other comments

  • Note any pointer that can be NULL in API documentation (and associated behavior)
  • Note OS_SUCCESS returns, and explicitly document anything that doesn't
  • Don't stack error tests: many tests set multiple inputs invalid while a specific return is checked. Each input needs to be tested independently (there are many cases where an invalid length is passed in for multiple strings at the same time)
  • Any sizes should document if they can be 0 or not
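
A sketch of what those per-parameter "standard" checks could look like in a functional test, with each invalid input exercised independently rather than stacked (function signatures follow current OSAL conventions and may differ by version):

```c
osal_id_t fd;

/* osal_id_t argument: OS_OBJECT_ID_UNDEFINED must return OS_ERR_INVALID_ID */
UtAssert_True(OS_QueueDelete(OS_OBJECT_ID_UNDEFINED) == OS_ERR_INVALID_ID,
              "OS_QueueDelete(OS_OBJECT_ID_UNDEFINED) == OS_ERR_INVALID_ID");

/* pointer argument that cannot be NULL: OS_INVALID_POINTER (all other inputs valid) */
UtAssert_True(OS_QueueCreate(NULL, "GoodName", 4, 4, 0) == OS_INVALID_POINTER,
              "OS_QueueCreate(NULL, ...) == OS_INVALID_POINTER");

/* path argument: a malformed path must return OS_FS_ERR_PATH_INVALID */
UtAssert_True(OS_OpenCreate(&fd, "not-an-absolute-path", OS_FILE_FLAG_NONE, OS_READ_ONLY)
                  == OS_FS_ERR_PATH_INVALID,
              "OS_OpenCreate(bad path) == OS_FS_ERR_PATH_INVALID");
```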

@skliper
Contributor

skliper commented Jun 29, 2021

This was split into #1008 to #1024 and resolved via associated tickets. Closing based on those fixes and the partial fix.

@skliper closed this as completed Jun 29, 2021