[WIP] Automated examples testing to prevent regression #34

Open
wants to merge 2 commits into base: master
Conversation

@tlvu
Contributor

tlvu commented May 11, 2015

As a new contributor, it will be very easy for me to introduce regressions because obviously I do not know the code well. I would rather discover regressions myself than send a broken PR.

I know that Mike is working on some unit testing. Not sure what coverage level he is at.

This PR is a quick and simple way to automatically test that all the examples still produce the same output; if they do not, it means I might have introduced a regression while working on the code.

I chose to focus my effort on the examples because:

  • capturing the output of the examples has the following advantages
    • it's very easy
    • readers of the examples immediately have the corresponding output without having to set up the environment and run mochi themselves
  • ensuring the examples still work is the fastest way to get the greatest test coverage with minimum effort (for the same coverage, I think I would have to write a lot more unit tests)
  • the examples are likely the most trivial cases we support, so if they do not work, we have a problem. Unit testing will be able to find more complex corner-case problems, but I want to at least ensure the basic scenarios work

My naive, quick and simplistic approach does have a problem: it does not work well when the output is not consistent between runs (see details in the commit description). I am not sure what we should do about it: make the test more complex to handle that, or fix the examples so they produce consistent output.

The problem with timer.mochi.out is that my Docker image is missing RxPY; I will add it later.
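To make the idea concrete, the check boils down to roughly this (a Python sketch of what the shell script does; the examples/ layout and the plain mochi command are placeholders for the real script's Docker invocation):

# rough sketch of what testexamples does; "mochi" on the PATH and the
# examples/ layout are placeholders for the real Docker-based invocation
import glob
import subprocess
import sys

failed = []
for example in sorted(glob.glob("examples/*.mochi")):
    with open(example + ".out", encoding="utf-8") as f:
        expected = f.read()
    # run the example and capture whatever it prints
    actual = subprocess.check_output(["mochi", example],
                                     universal_newlines=True)
    if actual != expected:
        print("ERROR: %s changed" % example)
        failed.append(example)

sys.exit(1 if failed else 0)

The .out files are committed alongside the examples, so any change in output shows up as a diff in the PR.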

tlvu added 2 commits May 10, 2015 17:35
There are 2 goals for doing this:

* Users will have the output of all examples ready.  They can compare the
  output with the code in the example without being forced to run the examples.

* Since the examples are a showcase for the features we support, we should
  always make sure the examples work as expected.  One quick and easy way is
  to make sure the output does not change between commits.

flask_restful.mochi.out and timer.mochi.out have been rebuilt to avoid false
positives (testexamples now expects a different current dir than the one used
when these 2 .out files were originally built).

As of now, the following example failures are observed (with Docker image
tlvu/mochi:0.2.4.2-20150508).  The simple reason is that the examples do not
produce identical output between different runs.

Not sure how to fix these examples.

$ ./testexamples
18c18
< 0.0025186538696289062 5
---
> 0.002317190170288086 5
24c24
< <function _gs126.<locals>.fafa at 0x7fbd67d047b8>
---
> <function _gs126.<locals>.fafa at 0x7f90df7e47b8>
ERROR: etc.mochi changed
4c4
< time: 0.07205533981323242
---
> time: 0.0695199966430664
ERROR: fact.mochi changed
4c4
< time: 0.15062212944030762
---
> time: 0.1504371166229248
ERROR: fact_pattern.mochi changed
2c2
< pmap({'a': 1, 'b': 2})
---
> pmap({'b': 2, 'a': 1})
5c5
< pmap({'a': 1, 'b': 2})
---
> pmap({'b': 2, 'a': 1})
ERROR: keyword_arguments.mochi changed
1c1
< 23.061758756637573 1
---
> 22.92900848388672 1
ERROR: tak.mochi changed
3a4
> 明後日 雨のち晴
7a9
> 明後日 雨のち晴
11a14
> 明後日 晴れ
ERROR: urlopen.mochi changed
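One possible direction, sketched below but not implemented in this PR, would be to mask the volatile parts of the output (timings, object addresses) before diffing:

# sketch of a possible normalization pass, not part of this PR;
# the patterns only cover the kinds of failures listed above
import re

def normalize(text):
    # mask floating-point timings such as "time: 0.07205..." or "0.0025... 5"
    text = re.sub(r"\d+\.\d+", "<float>", text)
    # mask object addresses such as "at 0x7fbd67d047b8"
    text = re.sub(r"0x[0-9a-f]+", "<addr>", text)
    return text

# then compare normalize(actual) against normalize(expected)

That would still not help keyword_arguments.mochi (pmap key order changes between runs) or urlopen.mochi (it fetches live data), so those examples would probably need to be changed to produce deterministic output.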
@pya
Contributor

pya commented May 11, 2015

Here are the testing guidelines:
https://github.com/i2y/mochi/blob/master/doc/TESTS.rst

How do they fit with your testing approach?

@tlvu
Contributor Author

tlvu commented May 13, 2015

@pya I was not aware you wrote that TESTS.rst document, sorry. I guess the proper way is to convert all the examples to your format (with matching result_* functions)? I recall seeing that you wrote tests for some of the examples.

Maybe we should consolidate that and add all the matching result_* functions directly in the example files? Should we go this route instead?

I was just doing this because I needed a quick and simple way to

  • test that the existing simple cases are not broken
  • show users the output of all the examples without forcing them to run the examples to get the output
  • validate the user's run-time environment (when I started, I had no idea whether my output was good when running the examples; recall I had Python 3.4.0 and one of the examples was broken with that version)
  • now I use this output to test new Docker images (to validate the run-time environment)

Having all the matching result_* functions in the example files would also satisfy my needs above.

As for the shell script not working for Windows users, I totally forgot about them because I wanted to use this to test new Docker images (docker is the "default" mochi command in the script).
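For what it's worth, the same output comparison could also be driven from py.test, which would work on Windows too. This is only a rough sketch, not the result_* scheme from TESTS.rst, and the examples/ layout and plain mochi command are again placeholders:

# rough sketch only: not the TESTS.rst result_* scheme, just the same
# output comparison driven by py.test; "mochi" and examples/ are placeholders
import glob
import subprocess

import pytest

EXAMPLES = sorted(glob.glob("examples/*.mochi"))

@pytest.mark.parametrize("example", EXAMPLES)
def test_example_output_unchanged(example):
    with open(example + ".out", encoding="utf-8") as f:
        expected = f.read()
    # run the example and compare its stdout against the stored .out file
    actual = subprocess.check_output(["mochi", example],
                                     universal_newlines=True)
    assert actual == expected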

@pya
Contributor

pya commented May 14, 2015

@tlvu Sounds like a good idea. The result_* function scheme was the simplest thing I could come up with that allows us to use py.test. I am open to improvements. I have a few objectives here:

  1. Make writing tests as simple as possible while providing suitable functionality.
  2. Encourage new users to write tests as one simple way to get involved. Ideally, it should only take a few minutes of reading and looking at examples to write your first useful test.
  3. Gradually (or faster ;)), move to a TDD approach: Tests first. Implementation later.

Something like unique naming or numbering of tests might be useful. It should be primarily for human consumption, but also machine-readable and searchable. Just using commit IDs does not really seem to work. Maybe use categories and numbers, or only numbers plus some kind of database with a two-way number-description mapping. This should not make things more complicated, so it needs to be somehow (semi-)automatic: registering a test somewhere should give you a new number automatically. Of course, the description has to be provided by a human. Maybe naming conventions for files, functions, and docstrings can help here. "Conventions before configuration." The challenge: provide useful functionality, yet keep it simple to use. It should just work and get out of the way. Ideas welcome.

@boxed
Contributor

boxed commented May 18, 2015

This might be relevant: https://github.com/boxed/pytest-readme

It has the really nice feature that all line numbers are the same in the tests as in the README.
