Add GeoIP db fixtures for filebeat tests by jsoriano · Pull Request #8447 · elastic/beats

jsoriano · 2018-09-26T12:34:13Z

Add a fake GeoIP DB so filebeat tests don't depend on the DB provided in elasticsearch images.

This protects us from changes in GeoIP, but we'd still detect changes in the fields sent by the plugin.

jsoriano · 2018-09-26T12:38:07Z

@tsg @ruflin this could be a possible solution for the geoip issue in filebeat modules test, wdyt?

I'd have to add a couple more fixtures. When new test files are added in the future, their IPs would have to be added to the DB, or better, they should use IPs from the networks in the fake DB.

jsoriano · 2018-09-26T14:10:14Z

Added another city to the fixtures, and networks for all ips we are currently using.

ruflin · 2018-09-27T08:50:05Z

On the one hand this is a really cool solution and would definitively solve our problem. On the other hand I think we should know when Geoip fields change, especially new fields added or removed. Also it makes adding new log files harder, meaning the geoip database file also has to be updated.

The other part that I think we shouldn't do is change our testing environment as at least I personally often use this setup also for personal testing, building dashboards etc. Having a non standard geoip library in there is not optimal.

When would we want our test to fail? Assuming an IP change it's coordinates/address a bit would not be relevant for our tests. New fields added or missing fields seems more important to me and I would expect tests to fail. We had this in the past but it was because geoip added new fields. How often does this happen? Is it possible that we know when and how ES updates the geoip lib?

jsoriano · 2018-09-27T09:41:18Z

Thanks @ruflin for your comments.

I think we should know when Geoip fields change, especially new fields added or removed.

I don't think GeoIP fields change so much, #8204 happened after changes in geoip plugin, not changes in the GeoIP DB, I agree that this case should require an action on our side, because for the same data the plugin will give a different set of values.

For changes in the schema of GeoIP DBs maybe we could have an additional explicit test, that downloads the Lite DB and checks for existing fields.

The other part that I think we shouldn't do is change our testing environment as at least I personally often use this setup also for personal testing, building dashboards etc. Having a non standard geoip library in there is not optimal.

We could try to do the override optional. Having tests depending on data we don't control is also not optimal 🙂

When would we want our test to fail?

On #8401 tests failed both because some coordinates changed, but also because an IP changed its location completely, I don't think we should worry about IP movements. I agree that new fields added by the geoip plugin should be detected and I think that with this solution they will.

How often does this happen?

I have seen it happening twice this month, not sure if it happened before, maybe ES started to update geoip data with more frequency.

Is it possible that we know when and how ES updates the geoip lib?

This is included in the docker image, I guess that it is downloaded when the image is built, but not sure, I'll check. In any case while using snapshots in builds we will always have builds failing before we update the expected values, even if we know when the DB has been updated.

Maybe the root problem is to use snapshot images, tests shouldn't depend on moving parts. We could use fixed versions on tests, and update them as part of the release process.

ph · 2018-09-28T17:56:56Z

Maybe the root problem is to use snapshot images, tests shouldn't depend on moving parts. We could use fixed versions on tests, and update them as part of the release process.

I would still prefer to keep using snapshot images, ES move fast and I would prefer to know early if we break something.

ruflin · 2018-10-01T19:26:57Z

++ on keep using the snapshots.

For how often it happens: I didn't see it for a long time, then around 6 months ago and now twice in the last week. Does that mean now it will disappear again for a long time? Not so sure.

If we do not care too much about the exact values of the geoip fields I would say it's best that we check the top level entry is there but not the exact content. If we generate it, we still generate all the fields. The nice part about this is we see that things changed when run GENERATE but having the changes will not fail the tests on old builds.

jsoriano · 2018-10-03T11:27:19Z

-- on using snapshots (at least by default 🙂)

I think that tests should be repeatable and its results reproducible, by using unversioned snapshots we cannot guarantee any of these things.

There have been very few problems with geoip but I see them now as symptoms. We had also tests failing by other reasons, as snapshot versions being removed from repositories, and the thing is that when these issues appear they affect to all builds of integration branches, generating additional noise in all active PRs to these branches. The errors happening when images are removed are fixed by using a new snapshot version, this indicates that at least during some periods of time we are not even testing the latest snapshot till it is manually updated, so even the advantage of testing with latest builds mitigates.

In my opinion tests should depend on fixed versions whenever possible (ideally on released versions, so they are better tested and have more chances to be available during more time), I see the advantages of using snapshot images to detect issues as soon as possible in integration branches, but I am not sure this justifies breaking all builds, specially when it affects a growing number of people.

We could test on fixed versions by default, and have something like daily builds using snapshot versions, this way we wouldn't be affecting PRs and we'd still detect issues. We could add a process to increase the fixed versions at least on every release.

jsoriano · 2018-10-04T12:30:11Z

I am going to close this by now, let's continue the discussion on snapshots next week offline 🙂

jsoriano added in progress Pull request is currently in progress. module Filebeat Filebeat flaky-test Unstable or unreliable test cases. labels Sep 26, 2018

jsoriano added the discuss Issue needs further discussion. label Sep 26, 2018

jsoriano force-pushed the geoip-test branch 2 times, most recently from 9cd8d0c to f5cec58 Compare September 26, 2018 13:06

Add GeoIP fixtures for filebeat tests

d688c4c

jsoriano force-pushed the geoip-test branch from f5cec58 to d688c4c Compare September 26, 2018 14:09

jsoriano added review and removed in progress Pull request is currently in progress. labels Sep 26, 2018

jsoriano closed this Oct 4, 2018

jsoriano deleted the geoip-test branch October 4, 2018 12:30

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add GeoIP db fixtures for filebeat tests#8447

Add GeoIP db fixtures for filebeat tests#8447
jsoriano wants to merge 1 commit intoelastic:masterfrom
jsoriano:geoip-test

jsoriano commented Sep 26, 2018 •

edited

Loading

Uh oh!

jsoriano commented Sep 26, 2018 •

edited

Loading

Uh oh!

jsoriano commented Sep 26, 2018

Uh oh!

ruflin commented Sep 27, 2018

Uh oh!

jsoriano commented Sep 27, 2018

Uh oh!

ph commented Sep 28, 2018

Uh oh!

ruflin commented Oct 1, 2018

Uh oh!

jsoriano commented Oct 3, 2018

Uh oh!

jsoriano commented Oct 4, 2018

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

jsoriano commented Sep 26, 2018 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

jsoriano commented Sep 26, 2018 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

jsoriano commented Sep 26, 2018

Uh oh!

ruflin commented Sep 27, 2018

Uh oh!

jsoriano commented Sep 27, 2018

Uh oh!

ph commented Sep 28, 2018

Uh oh!

ruflin commented Oct 1, 2018

Uh oh!

jsoriano commented Oct 3, 2018

Uh oh!

jsoriano commented Oct 4, 2018

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

jsoriano commented Sep 26, 2018 •

edited

Loading

jsoriano commented Sep 26, 2018 •

edited

Loading