Conversation
[git-generate]
find . -type f -exec dos2unix {} \;
[git-generate]
find . -type f -exec gsed -i -r 's/[[:space:]]+$//g' {} \;
[git-generate]
find . -type f -name '*.hbs' -exec gsed -i -r '$a\' {} \;
find . -type f -name '*.yml' -exec gsed -i -r '$a\' {} \;
- rely on generator to set header in README - use current ECS version - single quotes to reduce escape proliferation - tag failable processors - use non-escaping mustache braces - use the timestamp in the document to set the @timestamp - use canonical error message string
🌐 Coverage report
|
cf9aa03 to
048a218
Compare
Assumes that the tychon log input is line-based. Fields in tychon.arp need to be extracted out to ECS fields and field definitions need to be fleshed out. Sytem test now becomes trivial.
|
@efd6 Three of these items are somewhat straight forward #2 There is no way to currently retain the event.original despite the option in the agent config template. To do this the JSON parsing should be move to ingest so that the unparsed JSON text can be retained. I imagine, but don't know for sure if this is true, that the original data is single line JSON since it's coming from a log file. #3 There is no system test (this would confirm/refute the assumption above and so is help even only in that way). This is also a blocker since as a log file input there is no reason not to have this present in the package. #4 I think the data streams in general (and demonstrated here) stutter; it would be better if the datastream were arp instead of tychon_arp since that will show up to the user as tychon.tychon_arp in the datastream.dataset field in their index. #1 destination, host, id and network are either non-ECS compliant or have child fields that are not ECS-compliant. Ideally, these would all be put under tychon and then have ECS fields populated from the relevant information there. As is, this cannot be merged. |
048a218 to
88955b4
Compare
|
- add system test infrastucture (not passing yet) - convert from logfile to filestream input - use more conventional field definition structure
logs require trailing new lines for... reasons.
b2c0fca to
ee62547
Compare
at some point there were corrupted (probably in the file sanitisation steps)
ee62547 to
8430e26
Compare
|
@joeperuzzi I have the system tests functioning now and have added back the |
|
@efd6 Thanks! These are the universal changes I caught that we'll apply across the rest of the project streams from your code push: Items we can easily take care of:
Items that are easy changes but need more testing from our QA team: These items need more in-depth discussion as these types of changes will have drastic effects to the integration for our own items, the integration with Elastic Defend, and other third-party integrations using this data
|
|
Hi! We just realized that we haven't looked into this PR in a while. We're sorry! We're labeling this issue as |
|
Hi! This PR has been stale for a while and we're going to close it as part of our cleanup procedure. We appreciate your contribution and would like to apologize if we have not been able to review it, due to the current heavy load of the team. Feel free to re-open this PR if you think it should stay open and is worth rebasing. Thank you for your contribution! |
DISCUSSION ONLY DO NOT MERGE
There are still issues with this:
destination,host,idandnetworkare either non-ECS compliant or have child fields that are not ECS-compliant. Ideally, these would all be put under tychon and then have ECS fields populated from the relevant information there. As is, this cannot be merged.event.originaldespite the option in the agent config template. To do this the JSON parsing should be move to ingest so that the unparsed JSON text can be retained. I imagine, but don't know for sure if this is true, that the original data is single line JSON since it's coming from a log file.