feat: add the XPath/RegEx Data Extractor and Expression Evaluation for HTTP Probe #192

haoel · 2022-08-10T07:39:20Z

try to support the following expression evaluation.

http:
  - name: EaseProbe RSS Feed
    url: https://github.com/megaease/easeprobe/releases.atom
    eval:
      doc : XML # <-- document type, currently support HTML, XML, JSON, TEXT
      expression: "updated > '2022-07-01'"  # <-- the expression need to be evaluated.
      # define how the variable `updated` come from
      variables: 
         - name: updated
           type: time # support int/float/string/bool/time/duration 
           query: "//feed/updated" # XPath 
           time_format: "2006-01-02T15:04:05Z07:00"

or simple one

http:
  - name: EaseProbe RSS Feed
    url: https://github.com/megaease/easeprobe/releases.atom
    proxy: socks5://localhost:1085
    eval:
      doc : XML
      expression: "x_time('//feed/updated') > '2022-07-01'"

Note: x_str(), x_int(), x_float(), x_time(), x_bool(), x_duration() are the functions can extract the data by using XPath or Regex expression

Please focus on those files

eval/*.go
probe/http/http.go

Note:

There are many files that have been changed because of the go 1.19 format.

…r HTTP probe

codecov-commenter · 2022-08-10T07:41:47Z

Codecov Report

Merging #192 (f36cc03) into main (504c529) will increase coverage by 0.41%.
The diff coverage is 100.00%.

@@            Coverage Diff             @@
##             main     #192      +/-   ##
==========================================
+ Coverage   93.92%   94.34%   +0.41%     
==========================================
  Files          46       49       +3     
  Lines        3655     3926     +271     
==========================================
+ Hits         3433     3704     +271     
  Misses        155      155              
  Partials       67       67

Impacted Files	Coverage Δ
conf/conf.go	`80.61% <ø> (ø)`
conf/log.go	`85.48% <ø> (ø)`
probe/data.go	`65.18% <ø> (ø)`
probe/status.go	`76.47% <ø> (ø)`
report/common.go	`96.59% <ø> (ø)`
eval/eval.go	`100.00% <100.00%> (ø)`
eval/extract.go	`100.00% <100.00%> (ø)`
eval/types.go	`100.00% <100.00%> (ø)`
probe/http/http.go	`97.97% <100.00%> (+0.17%)`	⬆️

Help us with your feedback. Take ten seconds to tell us how you rate us. Have a feature suggestion? Share it here.

…xpression

zhao-kun

IMHO, If I implemented the feature, I would prefer to splitting the Evaluator and Configuration into different objects, and the Evaluator object has only one in the global, configuration object for each HTTP prober.

Just a different design style

zhao-kun · 2022-08-12T03:58:12Z

eval/eval.go

+		return false, err
+	}
+
+	expression, err := govaluate.NewEvaluableExpressionWithFunctions(e.Expression, e.EvalFuncs)


The result of govaluate.NewEvaluableExpressionWithFunctions(e.Expression, e.EvalFuncs) can be cached which should minor improve performance. [1]

[1] https://github.com/Knetic/govaluate/blob/master/benchmarks_test.go#L104-L114

haoel · 2022-08-12T04:38:52Z

IMHO, If I implemented the feature, I would prefer to splitting the Evaluator and Configuration into different objects, and the Evaluator object has only one in the global, configuration object for each HTTP prober.

Just a different design style

I considered separating the Evaluator and Configuration, but they are the exactly same object which would introduce the duplication code we have to maintain their consistency. And there is a Variable object inside, so finally, just use one struct for both of them.

If we only use one global Evaluator object for all HTTP probers, then we have to manage the different configurations for different HTTP probers, as they would have different document format(xml/html/json/txt) and different expressions. And this also would introduce multiple threads complexity. I prefer to have the dedicated Evaluator object for each HTTP probers.

feat: add the XPath/RegEx Data Extractor and Expression Evaluation fo…

e2ccd00

…r HTTP probe

lint warning

f36cc03

haoel force-pushed the eval branch from 93a72dc to f36cc03 Compare August 11, 2022 06:09

haoel added 3 commits August 11, 2022 19:10

1)remove time_format, 2) support the xpatch in place

28865aa

refactoring the unit test

d150ad6

add a test case mix the extract funciton and variable in evaluation e…

0290a6b

…xpression

haoel requested review from localvar and zhao-kun August 12, 2022 03:30

zhao-kun reviewed Aug 12, 2022

View reviewed changes

localvar approved these changes Aug 12, 2022

View reviewed changes

zhao-kun approved these changes Aug 15, 2022

View reviewed changes

zhao-kun merged commit 46f758f into megaease:main Aug 15, 2022

haoel mentioned this pull request Oct 17, 2022

bug-fixing: http probe report the unsupported document type #235

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: add the XPath/RegEx Data Extractor and Expression Evaluation for HTTP Probe #192

feat: add the XPath/RegEx Data Extractor and Expression Evaluation for HTTP Probe #192

haoel commented Aug 10, 2022 •

edited

Loading

codecov-commenter commented Aug 10, 2022 •

edited

Loading

zhao-kun left a comment •

edited

Loading

zhao-kun Aug 12, 2022

haoel commented Aug 12, 2022

feat: add the XPath/RegEx Data Extractor and Expression Evaluation for HTTP Probe #192

feat: add the XPath/RegEx Data Extractor and Expression Evaluation for HTTP Probe #192

Conversation

haoel commented Aug 10, 2022 • edited Loading

codecov-commenter commented Aug 10, 2022 • edited Loading

Codecov Report

zhao-kun left a comment • edited Loading

Choose a reason for hiding this comment

zhao-kun Aug 12, 2022

Choose a reason for hiding this comment

haoel commented Aug 12, 2022

haoel commented Aug 10, 2022 •

edited

Loading

codecov-commenter commented Aug 10, 2022 •

edited

Loading

zhao-kun left a comment •

edited

Loading