Adds detection for GeedoProductSearch and Elastic Synthetics (#7679)
* Adds detection for GeedoProductSearch
* Adds detection for Elastic Synthetics
ref #7677
liviuconcioiu authored May 8, 2024
1 parent b471ac7 commit 413436d
Showing 2 changed files with 28 additions and 0 deletions.
15 changes: 15 additions & 0 deletions Tests/fixtures/bots.yml
@@ -7549,3 +7549,18 @@
    name: Castopod
    category: Crawler
    url: https://www.castopod.org/
-
  user_agent: Mozilla/5.0 AppleWebKit/537.36 (KHTML, like Gecko; compatible; GeedoProductSearch; +http://www.geedo.com/product-search.html) Chrome/79.0.3945.88 Safari/537.36
  bot:
    name: GeedoProductSearch
    category: Crawler
    url: https://geedo.com/product-search/
-
  user_agent: Mozilla/5.0 (X11; Linux x86_64) AppleWebKit/537.36 (KHTML, like Gecko) HeadlessChrome/121.0.6167.57 Safari/537.36 Elastic/Synthetics
  bot:
    name: Elastic Synthetics
    category: Site Monitor
    url: https://github.com/elastic/synthetics
    producer:
      name: Elasticsearch B.V.
      url: https://www.elastic.co/
13 changes: 13 additions & 0 deletions regexes/bots.yml
@@ -3707,6 +3707,11 @@
  category: 'Crawler'
  url: 'https://geedo.com/bot/'

- regex: 'GeedoProductSearch'
  name: 'GeedoProductSearch'
  category: 'Crawler'
  url: 'https://geedo.com/product-search/'

- regex: 'BackupLand(?:/([\d+.]+))?'
  name: 'BackupLand'
  category: 'Crawler'
@@ -4427,6 +4432,14 @@
  category: 'Crawler'
  url: 'https://www.castopod.org/'

- regex: 'Elastic/Synthetics'
  name: 'Elastic Synthetics'
  category: 'Site Monitor'
  url: 'https://github.com/elastic/synthetics'
  producer:
    name: 'Elasticsearch B.V.'
    url: 'https://www.elastic.co/'

# Generic bots
- regex: 'nuhk|grub-client|Download Demon|SearchExpress|Microsoft URL Control|borg|altavista|dataminr\.com|teoma|oegp|http%20client|htdig|mogimogi|larbin|scrubby|searchsight|semanticdiscovery|snappy|vortex(?!(?: Build|Plus))|zeal(?!ot)|dataparksearch|findlinks|BrowserMob|URL2PNG|ZooShot|GomezA|Google SketchUp|Read%20Later|7Siters|centuryb\.o\.t9|InterNaetBoten|EasyBib AutoCite|Bidtellect|tomnomnom/meg|cortex|Re-re Studio|adreview|AHC/|NameOfAgent|Request-Promise|ALittle Client|Hello,? world|wp_is_mobile|0xAbyssalDoesntExist|Anarchy99|^revolt|nvd0rz|xfa1|Hakai|gbrmss|fuck-your-hp|IDBTE4M CODE87|Antoine|Insomania|Hells-Net|b3astmode|Linux Gnu \(cow\)|Test Certificate Info|iplabel|Magellan|TheSafex?Internetx?Search|Searcherweb|kirkland-signature|LinkChain|survey-security-dot-txt|infrawatch|^xenu|^(?:chrome|firefox|KvshClient|Zeus|ZmEu)$'
  name: 'Generic Bot'
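
As a quick sanity check of the two additions, the sketch below runs the new regexes against the new fixture user agents. It is an illustration only and not the project's own matching code: the names `NEW_BOT_PATTERNS`, `FIXTURE_USER_AGENTS`, and `match_bot` are hypothetical, and a plain case-insensitive `re.search` is assumed as a stand-in for however the library actually applies the patterns in regexes/bots.yml.

```python
import re

# Patterns copied from the regexes/bots.yml additions in this commit.
NEW_BOT_PATTERNS = [
    ("GeedoProductSearch", r"GeedoProductSearch"),
    ("Elastic Synthetics", r"Elastic/Synthetics"),
]

# User agents copied from the Tests/fixtures/bots.yml additions.
FIXTURE_USER_AGENTS = {
    "GeedoProductSearch": (
        "Mozilla/5.0 AppleWebKit/537.36 (KHTML, like Gecko; compatible; "
        "GeedoProductSearch; +http://www.geedo.com/product-search.html) "
        "Chrome/79.0.3945.88 Safari/537.36"
    ),
    "Elastic Synthetics": (
        "Mozilla/5.0 (X11; Linux x86_64) AppleWebKit/537.36 "
        "(KHTML, like Gecko) HeadlessChrome/121.0.6167.57 "
        "Safari/537.36 Elastic/Synthetics"
    ),
}


def match_bot(user_agent):
    """Return the name of the first pattern found in user_agent, or None.

    Assumption: a simple case-insensitive search, not the library's matcher.
    """
    for name, pattern in NEW_BOT_PATTERNS:
        if re.search(pattern, user_agent, re.IGNORECASE):
            return name
    return None


if __name__ == "__main__":
    for expected, user_agent in FIXTURE_USER_AGENTS.items():
        print(expected, "->", match_bot(user_agent))
```

Each fixture entry pairs a user agent with the bot name the corresponding regex entry is expected to yield, which is the relationship this check exercises.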