Feature Request: Remove Top-Level Domain requirement #44

Closed
OttScott opened this issue Feb 17, 2022 · 13 comments · Fixed by #45
Labels: feature request (new feature)

@OttScott

The current version requires a URL for the Blacklist/Whitelist.
This feature request seeks to remove the requirement for a top-level domain (.com) and instead parse the entirety of the URL. This would allow specific words to be blocked instead of just sites, giving users more flexibility in how this tool is used.
So if someone found ballerinas distracting, they could just add 'ballerinas' or even 'balle*' to their blacklist. Or Football*, NFL*, or yahoo* (which would include yahoosports.com).

I'm sure you see the benefit of such a flexible approach. Thanks for consideration.

AXeL-dev added the feature request label on Feb 17, 2022
@AXeL-dev (Owner)

Hey,

So basically, we need to allow blocking using keywords instead of urls only? 🤔

@OttScott (Author)

That would be my preference. Site blocking is fine, but I can spend hours chasing down images or videos on Google or Bing, and I don't want to block those sites themselves.

@AXeL-dev (Owner)

Yeah, I get what you mean. Still, I'm wondering: when using keywords, shouldn't we inspect the website content as well & not only the URL(s)? Going through the content of all open tabs to block specific parts might be a bit tricky & may involve some performance pitfalls.

Anyway, if you have any specific suggestions on how to implement that, it would be helpful; otherwise I'll try to study it when I have some time.

@OttScott (Author)

Mmm. I'm not sure about content inspection. If I get a random targeted ad for NFL.com, I don't want my research page blocked. And I imagine managing the Whitelist vs Blacklist collisions in that space would get messy... I think URL validation against a set of keywords would be sufficient. And maybe leave Whitelist as URLs only for easier management and to avoid collisions.

If it's easier, perhaps optionally allow Regex instead of a URI? 😜

@AXeL-dev (Owner)

AXeL-dev commented Feb 17, 2022

Actually, regexes are already supported; it's just that the allowed regex format is a bit restricted (you can check that here & here as well).

So I think that if you try something like *.*NFL*.* as a URL, it would work for any domain name containing the NFL keyword.

Other variants could be:

  • *.NFL*.* for domain names that only start with the NFL keyword.
  • *.*NFL.* when the NFL keyword should be at the end.
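
For illustration, here is a rough sketch of how such wildcard patterns could map onto regular expressions. The actual conversion used by the extension isn't shown in this thread, so the helper below is purely an assumption:

```js
// Rough sketch (assumed, not the extension's actual conversion code):
// turn a wildcard pattern such as "*.*NFL*.*" into a RegExp and test it
// against a hostname.
function wildcardToRegExp(pattern) {
  // Escape regex metacharacters except "*", then turn each "*" into ".*"
  const escaped = pattern.replace(/[.+?^${}()|[\]\\]/g, '\\$&');
  return new RegExp('^' + escaped.replace(/\*/g, '.*') + '$', 'i');
}

wildcardToRegExp('*.*NFL*.*').test('www.nflshop.com'); // true: "NFL" anywhere in the domain
wildcardToRegExp('*.NFL*.*').test('www.nfl.com');      // true: a label starting with "NFL"
wildcardToRegExp('*.*NFL.*').test('fantasy.nfl.com');  // true: a label ending with "NFL"
```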

@OttScott (Author)

This still requires an NFL domain name, but doesn't prevent searches unless I block Google.
You can generalize the solution and get a lot more capability. This could even be used as a simple parental-control feature, especially if you maintained a default list of naughty words. By letting users block anything in the URL while still allowing whitelisting, you give them extensive flexibility.

@AXeL-dev (Owner)

AXeL-dev commented Feb 18, 2022

Well, I agree that searching for a keyword in the whole URL is more flexible & easier from a UX view 🤔. Still, it can't really be described as parental control, since we do not inspect the content of the pages, & on some websites the search query is not provided in the URL, so it won't work for such cases.

All this said, I think I'm going to add a keywords list feature, 'cause I don't wanna mess with the current blacklist/whitelist system (for each entered URL we try to fetch the website favicon, which won't work with keywords, & I honestly don't like merging separate features).

Thanks for all your suggestions by the way, & stay tuned! I may work on this feature this weekend.

Edit: if you have any other suggestions, feel free to share your thoughts.

@AXeL-dev (Owner)

AXeL-dev commented Feb 19, 2022

@OttScott what do you think of the following (from a UX perspective):

[Screen recording attachment: Peek 19-02-2022 12-53]

@OttScott (Author)

@AXeL-dev, I Love it!
I'm a little torn on whether the Whitelist should override the keywords as well, but this seems like a great way to handle this. Is it too much to ask for Regex processing on the Keywords too? 😁

@AXeL-dev (Owner)

AXeL-dev commented Feb 20, 2022

@AXeL-dev, I Love it! I'm a little torn on whether the Whitelist should override the keywords as well, but this seems like a great way to handle this.

I think that having keyword matching on both the whitelist & blacklist provides more flexibility (even if that feature may not be so useful on the whitelist).

Is it too much to ask for Regex processing on the Keywords too?

It's a good idea actually, & I struggled a little bit to find a good way to allow both text-only matching & regex matching, so I finally came up with this simple idea:

  • If the keyword has a / prefix & suffix, it will be handled as a regex. Example: /NFL.*/ would be treated as a regex, & you can use any regex pattern inside the slashes. Additionally, you can provide regex flags at the end, e.g. /test/i for case-insensitive matching.
  • Any other kind of keyword, or a badly formatted regex pattern, will be handled as simple text.
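
(For illustration, a minimal sketch of how this convention might look in code; the helper name keywordMatches & the exact parsing are assumptions, not the extension's actual implementation:)

```js
// Keywords wrapped in slashes are treated as regexes with optional flags;
// anything else falls back to case-insensitive plain-text matching.
function keywordMatches(keyword, url) {
  const regexFormat = /^\/(.*)\/([a-z]*)$/; // "/pattern/flags"
  const parsed = keyword.match(regexFormat);
  if (parsed) {
    try {
      return new RegExp(parsed[1], parsed[2]).test(url);
    } catch (e) {
      // Badly formatted pattern or flags: fall through to plain text
    }
  }
  return url.toLowerCase().includes(keyword.toLowerCase());
}

keywordMatches('/NFL.*/i', 'https://www.nfl.com');              // true (regex with "i" flag)
keywordMatches('ballerinas', 'https://example.com/Ballerinas'); // true (plain text)
```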

WDYT @OttScott ?

@OttScott (Author)

Why not just try both? If a regex match of the string fails to provide any results, then run a full-text search (tolower, of course). That way you don't need any funny documentation about how the regex has to be supplied. It just takes two passes per keyword.
🤷‍♂️
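
For comparison, a sketch of that two-pass idea (hypothetical helper, not what the extension ultimately implemented): try the keyword as a regex first, and fall back to a lowercase substring search when the pattern is invalid or doesn't match.

```js
function matchesEitherWay(keyword, url) {
  try {
    // First pass: treat the keyword as a regex
    if (new RegExp(keyword).test(url)) {
      return true;
    }
  } catch (e) {
    // Not a valid regex: ignore and fall back to plain text below
  }
  // Second pass: case-insensitive substring search
  return url.toLowerCase().includes(keyword.toLowerCase());
}

matchesEitherWay('balle.*', 'https://example.com/ballerinas'); // true via the regex pass
matchesEitherWay('NFL', 'https://www.nfl.com');                // true via the lowercase fallback
```

One drawback, raised in the next comment: this leaves no place to specify regex flags.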

@AXeL-dev (Owner)

AXeL-dev commented Feb 21, 2022

Why not just try both? If a regex match of the string fails to provide any results, then run a full-text search (tolower, of course).

In this case, the user won't be able to set the regex flags unless we add a second input to specify them (& from a UX view, I don't like that). Another solution would be to add a switch next to the keyword input with 2 choices (string or regex), but I really don't wanna go through all these complications, 'cause then I'd need to change the keywords data structure & that would require more work (I'm lazy by nature 😁).

That way you don't need any funny documentation about how the regex has to be supplied. It just takes two passes per keyword. 🤷‍♂️

Well, I think that simple documentation like the following is quite enough:

[Screenshot attachment: Capture d'écran Deepin_zone de sélection _20220221101442]

Otherwise, I think the regex format I used is widely known (it's the same as the JavaScript regex format). Another point is that I don't want people to abuse regex patterns in keywords, 'cause you can really match anything; you could even enter a URL regex & it would work, which makes the URLs list kinda useless. That's why I prefer to keep regexes in keywords as a side/additional feature & not the main one.

@AXeL-dev (Owner)

Deployed in version 2.6.0.
