blogspot.* recognized as ICANN TLD #112

Closed
ghost opened this issue Nov 9, 2016 · 2 comments

ghost commented Nov 9, 2016

Hi,
I noticed that example.blogspot.com is split into example and blogspot.com.
I see this behavior in Python 3.5.2, but it works in Python 2.7.2.

I wanted to delete the cache file in /usr/local/lib/python3.5/dist-packages/tldextract/.tld_set but it was not there. Updating with tldextract -u solved the problem.

ghost commented Nov 9, 2016

I wanted to delete the cache file in /usr/local/lib/python3.5/dist-packages/tldextract/.tld_set but it was not there. Force updating with tldextract -u solved the problem. Still not sure which cache file it used.

@john-kurkowski (Owner)

That blogspot.* parsing is expected if you use tldextract with the include_psl_private_domains=True arg. The default is False.

Sounds like one of your caches was created with include_psl_private_domains=True? tldextract -u is indeed the workaround. That will recreate the cache with the False arg.

The cache inconsistency is a dupe of #66.
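
To illustrate why the flag changes the split, here is a minimal sketch of longest-suffix matching. It is not tldextract's actual implementation: `split_host`, `ICANN_SUFFIXES`, and `PRIVATE_SUFFIXES` are hypothetical names, and the suffix sets are tiny hand-picked stand-ins for the real Public Suffix List.

```python
# Toy suffix sets for illustration only; the real Public Suffix List is far larger.
ICANN_SUFFIXES = {"com", "uk", "co.uk"}
PRIVATE_SUFFIXES = {"blogspot.com"}  # stands in for the PSL's private section


def split_host(host, include_private=False):
    """Return (subdomain, domain, suffix) using longest-suffix matching."""
    if include_private:
        suffixes = ICANN_SUFFIXES | PRIVATE_SUFFIXES
    else:
        suffixes = ICANN_SUFFIXES
    labels = host.lower().split(".")
    # Try the longest candidate suffix first, so "blogspot.com" wins over "com"
    # whenever private suffixes are part of the set.
    for i in range(len(labels)):
        candidate = ".".join(labels[i:])
        if candidate in suffixes:
            domain = labels[i - 1] if i > 0 else ""
            subdomain = ".".join(labels[: i - 1]) if i > 1 else ""
            return subdomain, domain, candidate
    # No known suffix: treat the last label as the domain (a toy fallback).
    return ".".join(labels[:-1]), labels[-1], ""


print(split_host("example.blogspot.com"))
print(split_host("example.blogspot.com", include_private=True))
```

With private suffixes excluded, the host splits into subdomain example, domain blogspot, suffix com. With them included, blogspot.com is itself the suffix and example becomes the domain, which is the split reported above.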
