-
Notifications
You must be signed in to change notification settings - Fork 46
Convert URL's to a normalized unicode format
jehiah/urlnorm
Folders and files
Name | Name | Last commit message | Last commit date | |
---|---|---|---|---|
Repository files navigation
urlnorm.py ========== Normalize a URL to a standard unicode representation urlnorm normalizes a URL by: * lowercasing the scheme and hostname * converting the hostname to IDN format * taking out default port if present (e.g., http://www.foo.com:80/) * collapsing the path (./, ../, etc) * removing the last character in the hostname if it is '.' * unquoting any % escaped characters (where possible) Installation ============ pip install urlnorm Example ======= >>> import urlnorm >>> urlnorm.norm("http://xn--q-bga.com./u/u/../%72/l/") u'http://q\xe9.com/u/r/l/'
About
Convert URL's to a normalized unicode format
Resources
Stars
Watchers
Forks
Packages 0
No packages published