Metadata-Version: 2.0
Name: url-normalize
Version: 1.3.1
Summary: URL normalization for Python
Home-page: https://github.com/niksite/url-normalize
Author: Nikolay Panov
Author-email: github@niksite.ru
License: Python
Platform: UNKNOWN
Classifier: Environment :: Web Environment
Classifier: Intended Audience :: Developers
Classifier: Operating System :: OS Independent
Classifier: Topic :: Text Processing :: Indexing
Classifier: Topic :: Utilities
Classifier: Topic :: Internet
Classifier: Topic :: Software Development :: Libraries :: Python Modules
Classifier: Programming Language :: Python
Classifier: Programming Language :: Python :: 3

url-normalize
=============

URI Normalization function:
   * Take care of IDN domains.
   * Always provide the URI scheme in lowercase characters.
   * Always provide the host, if any, in lowercase characters.
   * Only perform percent-encoding where it is essential.
   * Always use uppercase A-through-F characters when percent-encoding.
   * Prevent dot-segments appearing in non-relative URI paths.
   * For schemes that define a default authority, use an empty authority if the default is desired.
   * For schemes that define an empty path to be equivalent to a path of "/", use "/".
   * For schemes that define a port, use an empty port if the default is desired
   * All portions of the URI must be utf-8 encoded NFC from Unicode strings

Inspired by Sam Ruby's urlnorm.py: http://intertwingly.net/blog/2004/08/04/Urlnorm

Example:
```
$ pip install git+git://github.com/niksite/url-normalize.git
Collecting git+git://github.com/niksite/url-normalize.git
  Cloning git://github.com/niksite/url-normalize.git to /tmp/pip-trXUik-build
Installing collected packages: url-normalize
  Running setup.py install for url-normalize
Successfully installed url-normalize-1.2
$ python
Python 2.7.11 (default, Dec  8 2015, 23:51:37)
[GCC 4.9.2] on linux2
Type "help", "copyright", "credits" or "license" for more information.
> from url_normalize import url_normalize
> print url_normalize('www.foo.com:80/foo')
> http://www.foo.com/foo
```

History:
   * 05 Jan 2016: Python 3 compatibility
   * 29 Dec 2015: PEP8, setup.py
   * 10 Mar 2010: support for shebang (#!) urls
   * 28 Feb 2010: using 'http' schema by default when appropriate
   * 28 Feb 2010: added handling of IDN domains
   * 28 Feb 2010: code pep8-zation
   * 27 Feb 2010: forked from Sam Ruby's urlnorm.py


