Commit

version bump
adbar committed Feb 19, 2021
1 parent 357a7ab commit fa30f8d
Showing 3 changed files with 16 additions and 1 deletion.
5 changes: 5 additions & 0 deletions HISTORY.md
@@ -1,6 +1,11 @@
## History / Changelog


### 0.3.1

- improve filter precision


### 0.3.0

- reduced dependencies: replace requests with bare urllib3, and tldextract with tld for Python 3.6 upwards
10 changes: 10 additions & 0 deletions README.rst
@@ -229,3 +229,13 @@ This effort is part of methods to derive information from web documents in order
- Barbaresi, A. "`Efficient construction of metadata-enhanced web corpora <https://hal.archives-ouvertes.fr/hal-01371704v2/document>`_", Proceedings of the `10th Web as Corpus Workshop (WAC-X) <https://www.sigwac.org.uk/wiki/WAC-X>`_, 2016.

Contact: see `homepage <https://adrien.barbaresi.eu/>`_ or `GitHub <https://github.com/adbar>`_.


Similar work
------------

These Python libraries perform similar normalization tasks but don't entail language or content filters:

- `ural <https://github.com/medialab/ural/>`_
- `urlnorm <https://github.com/kurtmckee/urlnorm>`_
- `yarl <https://github.com/aio-libs/yarl/>`_
2 changes: 1 addition & 1 deletion courlan/__init__.py
@@ -8,7 +8,7 @@
__author__ = 'Adrien Barbaresi'
__license__ = 'GNU GPL v3+'
__copyright__ = 'Copyright 2020-2021, Adrien Barbaresi'
-__version__ = '0.3.0'
+__version__ = '0.3.1'


# imports