CHANGES.txt

v0.6.4 (2015-02-17)
===================

- Performance: an optional 'extension_to_ignores' parameter (List) can be passed to DwCAReader's constructor. In cases where an archive contains large but unneeded extensions, this can greatly improve memory consumption. A typical case fot that would be the huge 'verbatim.txt' contained in GBIF downloads.

v0.6.3 (2015-02-16)
===================

- Performance: we now use core_id based indexes for extensions. There's a memory penalty, but extension file parsing is now only done once. 

v0.6.2 (2015-01-26)
===================

- Better performance with extensions.

v0.6.1 (2015-01-09)
===================

- It can now open not zipped (directory) Darwin Core Archives
- More testing for Descriptor classes.
- Better respect of the standard (http://rs.tdwg.org/dwc/terms/guides/text/):
    * We now support default value (\n) for linesTerminatedBy and fieldsTerminatedBy.
- Lower memory use with large archives.

v0.6.0 (2014-08-08)
===================

- Better performance thanks to a better architecture
- API add: brand new _ArchiveDescriptor and _SectionDescriptor
- API change: DwCAReader.descriptor is an instance of _ArchiveDescriptor (previously BeautifulSoup)
- API remove: DwCAReader.core_rowtype (use DwCAReader.descriptor.core.type instead)
- API remove: DwCAReader.extensions_rowtype (use DwCAReader.descriptor.extensions_type instead)
- API remove: DwCAReader.core_terms (use DwCAReader.descriptor.core.terms instead)

v0.5.1 (2014-08-05)
===================

- Performance: dramatically improved performance of get_row_by_index/looping for large files by
               building an index of line positions at file opening (so there's a slight overhead
               there)

v0.5.0 (2014-01-21)
===================

- API new: DwCAReader.descriptor
- API change: "for core_line in dwca.each_line():" => "for core_row in dwca:"
- API change: from_core and from_extension attributes of DwCALine (and sublasses) have been removed.
              The isinstance built-in function can be used to test if a line is an instance of DwCACoreLine
              or of DwCAExtensionLine.
- API change: DwCAReader.lines => DwCAReader.rows
- API change: DwCACoreLine => CoreRow
- API change: DwCAExtensionLine => ExtensionRow
- API change: DWCAReader.get_line_by_id => DWCAReader.get_row_by_id
- API change: DWCAReader.get_line_by_index => DWCAReader.get_row_by_index
- API change: DwCAReader.get_row_by_* methods throw RowNotFound when failure instead of returning None

- Cleaner code and better documentation.

v0.4.0 (2013-09-24)
===================

- API change: dwca.get_line() -> dwca.get_line_by_id()
- API new: dwca.get_line_by_index()
- (Core File) iteration order is now guaranteed for dwca.each_line()
- Refactoring: DwCALine subclassed (as DwCACoreLine and DwCAExtensionLine)

v0.3.3 (2013-09-05)
===================

- DwCALines are now hashable.

v0.3.2 (2013-08-28)
===================

- API: added the dwca.absolute_temporary_path() method.

v0.3.1 (2013-08-09)
===================

- Bugfix: lxml added as a requirement.

v0.3.0 (2013-08-08)
===================

- XML parsing (metadata, EML, ...) now uses BeautifulSoup 4 instead of v3.

v0.2.1 (2013-08-02)
===================

- Added a property (core_terms) to DwCAReader to get a list the DwC terms in use in the core file.

v0.2.0 (2013-07-31)
===================

- Specific support for GBIF Data portal (occurrences) export.
- Small bug fixes.

v0.1.1 (2013-05-28)
===================

- Fixes packaging issues.

v0.1.0 (2013-05-28)
===================

 - Initial release.