forked from BelgianBiodiversityPlatform/python-dwca-reader
CHANGES.txt
v0.6.4 (2015-02-17)
===================
- Performance: an optional 'extension_to_ignores' parameter (List) can be passed to DwCAReader's constructor. When an archive contains large but unneeded extensions, this can greatly reduce memory consumption. A typical case for that would be the huge 'verbatim.txt' contained in GBIF downloads.
v0.6.3 (2015-02-16)
===================
- Performance: we now use core_id based indexes for extensions. There's a memory penalty, but extension file parsing is now only done once.
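
To illustrate the trade-off described above, here is a minimal sketch of a core_id-based index built with the standard library only. It is not the library's actual implementation: the function name `index_extension_rows`, the `core_id_column` parameter and the sample data are all hypothetical.

```python
import csv
import io
from collections import defaultdict


def index_extension_rows(stream, core_id_column=0):
    """Parse an extension file once and group its rows by core id.

    Hypothetical helper: the whole index is kept in memory (the memory
    penalty), but the file never has to be parsed again.
    """
    index = defaultdict(list)
    for row in csv.reader(stream):
        index[row[core_id_column]].append(row)
    return index


# Made-up extension data: the first column holds the id of the core row.
extension_file = io.StringIO("1,photo_a.jpg\n2,photo_b.jpg\n1,photo_c.jpg\n")
index = index_extension_rows(extension_file)

# Looking up the extension rows of a core row is now a dictionary access.
rows_for_core_1 = index["1"]
```

Without such an index, finding the extension rows of each core row would mean scanning the extension file again for every lookup.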
v0.6.2 (2015-01-26)
===================
- Better performance with extensions.
v0.6.1 (2015-01-09)
===================
- DwCAReader can now open unzipped (directory) Darwin Core Archives.
- More testing for Descriptor classes.
- Better compliance with the standard (http://rs.tdwg.org/dwc/terms/guides/text/):
* We now support default value (\n) for linesTerminatedBy and fieldsTerminatedBy.
- Lower memory use with large archives.
v0.6.0 (2014-08-08)
===================
- Better performance thanks to a better architecture
- API add: brand new _ArchiveDescriptor and _SectionDescriptor
- API change: DwCAReader.descriptor is an instance of _ArchiveDescriptor (previously BeautifulSoup)
- API remove: DwCAReader.core_rowtype (use DwCAReader.descriptor.core.type instead)
- API remove: DwCAReader.extensions_rowtype (use DwCAReader.descriptor.extensions_type instead)
- API remove: DwCAReader.core_terms (use DwCAReader.descriptor.core.terms instead)
v0.5.1 (2014-08-05)
===================
- Performance: dramatically improved performance of get_row_by_index/looping for large files by
building an index of line positions at file opening (so there's a slight overhead
there)
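
The idea of indexing line positions when the file is opened can be sketched as follows. This is an illustration using only the standard library, not the library's actual code: `build_line_offsets`, `read_line_at` and the sample data are hypothetical.

```python
import io


def build_line_offsets(stream):
    """Return the byte offset at which each line of a binary stream starts.

    This single pass is the slight overhead paid at opening time.
    """
    offsets = []
    position = stream.tell()
    for line in iter(stream.readline, b""):
        offsets.append(position)
        position += len(line)
    return offsets


def read_line_at(stream, offsets, index):
    """Jump straight to the requested line instead of re-reading the file."""
    stream.seek(offsets[index])
    return stream.readline().decode("utf-8").rstrip("\n")


# Made-up core file content, held in memory for the example.
data = io.BytesIO(b"id,name\n1,Apus apus\n2,Parus major\n")
offsets = build_line_offsets(data)
third_line = read_line_at(data, offsets, 2)
```

Once the offsets are known, fetching a line by index costs one seek and one read, whatever its position in the file.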
v0.5.0 (2014-01-21)
===================
- API new: DwCAReader.descriptor
- API change: "for core_line in dwca.each_line():" => "for core_row in dwca:"
- API change: from_core and from_extension attributes of DwCALine (and subclasses) have been removed.
The isinstance built-in function can be used to test if a line is an instance of DwCACoreLine
or of DwCAExtensionLine.
- API change: DwCAReader.lines => DwCAReader.rows
- API change: DwCACoreLine => CoreRow
- API change: DwCAExtensionLine => ExtensionRow
- API change: DwCAReader.get_line_by_id => DwCAReader.get_row_by_id
- API change: DwCAReader.get_line_by_index => DwCAReader.get_row_by_index
- API change: DwCAReader.get_row_by_* methods now raise RowNotFound on failure instead of returning None
- Cleaner code and better documentation.
v0.4.0 (2013-09-24)
===================
- API change: dwca.get_line() -> dwca.get_line_by_id()
- API new: dwca.get_line_by_index()
- (Core File) iteration order is now guaranteed for dwca.each_line()
- Refactoring: DwCALine subclassed (as DwCACoreLine and DwCAExtensionLine)
v0.3.3 (2013-09-05)
===================
- DwCALines are now hashable.
v0.3.2 (2013-08-28)
===================
- API: added the dwca.absolute_temporary_path() method.
v0.3.1 (2013-08-09)
===================
- Bugfix: lxml added as a requirement.
v0.3.0 (2013-08-08)
===================
- XML parsing (metadata, EML, ...) now uses BeautifulSoup 4 instead of v3.
v0.2.1 (2013-08-02)
===================
- Added a property (core_terms) to DwCAReader to get a list of the DwC terms in use in the core file.
v0.2.0 (2013-07-31)
===================
- Specific support for GBIF Data portal (occurrences) export.
- Small bug fixes.
v0.1.1 (2013-05-28)
===================
- Fixes packaging issues.
v0.1.0 (2013-05-28)
===================
- Initial release.