Skip to content

Releases: crealytics/spark-excel

File format v2

23 Aug 19:47
4c22545
Compare
Choose a tag to compare
#210 File format v2 (#389)

* register data source for .format("excel")

* ignore .vscode

* V2 with new Spark Data Source API, uses FileDataSourceV2

* set header default to true, got 1st test passed

* ExcelHelper become options awareness

* handle string type for error-formula

* PlainNumberReadSuite is good now. Also fixed the issue in #285. This introduces a breaking change (good, I think)

* test-case for issue_285

* Handling Error Cells and Undefined Rows

* Test cases for #52 #74 #97 issues

* format & test cases for column pruning (projection)

* Added more test-cases for numerical types

* Stricter numerical types (Integer, Long and Double) in schema inferring. Issue #162

* preparing for final push on writing

* Apply format & Writing is working

* Added excel-row-number column for issues #40 #59 #115 and refactoring

* refactoring unit-tests

* preparing for MR

* Update all test-cases with ScalaTest 3.x

* Writing aware about dataAddress

* writing with dataAddress; No change on dependencies nor build script

* Schema Infering Improvement: {Iterator instead of Seq; Use both samplingRatio and excerptSize}

* added more recent spark version to CI/CD

* support from spark 2.4.1 up

* Fix scalastyle check & enable non-ascii character due to native of unit-tests

* Update src/main/2.4/scala/com/crealytics/spark/v2/excel/ExcelDataSource.scala

Co-authored-by: Martin Mauch <[email protected]>

* Update src/main/2.4/scala/com/crealytics/spark/v2/excel/ExcelDataSource.scala

Co-authored-by: Martin Mauch <[email protected]>

* spark-excel examples in Jupyter Notebook

Co-authored-by: Martin Mauch <[email protected]>

Support processing ERROR cells v2

12 Aug 13:25
cc6a64c
Compare
Choose a tag to compare

Fix #399: Support error type for formula cell

Support processing ERROR cells

22 Feb 08:47
Compare
Choose a tag to compare

v0.13.7

Fix #343 : Support processing ERROR cell types.

Add options to use different number formatter

10 Dec 13:50
49292da
Compare
Choose a tag to compare
v0.13.6

Build pushes to main branch

v0.13.5

08 Aug 15:53
Compare
Choose a tag to compare
  • Improve the loading time for big files. (#280 )

Various updates

05 Aug 13:03
Compare
Choose a tag to compare
v0.13.4

Trigger publishing for Github Release

Fix scala.Some is not valid type

18 May 07:11
Compare
Choose a tag to compare

Switch to @pjfanning's branch of excel-streaming-reader

11 May 13:48
Compare
Choose a tag to compare
v0.13.2

POI 4.1.2 and switch to excel-streaming-reader fork

0.13.1

01 Mar 18:49
74ab86f
Compare
Choose a tag to compare

Fixes:

  • Do not skip first line if header=false #215

Rename useHeader option to header

23 Feb 21:47
Compare
Choose a tag to compare

In order to align with the spark CSV reader, the name of the useHeader option was shortened to just header.
Thanks @EnverOsmanov!