Staging/main/0.10.8 #1081

Merged 5 commits on Jan 11, 2024

Commits on Dec 12, 2023

  1. Feature: added parquet sampling (#1070)

    * parquet sampling function developed in data_utils.py; Added sample_nrows argument in ParquetData class; Added test_len_sampled_data in test_parquet_data.py
    
    * resolved conflict with dev, added more tests
    
    * fixed sample empty column bug
    
    * fixed comments in data_utils.py, including:
    1. added the return type to the sample_parquet function;
    2. renamed variables in the sample_parquet function to more descriptive names (select -> sample_index, out -> sample_df);
    3. created the convert_unicode_col_to_utf8 function to reduce repeated code in the sample_parquet and read_parquet_df functions
    
    * 1. renamed variables in the convert_unicode_col_to_utf8 function (data_utils.py) to be more descriptive (types -> input_column_types, col -> iter_column), other parts unchanged
    2. test_parquet_data.py: moved the import statement to the top of the file
    3. test_parquet_data.py: merged all parquet sampling tests into their original tests
    
    * checked the datatype and input file path before and after reload with sampling option enabled
    
    * test
    
    * deleted test edit in avro_data.py, updated fastavro version in requirements.txt
    
    * remove fastavro.reader type
    
    * change fastavro version back to original
    
    * 1. sample_parquet function description
    2. test_len_data method: keep only one sample length test
    3. remove sampling test in test_specifying_data_type
    4. remove sampling test in test_reload_data
    menglinw committed Dec 12, 2023
    0d56dac
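
For readers who want a feel for the row-sampling idea described in #1070, here is a minimal sketch. It is not the `sample_parquet` implementation from data_utils.py, which this page does not show. The function name `sample_rows_from_batches` and its parameters are hypothetical; only `sample_nrows` and `sample_index` echo names from the commit message. It assumes the file is read as a sequence of row batches (for parquet, a reader such as pyarrow's `ParquetFile.iter_batches` could supply them) and that the total row count is known up front. Plain lists stand in for the batches so the sketch runs with the standard library only.

```python
import random
from typing import Iterable, List, Sequence


def sample_rows_from_batches(
    batches: Iterable[Sequence],
    total_rows: int,
    sample_nrows: int,
    seed: int = 0,
) -> List:
    """Draw sample_nrows rows without replacement, preserving file order.

    Hypothetical helper for illustration; not DataProfiler's API.
    """
    if sample_nrows < 0:
        raise ValueError("sample_nrows must be non-negative")
    if sample_nrows >= total_rows:
        # Nothing to sample away: return every row.
        return [row for batch in batches for row in batch]

    # Pick the global row numbers once, then sort them so the batches
    # can be consumed in a single forward pass.
    rng = random.Random(seed)
    sample_index = sorted(rng.sample(range(total_rows), sample_nrows))

    sampled = []
    cursor = 0  # position in sample_index
    offset = 0  # global row number of the first row in the current batch
    for batch in batches:
        batch_end = offset + len(batch)
        # Collect every selected row that falls inside this batch.
        while cursor < len(sample_index) and sample_index[cursor] < batch_end:
            sampled.append(batch[sample_index[cursor] - offset])
            cursor += 1
        offset = batch_end
        if cursor == len(sample_index):
            break  # all selected rows found; skip the remaining batches
    return sampled


# Three stand-in "row groups" holding rows 0..9.
batches = [list(range(0, 4)), list(range(4, 7)), list(range(7, 10))]
sample = sample_rows_from_batches(batches, total_rows=10, sample_nrows=3, seed=1)
everything = sample_rows_from_batches(batches, total_rows=10, sample_nrows=50)
```

Sorting the selected indices first means each batch is visited at most once and reading can stop early, which is the main reason to sample by index rather than load the whole file and subsample afterwards.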

Commits on Jan 8, 2024

  1. Dependency: matplotlib version bump (#1072)

    * bump tag matplotlib
    
    * bump to most recent
    
    * 3.9.0 update
    taylorfturner committed Jan 8, 2024
    93268f8
  2. Bump actions/setup-python from 4 to 5 (#1078)

    Bumps [actions/setup-python](https://github.com/actions/setup-python) from 4 to 5.
    - [Release notes](https://github.com/actions/setup-python/releases)
    - [Commits](actions/setup-python@v4...v5)
    
    ---
    updated-dependencies:
    - dependency-name: actions/setup-python
      dependency-type: direct:production
      update-type: version-update:semver-major
    ...
    
    Signed-off-by: dependabot[bot] <[email protected]>
    Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
    Co-authored-by: Taylor Turner <[email protected]>
    dependabot[bot] and taylorfturner committed Jan 8, 2024
    683a91e

Commits on Jan 10, 2024

  1. Make _assimilate_histogram not use self (#1071)

    Co-authored-by: Taylor Turner <[email protected]>
    junholee6a and taylorfturner committed Jan 10, 2024
    4a4329d

Commits on Jan 11, 2024

  1. version bump

    taylorfturner committed Jan 11, 2024
    c7fe089