Release v0.11.1 #248

sundarshankar89 · 2025-06-23T16:05:19Z

Expose the number of available CPUs for concurrent processing (#244). The library now provides a method to determine the available CPU count for concurrent processing, which is used to optimize parallel task execution. This method attempts to obtain the number of logical CPUs available for the process through various approaches, including using the os module's process_cpu_count attribute, the sched_getaffinity function on Linux systems, or the total number of CPUs in the system, with a default fallback to 1 if all else fails. The parallel task execution functionality has been updated to utilize this available CPU count method to automatically determine the number of threads to use when running tasks concurrently, unless a specific thread count is manually specified, allowing for more accurate and flexible concurrent processing, particularly in containerized environments with imposed CPU quotas.
Improve support for reading text files that contain a Unicode BOM at the start (#243). The library's text file reading functionality has been enhanced to handle Unicode Byte Order Mark (BOM) markers at the start of files, providing better support for reading local and Workspace files. New methods have been introduced to detect and handle BOM markers, including _detect_encoding_bom and decode_with_bom, which enable accurate detection of the encoding and decoding of text files. Additionally, the _read_text_from_binary_io and read_text methods have been added to read text from binary IO and file paths, respectively, taking into account BOM markers and non-seekable files. The existing open method has been updated to utilize the decode_with_bom method when opening files in text mode, allowing for improved handling of BOM markers and non-seekable files. The read_text function can handle various BOMs, including UTF-8, UTF-16 LE, UTF-16 BE, UTF-32 LE, and UTF-32 BE, and correctly decodes the text, while also supporting sized reads and raising a ValueError for non-seekable files when a size is specified.

* Expose the number of available CPUs for concurrent processing ([#244](#244)). The library now provides a method to determine the available CPU count for concurrent processing, which is used to optimize parallel task execution. This method attempts to obtain the number of logical CPUs available for the process through various approaches, including using the `os` module's `process_cpu_count` attribute, the `sched_getaffinity` function on Linux systems, or the total number of CPUs in the system, with a default fallback to 1 if all else fails. The parallel task execution functionality has been updated to utilize this available CPU count method to automatically determine the number of threads to use when running tasks concurrently, unless a specific thread count is manually specified, allowing for more accurate and flexible concurrent processing, particularly in containerized environments with imposed CPU quotas. * Improve support for reading text files that contain a Unicode BOM at the start ([#243](#243)). The library's text file reading functionality has been enhanced to handle Unicode Byte Order Mark (BOM) markers at the start of files, providing better support for reading local and Workspace files. New methods have been introduced to detect and handle BOM markers, including `_detect_encoding_bom` and `decode_with_bom`, which enable accurate detection of the encoding and decoding of text files. Additionally, the `_read_text_from_binary_io` and `read_text` methods have been added to read text from binary IO and file paths, respectively, taking into account BOM markers and non-seekable files. The existing `open` method has been updated to utilize the `decode_with_bom` method when opening files in text mode, allowing for improved handling of BOM markers and non-seekable files. The `read_text` function can handle various BOMs, including UTF-8, UTF-16 LE, UTF-16 BE, UTF-32 LE, and UTF-32 BE, and correctly decodes the text, while also supporting sized reads and raising a `ValueError` for non-seekable files when a size is specified.

github-actions · 2025-06-23T16:06:39Z

✅ 40/40 passed, 1 flaky, 2 skipped, 2m32s total

Flaky tests:

🤪 test_upgrades_works (10.012s)

_{Running from acceptance #325}

The 26.30.0 release of sqlglot introduced a breaking change that affects our tests; this PR is a quick-fix to prevent that version from being used. This PR also includes type-hinting fixes that newer versions of mypy need, along with accompanying fixes for issues that the improved type-hints expose. For now this is intended to: - Unblock databrickslabs/blueprint#248. - Supersede #409.

sundarshankar89 requested a review from nfx as a code owner June 23, 2025 16:05

sundarshankar89 temporarily deployed to runtime June 23, 2025 16:05 — with GitHub Actions Inactive

gueniai approved these changes Jun 23, 2025

View reviewed changes

asnare approved these changes Jun 23, 2025

View reviewed changes

sundarshankar89 added the pr/do-not-merge label Jun 23, 2025

This was referenced Jun 24, 2025

Update sqlglot dependency databrickslabs/lsql#409

Closed

Limit sqlglot to releases earlier than 26.30.0. databrickslabs/lsql#410

Merged

sundarshankar89 closed this Jun 25, 2025

sundarshankar89 deleted the prepare/0.11.1 branch June 25, 2025 08:40

asnare removed the request for review from nfx June 25, 2025 08:41

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Release v0.11.1 #248

Release v0.11.1 #248

Uh oh!

sundarshankar89 commented Jun 23, 2025

Uh oh!

github-actions bot commented Jun 23, 2025

Uh oh!

Uh oh!

Release v0.11.1 #248

Release v0.11.1 #248

Uh oh!

Conversation

sundarshankar89 commented Jun 23, 2025

Uh oh!

github-actions bot commented Jun 23, 2025

Uh oh!

Uh oh!