Skip to content

Conversation

jsnv-dev
Copy link

@jsnv-dev jsnv-dev commented Oct 2, 2025

Apache ORC provides both C++ and Java libraries for reading and writing ORC files, which are widely used by major data processing frameworks. Also, ORC-based formats are increasingly used to store AI training datasets at large scale.

Motivation

This integration follows the recent discovery of CVE-2025-47436, a heap buffer overflow vulnerability in the C++ LZO decompressor affecting Apache ORC versions. The vulnerability occurs when specially crafted malformed ORC files can result in memory corruption.

Continuous fuzzing through OSS-Fuzz will help identify similar input validation vulnerabilities earlier and improve the robustness of the ORC file parser.

Project Details

This initial PR includes only the project.yaml configuration. The build infrastructure will be added in a follow-up PR after your approval.

Copy link

github-actions bot commented Oct 2, 2025

jsnv-dev is integrating a new project:
- Main repo: https://github.com/apache/orc
- Criticality score: 0.57426

@jsnv-dev
Copy link
Author

Hi all,

Just wanted to follow up on this PR and see if there's any feedback or if there's anything I can do to help move this forward. Happy to make any changes needed or provide additional information.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant