Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Fix SIGSEGV when compiled with -march=znver4 #19

Closed
wants to merge 1 commit into from

Commits on Oct 18, 2023

  1. Due to unaligned allocations, library crashes in nontemporalMemcpy

    in _mm512_stream_si512 (which requires 64-aligned allocations,
    but used to copy default-aligned objects).
    
    As it is seemingly difficult to change allocations for copied
    objects (common objects with ref-counts), the fix just replaces
    nontemporalMemcpy with normal memcpy, which is already optimized
    in most versions of C runtime.
    
    Closes ROCm#18
    [email protected] committed Oct 18, 2023
    Configuration menu
    Copy the full SHA
    8913c41 View commit details
    Browse the repository at this point in the history