Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Gentoo patch - optional RPATH setting #1

Open
wants to merge 1 commit into
base: master
Choose a base branch
from

Conversation

epsilon-0
Copy link

This makes RPATH setting optional as the location of libraries is not fixed and can be moved around.

It should have no effect on normal installation but is useful in many cases

Thanks,
Aisha

@epsilon-0 epsilon-0 changed the title Gentoo patche - optional RPATH setting Gentoo patch - optional RPATH setting Oct 8, 2020
@epsilon-0
Copy link
Author

Also fixes a small typo which causes the enable_aocl_zen variable to be seen as a command.

@dzambare
Copy link

dzambare commented Oct 9, 2020

Thanks for the pull request.

Any particular reason why LD_LIBRARY_PATH can't be used in this case?

@epsilon-0
Copy link
Author

There are a lot of users on the cluster.
Setting a global LD_LIBRARY_PATH is not possible as it will break whatever their personal local setups are.
Sending each of them a notification to update their LD_LIBRARY_PATH is also not possible.

dzambare pushed a commit that referenced this pull request Aug 28, 2021
- x7, x8: Used to store address for Alpha and Beta.
  As Alpha & Beta was not used in k-loops, use x0, x1 to load
  Alpha & Beta's addresses after k-loops are completed, since A & B's
  addresses are no longer needed there.
  This "ldr [addr]; -> ldr val, [addr]" would not cause much performance
  drawback since it is done outside k-loops and there are plenty of
  instructions between Alpha & Beta's loading and usage.
- x9: Used to store cs_c. x9 is multiplied by 8 into x10 and not used
  any longer. Directly loading cs_c and into x10 and scale by 8 spares
  x9 straightforwardly.
- x11, x12: Not used at all. Simply remove from clobber list.
- x13: Alike x9, loaded and scaled by 8 into x14, except that x13 is
  also used in a conditional branch so that "cmp x13, #1" needs to be
  modified into "cmp x14, #8" to completely free x13.
- x3, x4: Used to store next_a & next_b. Untouched in k-loops. Load
  these addresses into x0 and x1 after Alpha & Beta are both loaded,
  since then neigher address of A/B nor address of Alpha/Beta is needed.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants