Generate an output file with runtime and build information #705

Merged
merged 5 commits into from
Jan 8, 2024


Collaborator

@amahussein amahussein commented Dec 26, 2023

Signed-off-by: Ahmed Hussein (amahussein) [email protected]

Fixes #699

This PR dumps the runtime and build information of the tools jar to the output folder. Knowing which Tools version and which Spark version were used at runtime gives developers more insight into the output and the numbers generated by the Tools.

For Qualification:

  • rapids_4_spark_qualification_output/runtime.properties

For Profiling:

  • rapids_4_spark_profile/runtime.properties

A sample of the generated file is shown below. Note that:

  • runtime.spark.version is the Spark version loaded at runtime, while build.spark.version is the Spark version used to build the tools jar.

  • build.version represents the Tools jar version.

#RAPIDS Accelerator for Apache Spark's Build Information
#Sun Dec 24 17:51:25 CST 2023
build.hadoop.version=3.3.6
build.java.version=1.8.0_322
build.scala.version=2.12.15
build.spark.version=3.1.1
build.version=23.10.2-SNAPSHOT
runtime.spark.version=3.3.3
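
As an illustration of how a consumer might use this file, here is a minimal sketch that parses the sample above and reports when the Spark version used at build time differs from the one loaded at runtime. The helper names `parse_properties` and `spark_version_mismatch` are hypothetical and not part of the Tools. The parser is deliberately simplified: it assumes plain `key=value` lines and `#` or `!` comments, and does not handle escapes or the other separators allowed by the Java properties format.

```python
def parse_properties(text):
    """Parse simple key=value lines, skipping blank lines and comments."""
    props = {}
    for raw in text.splitlines():
        line = raw.strip()
        if not line or line[0] in "#!":
            continue  # blank line or comment header
        key, sep, value = line.partition("=")
        if sep:  # ignore lines without an '=' separator
            props[key.strip()] = value.strip()
    return props


def spark_version_mismatch(props):
    """Return (build, runtime) if the two Spark versions differ, else None."""
    build = props.get("build.spark.version")
    runtime = props.get("runtime.spark.version")
    if build != runtime:
        return build, runtime
    return None


# Sample content copied from the PR description.
sample = """\
#RAPIDS Accelerator for Apache Spark's Build Information
#Sun Dec 24 17:51:25 CST 2023
build.hadoop.version=3.3.6
build.java.version=1.8.0_322
build.scala.version=2.12.15
build.spark.version=3.1.1
build.version=23.10.2-SNAPSHOT
runtime.spark.version=3.3.3
"""

props = parse_properties(sample)
print(props["build.version"])         # 23.10.2-SNAPSHOT
print(spark_version_mismatch(props))  # ('3.1.1', '3.3.3')
```

To run the same check on real output, the text could be read from `rapids_4_spark_qualification_output/runtime.properties` or `rapids_4_spark_profile/runtime.properties` instead of the inline sample.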

@amahussein
Collaborator Author

@viadea and @kuhushukla
WDYT about the list of properties in the output files? Is there any other helpful property that you think is missing?

Collaborator

@nartal1 nartal1 left a comment


Nit: Need to update year in copyright section.

@amahussein
Collaborator Author

Thanks @nartal1
I addressed the comments.

Collaborator

@nartal1 nartal1 left a comment


Thanks @amahussein !

@amahussein amahussein merged commit dedc8d7 into NVIDIA:dev Jan 8, 2024
9 checks passed
@amahussein amahussein deleted the spark-rapids-tools-699 branch January 8, 2024 18:21
Labels
core_tools: Scope the core module (scala)
feature request: New feature or request
Development

Successfully merging this pull request may close these issues.

[FEA] Generate an output file with runtime information
2 participants