-
Notifications
You must be signed in to change notification settings - Fork 138
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
How to understand the spark-lens statistics. Is there any wiki? #50
Comments
update: I was able to find some explanation about the reports on (https://www.qubole.com/blog/introducing-sparklens-report/). This is fine. If you take a look at the generated report, there are some values in negative (eg: under "efficiency statistics" > "Driver vs Executor Wallclock Time"). We are not able to understand why the values are negative and regarding this - there is no mention on the said blog site. |
Hello - I am following up on my question. Thanks. |
Hi, I am seeking help on the generated report. Please advise. |
Hi, @normalscene The negative values of the driver/executor wall-clock time might be an indication of event loss while running the spark application. In the newer releases of the Sparklens, we have handled the event loss gracefully. Were you using the older version while generating the sparklens.json? If so, I would suggest you try with the latest version. If you are already using the latest version, can you please try to generate the report once via the command line instead of using sparklens.qubole.com. This will help us narrow down the issue to sparklens reporting service or sparklens core. You can find the steps to generate the sparklens command-line report in the ReadMe. |
Hello @mayurdb The current version being used is as follows:
I believe this is the latest version as I took it recently (less than 2 weeks). Please correct me if I am incorrect here. Unfortunately I am having difficulty spotting the command line reporting steps. Could you kindly just point me towards the section no. ? Just for record sake, here is my command which I am using to submit my job:
|
With the above command itself, you should be able to see the report in the driver logs after application completion. |
Oh! Understood. I will have to re-run the job (it takes around 75 minutes) and I will get back after that. thanks for your prompt reply. Regards, |
To get more accurate results, before you re-run see if there was an event loss in the previous runs. grep for key-word "Dropped" in the driver logs. If there was an event loss, try increasing the event queue capacity to a higher value and also increase the driver's memory. This is the conf: spark.scheduler.listenerbus.eventqueue.capacity |
I have re-run the job (but this time, took a smaller data sample to make the processing a little quick compared to earlier one, which took around 75 minutes). JSON reporting & command-line statistics - both were enabled. Here are my observations:
Please advise which logs can I share with you to help investigate this. Edit: Does this seem like a bug ? Please confirm. Links to generated reports:http://sparklens.qubole.com/report_view/0cfede16246e4423c402 (without drop queue setting) |
@normalscene |
Could you please point me towards the "event log file". I believe I am not aware of its whereabouts. Cheers, |
Your will need to add the following config options to your spark-submit. After the application is complete, event log file will be available in the spark.eventLog.dir |
Thank you. I will try this out and get back with results shortly. |
I gave it a shot and unfortunately it didn't generate any logs :(. Allow me to share the command I used just to double check whether I fired the right command or not. Have I missed something?
|
I have captured the sparklens command-line output. It doesn't give us any indication whether if the event log capture was invoked or not. |
@normalscene Not really much aware of the internal workings of dataproc. If you have access to history server, newer versions allow downloading this file directly from the history server UI. |
Hi @mayurdb / @iamrohit / @rishitesh
We have generated our application's performance report but we are having trouble understanding the output report (html). We have generated the JSON file and we uploaded it to your reporting service which generated a nice HTML report (http://sparklens.qubole.com/report_view/616b961beb11dc0298b1)
Could you please direct us to any type of documentation which explains how to interpret the output HTML report ? Thank you in advance.
Gaurav
The text was updated successfully, but these errors were encountered: