-
Notifications
You must be signed in to change notification settings - Fork 856
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
How and where to change the spark history server port number in dr-elephant #686
Comments
Hi @selvamraman, @selvamraman if possible can you provide some insight regarding your usage of Dr.Elephant, like
This information would be helpful for us in providing a resolution to your issues. |
AWS EMR Cluster details: FetcherConf.xml:::
|
@selvamraman hope the reply resolved your issue, if so then can I close this issue? |
@ShubhamGupta29 05-05-2020 19:30:53 ERROR [ForkJoinPool-1-worker-5] com.linkedin.drelephant.spark.fetchers.SparkRestClient : error reading applicationInfo http://dns_name:18480/api/v1/applications/application_1588700272512_0001. Exception Message = java.net.SocketException: Unexpected end of file from server The spark history server address is not listening http and it is https - how to include https for history server in the config? Also, what are the environment variable do i need to setup before start the EMR?
|
Hi @selvamramam, Environment Variable needed before setup:
|
I will let you know the final result. Thanks you so much @ShubhamGupta29 your prompt response. |
Normally, the Hadoop https address and its port number are predefined in Hadoop config. So, for your Dr.Elephant, Just modify Fetchers source code to grab the correct config name, and let it use https head (if SSL is enabled) would be good. I am having a similar issue. My cluster only uses an SSL connection for some reason. And my Hadoop's config set the P.S. However, in rare cases, if you got some errors like this when starting. javax.net.ssl.SSLHandshakeException: java.security.cert.CertificateException: No name matching xxx.xxx.xxx.xxx found. highly likely, It is caused by the incorrect authentication setting on the cluster. Fixing the incorrect setting is recommended, well,..., for simple test purpose you can also try add static {
//WORKAROUND. TO BE REMOVED.
javax.net.ssl.HttpsURLConnection.setDefaultHostnameVerifier(
new javax.net.ssl.HostnameVerifier(){
public boolean verify(String |hostname|,
javax.net.ssl.SSLSession sslSession) {
if (hostname.equals("mytargethostname")) {
return true;
}
return false;
}
});
} To temporarily workaround the issue. check here for details |
Spark 2.4.4 version
emr release 5.29
Due to some constraint, i can not use port number 18080 for spark history server. Spark history server points to other than 18080(example: 18480).
I am getting below error message in dr-elephant log.
04-29-2020 05:31:54 ERROR [ForkJoinPool-1-worker-29] com.linkedin.drelephant.spark.fetchers.SparkRestClient : error reading applicationInfo http://master-dns-name:18080/api/v1/applications/application_1588136450759_0001. Exception Message = java.net.SocketTimeoutException: connect timed out
I have followed this guide to install dr.elephant in EMR- https://aws.amazon.com/blogs/big-data/tune-hadoop-and-spark-performance-with-dr-elephant-and-sparklens-on-amazon-emr/
The text was updated successfully, but these errors were encountered: