-
Notifications
You must be signed in to change notification settings - Fork 151
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[Bug] spark executor task error when reading shuffle data when using java open jdk11 #2082
Comments
@rickyma Do you have any suggestion? |
I've no idea. Could you please test this case using JDK 11? @maobaolong |
@rickyma We use jdk8 for all RSS cluster and client, so we did not encounter this issue for production env. But I did a test on JDK11 just konw, this issue reproduced.
val data = sc.parallelize(Seq(("A", 1), ("B", 2), ("C", 3), ("A", 4), ("B", 5), ("A", 6), ("A", 7),("A", 7), ("A", 7), ("A", 7), ("A", 7), ("A", 7), ("A", 7), ("A", 7), ("A", 7), ("A", 7), ("A", 7), ("A", 7), ("A", 7)));
val result = data.reduceByKey(_ + _);
result.collect().foreach(println);
System.exit(0); |
Could you help to solve this? It will block people from using JDK 11. @maobaolong |
@rickyma @ChenRussell JDK11 cannot works with uniffle client for now. |
Do you mean JDK11? |
Sorry for the mistake, yeah, i mean JDK11, I got this conclusion from the community meeting. |
@advancedxy @LuciferYang Do you have any suggestion about this issue? |
Based on the stack trace, It seems that RSS is releasing a slice or a duplicate, which might indicate a bigger problem and needs some further investigation. |
I've been running into the same issue.
Not sure how to check for memory leaks right now though. |
@ChenRussell Have you tried |
Yes, I have tried, that's not working, it's not the accessible problem, it can access the reflection method |
@ChenRussell Got it ~ Is there an existing unit test (UT) that can reproduce this issue? Java 8 uses |
You can use this #2082 (comment) to reproduce this issue. |
agree. The root cause is that ByteBuffer#attachment is unexpectedly null. |
Can we assess if |
Similar problems arise here, such as ARTEMIS-2131 2378 WFLY-11026 , We can refer to it |
It seems to be caused by this code, a result byteBuf will be a silced ByteBuf, but a silced Buffer incubator-uniffle/common/src/main/java/org/apache/uniffle/common/netty/TransportFrameDecoder.java Line 179 in b7c9ca5
|
Maybe we could remove
Because we have released the ByteBuf
cc @rickyma @maobaolong @leixm WDYT? |
Code of Conduct
Search before asking
Describe the bug
I use openjdk 11 in spark image, and I get errors when spark task reading shuffle data from uniffle server, here is the executor task error log:
Affects Version(s)
0.9.0
Uniffle Server Log Output
Uniffle Engine Log Output
Uniffle Server Configurations
rss.rpc.server.type GRPC_NETTY ...
Uniffle Engine Configurations
Additional context
No response
Are you willing to submit PR?
The text was updated successfully, but these errors were encountered: