-
Notifications
You must be signed in to change notification settings - Fork 7
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
groupByInterval not working [Spark: 2.4.4] #5
Comments
Troubleshooting suggests, that this has to do with the non-supported spark version and wrong build in which the Intervalize class is missing. However, could you please document the necessary code changes of ts-flint to build a databricks version. Thanks. |
I managed to build it for the current databricks runtimes (5.3 and above; tested with 6.1)
export TERM=xterm-color
sbt assemblyNoTest |
PR: #6 |
Two joint dfs should be grouped by a third (clock) df.
But Error is thrown:
java.lang.NoClassDefFoundError: Could not initialize class com.twosigma.flint.rdd.function.group.Intervalize$
Steps to reproduce
data_joined.show()
+--------------------+---+----+-----+-----+
| time|F_1| F.2|types|state|
+--------------------+---+----+-----+-----+
| 2018-12-01 14:00:00.000| 22| 7.3| 0|False|
|2018-12-01 14:00:00.100| 38|71.3| 0|False|
|2018-12-01 14:00:00.260| 26| 7.9| 42| True|
+--------------------+---+----+-----+-----+
The text was updated successfully, but these errors were encountered: