Generate fields for markers in REPL so compiler plugin can extract the schema #1154

koperagen · 2025-04-28T12:20:37Z

without this change plugin cannot see schemas from previous cells, generated by REPL:

val df = DataFrame.read("...")
%%
val df1 = df.add("a") { 42 } 
// only df1.a is available here, because from plugin POV df has empty schema

After this PR generated markers (DataFrame<T>) will have unified structure, so schema can be extracted from both REPL and plugin generated ones

…e schema

Jolanrensen · 2025-04-29T10:44:57Z

core/src/main/kotlin/org/jetbrains/kotlinx/dataframe/schema/DataFrameSchema.kt

@@ -4,5 +4,9 @@ public interface DataFrameSchema {

    public val columns: Map<String, ColumnSchema>

-    public fun compare(other: DataFrameSchema): CompareResult
+    /**
+     * By default generated markers for leafs aren't used as supertypes: @DataSchema(isOpen = false)


I don't think we use the word "leaf" for DataFrames. Value/Frame columns are used for this purpose I believe.

"strictlyEqualValueCols" maybe?

Yeah, maybe "nested schemas" is a better wording, idk. By leaf here i meant.

interface A { val col: Int val nested: B <== Leaf val nestedFrame: List<B> <== Leaf }

What do you think?

so you actually meant "node"? because the schema continues downwards in a different schema? (B is a schema too right?) a "leaf" is normally a node with no children
Yes, then nested schema is better definitely

I see here the gravitation of Graph Theory with presenting dataframe as a tree.
I am fine with both schema - oriented and graph theory oriented wording, but... have a guess that our users closer to the hierarchical schemas wording

Generate fields for markers in REPL so compiler plugin can extract th…

d48cdd9

…e schema

koperagen requested a review from zaleslaw April 28, 2025 12:20

koperagen self-assigned this Apr 28, 2025

Jolanrensen reviewed Apr 29, 2025

View reviewed changes

Jolanrensen approved these changes Apr 29, 2025

View reviewed changes

koperagen added this to the 1.0.0-Beta1 (0.16) milestone Apr 29, 2025

koperagen added the Compiler plugin Anything related to the DataFrame Compiler Plugin label Apr 29, 2025

zaleslaw approved these changes Apr 30, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Generate fields for markers in REPL so compiler plugin can extract the schema #1154

Generate fields for markers in REPL so compiler plugin can extract the schema #1154

koperagen commented Apr 28, 2025

Jolanrensen Apr 29, 2025

Jolanrensen Apr 29, 2025

koperagen Apr 29, 2025 •

edited

Loading

Jolanrensen Apr 29, 2025 •

edited

Loading

zaleslaw Apr 30, 2025

Generate fields for markers in REPL so compiler plugin can extract the schema #1154

Are you sure you want to change the base?

Generate fields for markers in REPL so compiler plugin can extract the schema #1154

Conversation

koperagen commented Apr 28, 2025

Jolanrensen Apr 29, 2025

Choose a reason for hiding this comment

Jolanrensen Apr 29, 2025

Choose a reason for hiding this comment

koperagen Apr 29, 2025 • edited Loading

Choose a reason for hiding this comment

Jolanrensen Apr 29, 2025 • edited Loading

Choose a reason for hiding this comment

zaleslaw Apr 30, 2025

Choose a reason for hiding this comment

koperagen Apr 29, 2025 •

edited

Loading

Jolanrensen Apr 29, 2025 •

edited

Loading