Implement Typed Documents and TypeRegistry #282

jterapin · 2025-03-06T21:47:57Z

Description: Implementation for Typed Documents and TypeRegistry. Currently only supports JSON documents.

It is highly likely that we have to revisit this implementation in the future since typed document/type registry still being evolved.

gems/smithy-schema/lib/smithy-schema/document.rb

gems/smithy-schema/lib/smithy-schema/type_registry.rb

gems/smithy-schema/spec/smithy-schema/document_spec.rb

gems/smithy-schema/lib/smithy-schema/document.rb

gems/smithy-schema/lib/smithy-schema/document_utils.rb

projections/shapes/lib/shapes/schema.rb

mullermp

Nice. Looking better. Still have a bunch of comments though .. In general I think we can still simplify and also be less aggressive on validation and instead be more permissive where possible.

mullermp · 2025-05-09T01:28:51Z

gems/smithy-schema/lib/smithy-schema/document.rb

+    #   shape = Smithy::Schema::StructureShape.new
+    #   data = Document::Data.new({ "name" => "example" }, shape: shape)
+    #
+    module Document


I was expecting document to be a class and have it be the delegator. What was the intention of making another data subclass?

gems/smithy-schema/lib/smithy-schema/document.rb

mullermp · 2025-05-09T01:36:53Z

gems/smithy-schema/lib/smithy-schema/type_registry.rb

+      # @param  [Hash<String, Shapes::StructureShape>] registry
+      def initialize(registry = {})
+        @registry = registry
+        @shapes_by_type = register_shape_types(registry.values)


Is there a way to populate this from the code generated side? If we must iterate shapes, we may as well backtrack and populate both maps in one pass. That at least reduces generated code. Otherwise is shapes_by_type even necessary?

mullermp · 2025-05-09T01:37:19Z

gems/smithy-schema/lib/smithy-schema/type_registry.rb

+
+      # @api private
+      # @return [Hash<String, Shapes::StructureShape>]
+      attr_accessor :registry


Both of these accessors shouldn't exist. Our public methods should hide this detail.

mullermp · 2025-05-09T02:37:52Z

gems/smithy-schema/lib/smithy-schema/document/serializer.rb

+        end
+
+        def typed_document?(values)
+          (values.is_a?(Smithy::Schema::Structure) && @type_registry.shape_by_type(values.class)) ||


Isn't this already checked? And wouldn't this always be true if it was a structure, because it would already be registered?

mullermp · 2025-05-09T02:39:34Z

gems/smithy-schema/lib/smithy-schema/document/serializer.rb

+          ref.shape.member(name) || find_member_ref_by_names(ref, name)
+        end
+
+        def find_member_ref_by_names(ref, name)


This seems inefficient. Similar to what we do in codecs, for structure and union, you will want to iterate the shape members and not the values, then you can check json name that way. You're doing a loop for every member, so it's n^2 performance.

mullermp · 2025-05-09T02:39:59Z

gems/smithy-schema/lib/smithy-schema/document/serializer.rb

+          end
+        end
+
+        def resolve_member_name(member_ref, opts)


Check out the location_name approach in my PR - we should use similar terms. You can easily handle this with || optionality.

gems/smithy-schema/lib/smithy-schema/document/deserializer.rb

mullermp · 2025-05-09T02:49:47Z

gems/smithy-schema/spec/smithy-schema/document/serializer_spec.rb

+      describe Serializer do
+        let(:shapes) do
+          shapes = SchemaHelper.sample_shapes
+          shapes['smithy.ruby.tests#Structure']['members']['timestampDateTime'] = {


I would prefer if you move these definitions closer to the test (in the actual tests where they are needed) - it's easier to manage tests that way if they are discrete.

alextwoods

Nice - its generally looking good.
I think the functionality from the Document::Data class could be moved into Document as a class (unless theres some reason I'm missing). I also understand why the Document serializer and deserializer exist separately and require a type registery - but I think I would lean towards the public interface for serializing/deserializing documents living on the top level class - it could still require a type registry to be provided and could use these classes under the hood to implement it (and they could then be api private).

mullermp · 2025-05-09T22:52:55Z

That's effectively what I was also saying but I agree.

jterapin added 3 commits March 6, 2025 12:12

Add type registry prototype class

d33fcdf

Add type registry to codegenerated schema

5ff40cc

Update projections

66a6285

jterapin changed the title ~~Implement Typed Documents and TypeRegistry~~ [WIP] Implement Typed Documents and TypeRegistry Mar 6, 2025

jterapin added 24 commits March 7, 2025 13:39

Merge branch 'decaf' into typed_documents

c5e45ed

Merge branch 'decaf' into typed_documents

2bd15a2

Merge branch 'decaf' into typed_documents

03bfa82

Update requires

e6435d5

Add initial document implementation

0830827

Merge decaf into branch

877654f

Update to include cbor

a61318f

Expand on typed docs

4edfae3

Update file names

ff959f1

Merge branch 'decaf' into typed_documents

8b9b560

More refactoring

598db66

Merge branch 'decaf' into typed_documents

3a4c0d1

Remove scratches

269b2b5

Fix rubocop

90c58ce

Clean up document

a1e46cc

Clean document specs

8b666cd

Update TypeRegistry

2ddf4bd

Add documentation

112ddf4

Add TypeRegistry specs

6283813

Merge branch 'decaf' into typed_documents

efbfa5e

Add TypeRegistry tests

88ff845

Update projections

66b2cde

Update syntax

22998a0

Update projections

9afeacd

jterapin changed the title ~~[WIP] Implement Typed Documents and TypeRegistry~~ Implement Typed Documents and TypeRegistry Apr 15, 2025

jterapin commented Apr 15, 2025

View reviewed changes

jterapin added 27 commits April 28, 2025 08:42

Update example

b67cf54

Merge decaf into branch

ea9b380

Remove reference to type registry from client

381602b

Update projections

6d41f64

Update projections

83fa2bb

Document now inherits SimpleDelegator

b23a2e9

Expand on type registry docs

306a97e

Fix bug in timehelper

8ef3055

Slim down the sample shapes

a297963

Update docs

8e2d322

Merge decaf into branch

532ba8a

Only add structures to type registry

cb66cec

Update TypeRegistry to limit to StructureShape

9fa984c

Document revamp

166c8b1

Rename document test cases

9cef7ee

Merge branch 'decaf' into typed_documents

d01a780

Improve Document Deserializer

e283281

Update Document Serializer and its specs

a48f7a1

Update Document Data class

1913938

Remove TimeHelper

8cc697c

Require delegate

75141d1

Fix relative ordering

d36a70b

Fix type registry test

b2a6998

Remove unnecessary shape

9219b13

Rubocop fix

1f0b771

More changes

74450a1

Add docs

68c94c1

mullermp reviewed May 9, 2025

View reviewed changes

alextwoods reviewed May 9, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Implement Typed Documents and TypeRegistry #282

Implement Typed Documents and TypeRegistry #282

jterapin commented Mar 6, 2025 •

edited

Loading

mullermp left a comment

mullermp May 9, 2025

mullermp May 9, 2025

mullermp May 9, 2025

mullermp May 9, 2025

mullermp May 9, 2025

mullermp May 9, 2025

mullermp May 9, 2025

alextwoods left a comment

mullermp commented May 9, 2025

Implement Typed Documents and TypeRegistry #282

Are you sure you want to change the base?

Implement Typed Documents and TypeRegistry #282

Conversation

jterapin commented Mar 6, 2025 • edited Loading

mullermp left a comment

Choose a reason for hiding this comment

mullermp May 9, 2025

Choose a reason for hiding this comment

mullermp May 9, 2025

Choose a reason for hiding this comment

mullermp May 9, 2025

Choose a reason for hiding this comment

mullermp May 9, 2025

Choose a reason for hiding this comment

mullermp May 9, 2025

Choose a reason for hiding this comment

mullermp May 9, 2025

Choose a reason for hiding this comment

mullermp May 9, 2025

Choose a reason for hiding this comment

alextwoods left a comment

Choose a reason for hiding this comment

mullermp commented May 9, 2025

jterapin commented Mar 6, 2025 •

edited

Loading