Add `TextualEnum` `newtype` wrapper #186

mjgpy3 · 2024-07-29T20:45:57Z

This is an experiment that I'm putting out for review to see what the team thinks. I won't feel bad if we don't like it.

What is it?

TextualEnum is a newtype wrapper meant to provide a lot of boilerplate serialization/deserialization instances that we keep defining and redefining as we introduce new types and services.

How's it work?

The core idea is that if you can give your type an Enum instance, a Bounded instance, and a way to convert to text, then that should be enough to provide string-type (i.e. textual) representations of it.

So toText tells us how to serialize the value, and we can look through the Bounded/Enum instance values to parse it "from text".

toText is preferable from fromText because, for bounded/enums, totality is straightforward to check/implement and further enforced if cases are updated/removed/added.

EnumName

I've also included EnumName because we've started introducing codec/schema definitions that can be named.

Example

You can see the specs (PrimaryColor) for an example.

chris-martin · 2024-07-29T20:52:56Z

freckle-app/library/Freckle/App/TextualEnum.hs

+instance (Bounded a, Enum a, EnumValue a, Eq a) => HasCodec (TextualEnum a) where
+  codec = stringConstCodec $ (id &&& (toText . enumValue)) <$> enums @a


Can you actually derive ~~instance this~~ this instance via EnumValue? As we just happened to be discussing a minute ago there seems to be some problem with deriving HasCodec.

Not sure I follow what you're asking. Are you asking if this instance is broken or if there's a simpler way to derive it?

Referring to what we mentioned in backend guild today that HasCodec for whatever reason isn't a class that works with deriving via. Just wondering how much that limits the utility of this instance.

I think he's askjing if you can remove this explicit instance and use deriving via to achieve this

Oh I follow now.

I'm saying I know you cannot use deriving via to achieve this, so I don't understand what purpose the instance serves.

freckle-app/package.yaml

chris-martin · 2024-07-29T21:15:40Z

freckle-app/library/Freckle/App/TextualEnum.hs

+class EnumValue a where
+  -- | Convert a 'TextualEnum' to 'Text'
+  toText :: a -> Text


A thought -

data TextMapping a = TextMapping !(a -> Text) !(Text -> Maybe a) class EnumValue a where textMapping :: TextMapping a enumBoundedTextMapping :: (Enum a, Bounded a) => (a -> Text) -> TextMapping a enumBoundedTextMapping f = let !m = Map.fromList $ (id &&& f) <$> [minBound..maxBound] in TextMapping f (flip (Map.lookup m))

data ContentArea = Math | Ela deriving (...) via TextualMapping ContentArea instance EnumValue ContentArea where textMapping = enumBoundedTextMapping $ \case Math -> "math" Ela -> "ela"

This approach does two things -

Allows the user options other than enum/bounded to enumerate if they want

Allows a logarithmic rather than linear lookup, using a Map which is constructed only once (not 100% sure if that would require explicitly lifting each textMapping definition to the top level or not, would want a benchmark to test.)

I thought about doing something like this but I came to the conclusion that basically

not 100% sure if that would require explicitly lifting each textMapping definition to the top level or not

was kind of hazy (i.e. I'm unsure exactly how to get GHC to compute it once) and thus prone to being incorrectly done.

I'm also not convinced that a hash map is actually quicker for smaller enums. Hashing a value vs. looking through like 4-13 cases feels like a wash. I could be totally wrong here as I have no data to back this up but the tradeoffs don't feel worth it at first blush.

personally I greatly prefer using deriving via instead of wrapping a bunch of types with a utility type; I'm fine with wrapping in a utility type for calculation purposes, but it gets really stinky when someone does e.g.

data Person = { age :: Sum Int } totalAge :: [Person] -> Int totalAge = getSum . fold . age

instead of, like:

data Person = { age :: Int } totalAge :: [Person] -> Int totalAge = getSum . fold <$> Sum . age -- which would let you then do youngestAge :: [Person] -> Int youngestAge = getMin . fold <$> Min . age

.. tho I admit ive seen it in the our codebase a few times, and that this is a personal preference that I can't justify beyond "confuses the data model" and "at what point do you stop adding utility wrappers"

Tho, lol, I guess this utility wrapper thing is kinda sorta analogous to monad transformer stacks, eh?

I don't really have an opinion on the linear vs log search. If the N is small it doesn't matter much. I guess if someone did the following, it would matter a lot:

newtype Age = Age Int deriving (...) via TextualMapping Age instance EnumValue ContentArea where textMapping = enumBoundedTextMapping show main = do putStrLn $ show $ Age maxBound

but idk this seems like quite the goofy thing to do, and hey, I notice that you address it. If you do it via mapping, and someone were to enumerate the set of Ints, you'd end up eating up so much memory... so idk.

you could make a way for the user to use a second function tho so a body could pick, e.g. enumBoundedTextMapping and enumBoundedTextMappingMemo or something

greatly prefer using deriving via instead of wrapping a bunch of types with a utility type

Agreed and I'm wondering if maybe the spec here would be a good place to demonstrate using TextualEnum to define instances directly for the PrimaryColor type.

joelmccracken

lgtm but i'd like to resolve the "how this is to be used" (hopefully, not wrapping a bunch of domain types, but deriving via/newtype)

joelmccracken · 2024-07-30T18:47:37Z

freckle-app/tests/Freckle/App/TextualEnumSpec.hs

+    via TextualEnum PrimaryColor
+
+instance EnumValue PrimaryColor where
+  toText = \case


What about an example showing using https://hackage.haskell.org/package/base-4.20.0.1/docs/Data-Data.html#v:toConstr which would help DRY/standardize/make one less spot to forget to update when adding an enum value

I'm 👎 in general on things that couple strings to Haskell identifiers, because it makes the code less robust against refactoring, leads to compromising what strings should be to deal with Haskell restrictions (e.g. wanting an enum string to be "type"), and makes it harder to search the code when you're expecting to be able to find a string literal appearing somewhere but it's actually derived from a Haskell identifier instead -- But sometimes I guess it's appropriate, and TIL about toConstr, so wouldn't argue against more docs.

🤷 I get that pov. I do go back and forth. so much is automatically generated, and depending upon tradeoffs, it becomes unclear when specifically someone would be searching for a string vs somewhere else. But yea if you do autogen these, it really important that someone be able to decouple these when its needed, and sometimes stuff don't have "i need to roll my own" docs.

either way perhaps its worth documenting/commenting/illustrating using it? i'm good with either way.

chris-martin · 2024-07-30T19:10:59Z

freckle-app/library/Freckle/App/TextualEnum.hs

+class EnumName a where
+  -- | Name of a 'TextualEnum', used for naming schemas
+  enumName :: Proxy a -> Text


How about making this derivable?

newtype DataName a = DataName a instance Data a => EnumName (DataName a) where enumName _ = _ -- Some sort of reflection? data MyThing = ... deriving EnumName via DataName MyThing

chris-martin

A few ideas for additional features floating around but I think this will be useful.

mjgpy3 · 2024-09-26T19:32:21Z

@joelmccracken re-requesting a review on this. I think I've covered your concerns but it's been a while so just wanted to make sure (sorry, I went on vacation shortly after opening this so I forgot about it).

joelmccracken

looks fine, honestly I barely remember any of this lol

This is an experiment that I'm putting out for review to see what the team thinks. I won't feel bad if we don't like it. **What is it?** `TextualEnum` is a `newtype` wrapper meant to provide a lot of boilerplate serialization/deserialization instances that we keep defining and redefining as we introduce new types and services. **How's it work?** The core idea is that if you can give your type an `Enum` instance, a `Bounded` instance, and a way to convert to text, then that should be enough to provide string-type (i.e. textual) representations of it. So `toText` tells us how to serialize the value, and we can look through the `Bounded`/`Enum` instance values to parse it "from text". `toText` is preferable from `fromText` because, for bounded/enums, totality is straightforward. **EnumName** I've also included `EnumName` because we've started introducing codec/schema definitions that can be named. **Example** You can see the specs (`PrimaryColor`) for an example. **Benefits** This should reduce a great deal of boilerplate, and, if used, will reduce a lot of ad-hoc-ry that exists across our in-the-wild definitions.

mjgpy3 requested review from a team and jason-lieb and removed request for a team July 29, 2024 20:45

mjgpy3 force-pushed the gilli/textual-enum branch from 4d1eb1b to 1592ef8 Compare July 29, 2024 20:48

chris-martin reviewed Jul 29, 2024

View reviewed changes

freckle-app/package.yaml Show resolved Hide resolved

chris-martin reviewed Jul 29, 2024

View reviewed changes

joelmccracken requested changes Jul 30, 2024

View reviewed changes

joelmccracken reviewed Jul 30, 2024

View reviewed changes

chris-martin reviewed Jul 30, 2024

View reviewed changes

chris-martin approved these changes Jul 30, 2024

View reviewed changes

jason-lieb removed their request for review August 2, 2024 17:50

mjgpy3 requested a review from joelmccracken September 26, 2024 17:12

joelmccracken approved these changes Oct 2, 2024

View reviewed changes

mjgpy3 force-pushed the gilli/textual-enum branch from 53304e5 to d11ce52 Compare October 2, 2024 20:52

mjgpy3 force-pushed the gilli/textual-enum branch from d11ce52 to 1ac1ae7 Compare October 2, 2024 21:00

mjgpy3 merged commit 5dc71f4 into main Oct 4, 2024
6 checks passed

mjgpy3 deleted the gilli/textual-enum branch October 4, 2024 13:13

chris-martin mentioned this pull request Oct 4, 2024

TextualEnum: changelog and release #204

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add `TextualEnum` `newtype` wrapper #186

Add `TextualEnum` `newtype` wrapper #186

mjgpy3 commented Jul 29, 2024 •

edited

Loading

chris-martin Jul 29, 2024 •

edited

Loading

mjgpy3 Jul 30, 2024

chris-martin Jul 30, 2024

joelmccracken Jul 30, 2024

mjgpy3 Jul 30, 2024

chris-martin Jul 30, 2024

chris-martin Jul 29, 2024 •

edited

Loading

mjgpy3 Jul 30, 2024

joelmccracken Jul 30, 2024

chris-martin Jul 30, 2024 •

edited

Loading

joelmccracken left a comment

joelmccracken Jul 30, 2024

chris-martin Jul 30, 2024

joelmccracken Jul 30, 2024

chris-martin Jul 30, 2024

chris-martin left a comment

mjgpy3 commented Sep 26, 2024

joelmccracken left a comment

		instance (Bounded a, Enum a, EnumValue a, Eq a) => HasCodec (TextualEnum a) where
		codec = stringConstCodec $ (id &&& (toText . enumValue)) <$> enums @a

Add TextualEnum newtype wrapper #186

Add TextualEnum newtype wrapper #186

Conversation

mjgpy3 commented Jul 29, 2024 • edited Loading

chris-martin Jul 29, 2024 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

chris-martin Jul 29, 2024 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

chris-martin Jul 30, 2024 • edited Loading

Choose a reason for hiding this comment

joelmccracken left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

chris-martin left a comment

Choose a reason for hiding this comment

mjgpy3 commented Sep 26, 2024

joelmccracken left a comment

Choose a reason for hiding this comment

Add `TextualEnum` `newtype` wrapper #186

Add `TextualEnum` `newtype` wrapper #186

mjgpy3 commented Jul 29, 2024 •

edited

Loading

chris-martin Jul 29, 2024 •

edited

Loading

chris-martin Jul 29, 2024 •

edited

Loading

chris-martin Jul 30, 2024 •

edited

Loading