Add GrayBox predictor #96
Conversation
I actually think this is super cool!
This is cool. In addition to my inline comments, it would be nice to avoid adding unnecessary additional nonlinear operators if I want to use the same NN for multiple sets of inputs. I am also not sure the full-space formulation is necessary, but I suppose it is consistent with the rest of the predictors.
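(For illustration, a minimal JuMP sketch of the reuse point above, using made-up functions rather than the predictor's internals: a single registered operator can be applied to several input vectors, so in principle the formulation only needs to register its operators once per network.)

using JuMP

model = Model()
@variable(model, x[1:2])
@variable(model, z[1:2])
f(a, b) = a^2 + b^2
op = JuMP.add_nonlinear_operator(model, 2, f; name = :op_f)
# The same registered operator can be applied to different input vectors:
y1 = op(x...)
y2 = op(z...)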
return only(Flux.outputsize(predictor, (length(x),)))
end
function with_jacobian(x)
    ret = Flux.withjacobian(x -> predictor(Float32.(x)), collect(x))
Will the Flux model always use Float32?
I think it's the default. Different precision is a bit all over the place, since if you throw x::Vector{Float64} in, it automatically changes your weights to the same type. I dislike this aspect of Flux. At the very least, let's wait until someone complains before addressing this. It works for the tests.
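(For reference, a minimal Flux sketch of the precision behavior being discussed; the chain here is illustrative. Flux initializes parameters in Float32 by default, which is why the callback above converts its Float64 input before evaluating the network.)

using Flux

chain = Flux.Chain(Flux.Dense(2 => 3, Flux.relu), Flux.Dense(3 => 1))
eltype(chain[1].weight)   # Float32: Flux's default parameter precision
x = [1.0, 2.0]            # Float64, as a solver callback would pass in
y = chain(Float32.(x))    # convert the input rather than promote the weights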
return map(1:predictor.predictor.output_size(x)) do i
    op_i = JuMP.add_nonlinear_operator(
        model,
        length(x),
        (x...) -> f(i, x...),
        (g, x...) -> ∇f(g, i, x...);
        name = Symbol("op_$(gensym())"),
    )
    return op_i(x...)
Would it be possible to add a Hessian function?
Do people compute Hessians of NNs? Or do you just want the possibility in general? Do you have an example where this is useful?
My sense is to leave as-is for the first pass. We can always add it later.
In a paper I am about to submit, we used Hessians with a PyTorch NN in an optimal control problem and saw a significant speedup. This was done with PyNumero's gray-box interface.
Can you link me to the code for getting Hessians etc. out of torch?
I'll dig it up from my former student who just graduated. In the meantime, I know that we used torch.func, which provides functions to evaluate the Jacobian and the Hessian directly: https://pytorch.org/docs/stable/func.api.html
I also found jacobian = torch.autograd.functional.jacobian(model, x). But func seems better.
We tried torch.autograd.functional, but it was quite a bit slower. Notably, we did leverage the batch abilities of torch.func to evaluate all the gradients of a NN over different sets of inputs, which probably gave torch.func an extra advantage.
Another cool thing is that this has little to no restrictions on what layers the NN can use.
Lux support is complicated, because you need to bring your own AD system. I'll leave it out for now.
As a first pass, I think this can be merged. We can come back and improve the performance, and I'll open an issue to add Hessian support.
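(For reference, JuMP.add_nonlinear_operator accepts an optional Hessian callback as a third derivative argument, which is one way Hessian support could be wired in later. A minimal sketch with an illustrative function, not the predictor's actual callbacks:)

using JuMP

model = Model()
@variable(model, x[1:2])
f(x...) = x[1]^2 + x[1] * x[2]
function ∇f(g, x...)
    g[1] = 2 * x[1] + x[2]
    g[2] = x[1]
    return
end
function ∇²f(H, x...)
    # Fill only the lower-triangular entries of the Hessian.
    H[1, 1] = 2.0
    H[2, 1] = 1.0
    H[2, 2] = 0.0
    return
end
op = JuMP.add_nonlinear_operator(model, 2, f, ∇f, ∇²f; name = :op_f)
@objective(model, Min, op(x...))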
Part of #90
@pulsipher is this what you had in mind?
I'll do something similar for Lux and PyTorch.
And I'll also sort out vector-valued outputs.