Feature/issue 202 vectorize all #276

rayleigh · 2016-04-01T16:45:37Z

Submisison Checklist

Run unit tests: ./runTests.py test/unit
Run cpplint: make cpplint
Declare copyright holder and open-source license: see below

Summary:

Vectorization of unary functions and tests of these vectorizations

Intended Effect:

This will add vectorized unary functions and their tests.

How to Verify:

Run unit tests

Side Effects:

Updated the existing testing vectorizing framework to skip tests if the expected value and test value are both NaN. The example function foo_fun now returns NaN for 0.

Added the appropriate apply_scalar_unary function to stan/math/prim/mat.hpp, stan/math/rev/mat.hpp, and stan/math/fwd/mat.hpp. All tests in stan/math/mix/scal/fun and test/unit/math/mix/scal/prob/normal_test.cpp were updated to include stan/math/mix/mat.hpp so that they wouldn't error out.

Documentation:

I have documented the code of the vectorized functions, but I don't think it's fully user-facing yet.

Reviewer Suggestions:

Copyright and Licensing

Please list the copyright holder for the work you are submitting (this will be you or your assignee, such as a university or company):
Rayleigh

By submitting this pull request, the copyright holder is agreeing to license the submitted work under the following licenses:

Code: BSD 3-clause (https://opensource.org/licenses/BSD-3-Clause)
Documentation: CC-BY 4.0 (https://creativecommons.org/licenses/by/4.0/)

…re/issue-202-vectorize-all

…stan-dev/math into feature/issue-202-vectorize-all

…re/issue-202-vectorize-all

bob-carpenter · 2016-04-01T16:59:07Z

That submissions checklist is supposed to be the pre-submission checklist!

bob-carpenter · 2016-04-01T16:59:47Z

And it needs the model tests. The tests aren't even complete yet. Maybe you want to remove this pull request until it's complete.

syclik · 2016-04-01T17:04:30Z

The model tests will need to be in the Stan library.

(Bob, this is related to how we deal with a new feature across repos.)

On Fri, Apr 1, 2016 at 12:59 PM, Bob Carpenter [email protected]
wrote:

And it needs the model tests. The tests aren't even complete yet. Maybe
you want to remove this pull request until it's complete.

—
You are receiving this because you are subscribed to this thread.
Reply to this email directly or view it on GitHub
#276 (comment)

rayleigh · 2016-04-01T17:05:22Z

Okay. I'm closing the pull request and reopen it when it has everything.

rayleigh · 2016-04-04T02:15:58Z

I ran all unit tests and make cpplint. There's one build/include error from the make cpplint report, but when I looked through an old thread on the Stan-dev discussion group, it sounded like Jenkins could help me better understand the report.

bob-carpenter · 2016-04-04T02:39:33Z

Jenkins won't help you with cpplint (unless it shows up in
a different compiler).

What is the error? If it's the standard libraries at the
end one, just move the built-in library includes to the end.

Bob

On Apr 3, 2016, at 10:15 PM, rayleigh [email protected] wrote:

I ran all unit tests and make cpplint. There's one build/include error from the make cpplint report, but when I looked through an old thread on the Stan-dev discussion group, it sounded like Jenkins could help me better understand the report.

—
You are receiving this because you commented.
Reply to this email directly or view it on GitHub

rayleigh · 2016-04-04T03:39:14Z

Jenkins actually helped. I accidentally included stan/math/prim/mat/vectorize/apply_scalar_unary.hpp twice in log1p_exp.hpp instead of including it and the non-vectorized version.

syclik · 2016-04-04T03:41:06Z

It looks like the current problem is with a compiler error:
http://d1m1s1b1.stat.columbia.edu:8080/job/Math%20Pull%20Request%20-%20Tests%20-%20Header/257/warnings18Result/new/

You need to include cmath in order for that to be fixed.

On Sun, Apr 3, 2016 at 11:39 PM, rayleigh [email protected] wrote:

Jenkins actually helped. I accidentally included
stan/math/prim/mat/vectorize/apply_scalar_unary.hpp twice in log1p_exp.hpp
instead of including it and the non-vectorized version.

—
You are receiving this because you commented.
Reply to this email directly or view it on GitHub
#276 (comment)

…re/issue-202-vectorize-all

bob-carpenter · 2016-05-28T19:27:13Z

Is this ready to merge or if not, what's left to test?

…re/issue-202-vectorize-all

rayleigh · 2016-06-05T23:23:29Z

I made the changes that you suggested. I want to note that I moved the function cgrad from the rev arr version of util.hpp to the rev scal version because test/unit/math/mix/scal/fun/log_mix_test.cpp uses cgrad.

rtrangucci · 2016-06-17T15:51:10Z

test/unit/math/mix/mat/fun/acos_test.cpp

+
+/**
+ * This is the structure for testing mock function acos (defined in the
+ * testing framework).  See README.txt for more instructions.


@rayleigh we need to remove README.txt, along with all the references to the README

Okay. I got a chance to work on it today and made those changes.

Does anyone still need me to code review this again?
I'm traveling and pretty booked this week.

Bob

On Jun 27, 2016, at 8:24 AM, rayleigh [email protected] wrote:

In test/unit/math/mix/mat/fun/acos_test.cpp:

@@ -0,0 +1,97 @@
+#include <stan/math/mix/mat.hpp>
+#include <gtest/gtest.h>
+#include <test/unit/math/prim/mat/vectorize/prim_scalar_unary_test.hpp>
+#include <test/unit/math/rev/mat/vectorize/rev_scalar_unary_test.hpp>
+#include <test/unit/math/fwd/mat/vectorize/fwd_scalar_unary_test.hpp>
+#include <test/unit/math/mix/mat/vectorize/mix_scalar_unary_test.hpp>
+#include <stan/math/prim/mat/fun/acos.hpp>
+#include <test/unit/math/prim/mat/vectorize/vector_builder.hpp>
+
+/**

* This is the structure for testing mock function acos (defined in the

* testing framework). See README.txt for more instructions.

Okay. I got a chance to work on it today and made those changes.

—
You are receiving this because you commented.
Reply to this email directly, view it on GitHub, or mute the thread.

…re/issue-202-vectorize-all

rayleigh · 2016-07-05T22:51:49Z

I know that I haven't been pushing this, but is there anything else that I need to do? I know that there are a few future issues, but I've changed the text, updated the rev arr util.hpp, and marked tests where both expected and test values are NaN as success.

syclik · 2016-07-05T23:15:13Z

One thing I haven't seen yet is how this performs for the univariate functions pre and post. Unless I've missed it. If so, can you repost? I'm actually concerned about that.

On Jul 5, 2016, at 6:51 PM, rayleigh [email protected] wrote:

I know that I haven't been pushing this, but is there anything else that I need to do? I know that there are a few future issues, but I've changed the text, updated the rev arr util.hpp, and marked tests where both expected and test values are NaN as success.

—
You are receiving this because you commented.
Reply to this email directly, view it on GitHub, or mute the thread.

rayleigh · 2016-07-05T23:31:56Z

Daniel, what part of the univariate function performance are you concerned about?

syclik · 2016-07-05T23:54:26Z

Wall time. I just want to make sure we're not going to take a significant performance hit for the existing univariate functions.

On Jul 5, 2016, at 7:31 PM, rayleigh [email protected] wrote:

Daniel, what part of the univariate function performance are you concerned about?

—
You are receiving this because you commented.
Reply to this email directly, view it on GitHub, or mute the thread.

rayleigh · 2016-07-06T00:08:39Z

I see. I didn't measure that because I wasn't aware that I needed to, but looking at foo_test, it takes 447 ms to run the whole tests. 437 ms is from running the mix expect_values test.

However, the vectorize code is essentially putting the univariate function within a for loop. There is no optimization within the vectorize code itself because I believe this project is to make the univariate functions easier to use when writing Stan code.

bob-carpenter · 2016-07-06T00:24:59Z

Everything should get compiled away through the templates
and it's the autodiff time that's the killer, not the
function computes, but it can't hurt to test.

Do I need to review more code at this point?

Bob

On Jul 5, 2016, at 4:54 PM, Daniel Lee [email protected] wrote:

Wall time. I just want to make sure we're not going to take a significant performance hit for the existing univariate functions.

On Jul 5, 2016, at 7:31 PM, rayleigh [email protected] wrote:

Daniel, what part of the univariate function performance are you concerned about?

—
You are receiving this because you commented.
Reply to this email directly, view it on GitHub, or mute the thread.

—
You are receiving this because you commented.
Reply to this email directly, view it on GitHub, or mute the thread.

rayleigh · 2016-07-06T00:34:59Z

If test/unit/math/rev/scal/fun/util.hpp, test/unit/math/rev/arr/fun/util.hpp, and test/unit/math/prim/mat/vectorize/expect_val_eq.hpp have been reviewed, then there really haven't been any other code changes since I've pushed all the easy to vectorize univariate functions up.

If we want to test performance, what scenarios should I test these functions in? For instance, if my function is foo, would I need to run the following tests before and after vectorization:

foo(int x)
foo(double x)
foo(rev x)
foo(fwd x)
foo(mix x)

bob-carpenter · 2016-07-06T00:41:15Z

If you want to be thorough, look at log() applied to

T in {var, double} for

T (scalar)
std::vector (array)
Eigen::Matrix<T, Dynamic, 1> (vector)
Eigen::Matrix<T, Dynamic, Dynamic> (matrix)

We didn't have any higher-order vectorization than that and
have never run speed tests on forward-mode.

It doesn't need to be elaborate or checked in.

Also, did all the non-C++ files get removed?

Bob

On Jul 5, 2016, at 5:35 PM, rayleigh [email protected] wrote:

If test/unit/math/rev/scal/fun/util.hpp, test/unit/math/rev/arr/fun/util.hpp, and test/unit/math/prim/mat/vectorize/expect_val_eq.hpp have been reviewed, then there really haven't been any other code changes since I've pushed all the easy to vectorize univariate functions up.

If we want to test performance, what scenarios should I test these functions in? For instance, if my function is foo, would I need to run the following tests before and after vectorization:

foo(int x)
foo(double x)
foo(rev x)
foo(fwd x)
foo(mix x)

—
You are receiving this because you commented.
Reply to this email directly, view it on GitHub, or mute the thread.

rayleigh · 2016-07-06T01:10:57Z

I believe that all the non-C++ files got removed.

As for speed tests, I didn't suggest any vectorizations because for many of the univariate functions, there wasn't a vectorized version. So, what would I use to get the pre-vectorization results?

syclik · 2016-07-06T01:49:10Z

I just want to make sure the univariate functions for double and var are
the about the same speed as without these changes. I know that everything
should be compiled away, but I'll believe it when I see it.

I'd start simple:

time one function a whole bunch of times before and after
time a couple of Stan programs that use one of the functions before and
after. Set seeds to known values and make sure everything is identical.

If those show no increase in speed, I'm happy to believe that the compilers
actually did what they're supposed to. If there's more than a 2% time
increase, I'd need to sit and think about it.

On Tue, Jul 5, 2016 at 9:10 PM, rayleigh [email protected] wrote:

I believe that all the non-C++ files got removed.

As for speed tests, I didn't suggest any vectorizations because for many
of the univariate functions, there wasn't a vectorized version. So, what
would I use to get the pre-vectorization results?

—
You are receiving this because you commented.
Reply to this email directly, view it on GitHub
#276 (comment), or mute
the thread
https://github.com/notifications/unsubscribe/AAZ_F14puqos3_4lVMABt7KXiFayUJpRks5qSwCigaJpZM4H92K0
.

bob-carpenter · 2016-07-06T16:15:32Z

On Jul 5, 2016, at 5:08 PM, rayleigh [email protected] wrote:

I see. I didn't measure that because I wasn't aware that I needed to, but looking at foo_test, it takes 447 ms to run the whole tests. 437 ms is from running the mix expect_values test.

However, the vectorize code is essentially putting the univariate function within a for loop. There is no optimization within the vectorize code itself because I believe this project is to make the univariate functions easier to use when writing Stan code.

Daniel just wants to make sure that things aren't
slower now than they used to be. They weren't previously
optimized, either.

So just run a Stan program in the develop branch and
compare to your branch (CmdStan is easiest for that if
you check out the Stan branches as submodules) for each
of the previously vectorized functions (not many).

Bob

P.S. I'm off to Ann Arbor next Thursday! Side trip from
visiting my parents.

bob-carpenter · 2016-07-06T18:35:46Z

Yes, I was suggesting just doing the log() function.
I hope I sent it! const references to following
argument types at some scale (say 100 or 1000
elements in the list).

T
vector
Matrix<T, -1, 1>
Matrix<T, 1, -1>
matrix<T, -1, -1>

for T in {double, var}

Bob

On Jul 5, 2016, at 9:49 PM, Daniel Lee [email protected] wrote:

I just want to make sure the univariate functions for double and var are
the about the same speed as without these changes. I know that everything
should be compiled away, but I'll believe it when I see it.

I'd start simple:

time one function a whole bunch of times before and after

time a couple of Stan programs that use one of the functions before and
after. Set seeds to known values and make sure everything is identical.

If those show no increase in speed, I'm happy to believe that the compilers
actually did what they're supposed to. If there's more than a 2% time
increase, I'd need to sit and think about it.

On Tue, Jul 5, 2016 at 9:10 PM, rayleigh [email protected] wrote:

I believe that all the non-C++ files got removed.

As for speed tests, I didn't suggest any vectorizations because for many
of the univariate functions, there wasn't a vectorized version. So, what
would I use to get the pre-vectorization results?

—
You are receiving this because you commented.
Reply to this email directly, view it on GitHub
#276 (comment), or mute
the thread
https://github.com/notifications/unsubscribe/AAZ_F14puqos3_4lVMABt7KXiFayUJpRks5qSwCigaJpZM4H92K0
.

—
You are receiving this because you commented.
Reply to this email directly, view it on GitHub, or mute the thread.

syclik · 2016-07-19T20:13:39Z

I think we're good. @rayleigh timed a couple functions and it looked like there were no surprises (which is great!).

syclik · 2016-07-19T20:13:47Z

Jenkins, retest this please.

Rayleigh L and others added 18 commits March 15, 2016 12:38

Merge branch 'develop' of https://github.com/stan-dev/math into featu…

2f7c9c7

…re/issue-202-vectorize-all

Add continuous vectorize functions

1320205

Merge branch 'develop' of https://github.com/stan-dev/math into featu…

209ad63

…re/issue-202-vectorize-all

Added more cont vectorized fcts

7847624

Replace matrix exp.hpp

08fa5db

Merge branch 'develop' into feature/issue-202-vectorize-all

7d454b7

Including apply_scalar_unary.hpp in headers

3f30b6a

Updated includes to work for exp.hpp

e52eb16

Merge branch 'feature/issue-202-vectorize-all' of https://github.com/…

af33e1b

…stan-dev/math into feature/issue-202-vectorize-all

Vectorized more unconstrained fcts

e32f86f

Tests now skip NaN

8f24aaa

Updated includes

bd2f660

Added more functions

4117af4

Added more functions

f732822

Added more vectorized functions

e9430e7

Added doc

0ad8b8f

Merge branch 'develop' of https://github.com/stan-dev/math into featu…

0e5d0bd

…re/issue-202-vectorize-all

Updated doc

71586d4

rayleigh closed this Apr 1, 2016

Updated includes

5988e60

rayleigh reopened this Apr 4, 2016

Fixed includes

3c7f98d

Rayleigh L added 2 commits May 2, 2016 14:22

Created function to check nan and value

3f12adf

Merge branch 'develop' of https://github.com/stan-dev/math into featu…

0cce3d3

…re/issue-202-vectorize-all

Rayleigh L added 3 commits June 1, 2016 13:23

Updated nan function test and rev scal util.hpp

7424cdf

Merge branch 'develop' of https://github.com/stan-dev/math into featu…

59148e0

…re/issue-202-vectorize-all

Updated rev scal and arr version of util.hpp

c5886d2

rtrangucci reviewed Jun 17, 2016
View reviewed changes

Rayleigh L added 2 commits June 27, 2016 11:23

Removed reference to README.txt

07152e8

Merge branch 'develop' of https://github.com/stan-dev/math into featu…

a6b94b7

…re/issue-202-vectorize-all

syclik added this to the v2.10++ milestone Jul 19, 2016

syclik added the code reviewed label Jul 19, 2016

syclik merged commit aa1730d into develop Jul 21, 2016

syclik deleted the feature/issue-202-vectorize-all branch July 21, 2016 02:01

rayleigh mentioned this pull request Oct 14, 2016

Vectorize in Stan vectorized functions in math stan-dev/stan#2083

Merged

3 tasks

rayleigh mentioned this pull request Oct 28, 2016

Remove unnecessary using statements from vectorized functions #426

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Feature/issue 202 vectorize all #276

Feature/issue 202 vectorize all #276

rayleigh commented Apr 1, 2016

bob-carpenter commented Apr 1, 2016

bob-carpenter commented Apr 1, 2016

syclik commented Apr 1, 2016

rayleigh commented Apr 1, 2016

rayleigh commented Apr 4, 2016

bob-carpenter commented Apr 4, 2016

rayleigh commented Apr 4, 2016

syclik commented Apr 4, 2016

bob-carpenter commented May 28, 2016

rayleigh commented Jun 5, 2016

rtrangucci Jun 17, 2016

rayleigh Jun 27, 2016

bob-carpenter Jun 28, 2016

rayleigh commented Jul 5, 2016

syclik commented Jul 5, 2016

rayleigh commented Jul 5, 2016

syclik commented Jul 5, 2016

rayleigh commented Jul 6, 2016

bob-carpenter commented Jul 6, 2016

rayleigh commented Jul 6, 2016

bob-carpenter commented Jul 6, 2016

rayleigh commented Jul 6, 2016

syclik commented Jul 6, 2016

bob-carpenter commented Jul 6, 2016

bob-carpenter commented Jul 6, 2016

syclik commented Jul 19, 2016

syclik commented Jul 19, 2016

Feature/issue 202 vectorize all #276

Feature/issue 202 vectorize all #276

Conversation

rayleigh commented Apr 1, 2016

Submisison Checklist

Summary:

Intended Effect:

How to Verify:

Side Effects:

Documentation:

Reviewer Suggestions:

Copyright and Licensing

bob-carpenter commented Apr 1, 2016

bob-carpenter commented Apr 1, 2016

syclik commented Apr 1, 2016

rayleigh commented Apr 1, 2016

rayleigh commented Apr 4, 2016

bob-carpenter commented Apr 4, 2016

rayleigh commented Apr 4, 2016

syclik commented Apr 4, 2016

bob-carpenter commented May 28, 2016

rayleigh commented Jun 5, 2016

rtrangucci Jun 17, 2016

Choose a reason for hiding this comment

rayleigh Jun 27, 2016

Choose a reason for hiding this comment

bob-carpenter Jun 28, 2016

Choose a reason for hiding this comment

rayleigh commented Jul 5, 2016

syclik commented Jul 5, 2016

rayleigh commented Jul 5, 2016

syclik commented Jul 5, 2016

rayleigh commented Jul 6, 2016

bob-carpenter commented Jul 6, 2016

rayleigh commented Jul 6, 2016

bob-carpenter commented Jul 6, 2016

rayleigh commented Jul 6, 2016

syclik commented Jul 6, 2016

bob-carpenter commented Jul 6, 2016

bob-carpenter commented Jul 6, 2016

syclik commented Jul 19, 2016

syclik commented Jul 19, 2016