Vectorized logical functions #2925

jessexknight · 2023-07-17T20:18:21Z

Description

As discussed here, the logical functions don't support vectorization yet.

New functions

Add functions for the logical operators (==,<, <=, >, >=) and their functions (logical_eq, etc.) to allow containers as arguments and support broadcasting. The signatures will be, for example,

int logical_eq(scalar, scalar);
int[] logical_eq(scalars, scalars);
int[] logical_eq(scalar, scalars);
int[] logical_eq(scalars, scalar);

where

scalar is int | real | complex
scalars is scalar[] | vector | row_vector | complex_vector | complex_row_vector.

If there are two scalar inputs, the existing function is called. If one of the arguments is a container, the other argument must be a container of the same shape or a scalar. With a scalar and container, the scalar is broadcast.

Vectorized logic

We also want to add two functions

int any(scalars);
int all(scalars);

where the first returns 0 if all of the arguments are 0 and 1 otherwise, and the second returns 0 if any of the arguments is 0 and 1 otherwise. We could also add a not function that performs elementwise negation.

Example & Expected Output

data {
  int N;
  real x1[N];
  real x2[N];
}
transformed data {
  real y = sum(x1 > x2);
}

Current error:

No matches for: logical_gt(real[ ], real[ ])

Current Version:

v4.6.2

The text was updated successfully, but these errors were encountered:

bob-carpenter · 2023-07-17T20:33:23Z

Thanks for opening the issue, @jessexknight. I edited to add signatures and a definition and to remove the unnecessary R and remove the erroneous comment about efficiency.

As far as efficiency, I'm afraid implementing a built-in in C++ won't be any faster than writing the loop in Stan, because Stan gets compiled down to C++. The only advantage to having built-ins is when we have vectorized autodiff, which we can accelerate.

jessexknight · 2023-07-17T20:58:49Z

Thanks - that is surprising about performance. I'm really out of my depth here with C++ etc., but would there be a performance difference if using something like apply_scalar_binary to existing scalar functions, vs a native function for this kind of thing from the Eigen library?

bob-carpenter · 2023-07-18T14:00:13Z

Yes, if we can get things compiled down to Eigen's vectorized operations then we can exploit their use of CPU vectorization (e.g., SSE and AVX operations at the CPU level). This can give a several times speedup. The term "vectorized" usually refers to using SSE, AVX, etc. on the CPU---our use of the term in Stan is non-standard. I doubt they've vectorized logical operations, but they've done a lot of common math functions like log and exp and sin and cos.

Vectorized CPU operations can be a lot faster.

apply_scalar_binary doesn't compile down that low but we might actually be able to rewrite it to better exploit these operations. I think the current implementation assumes a double or autodiff.

bob-carpenter added feature new function good first issue labels Jul 17, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Vectorized logical functions #2925

Vectorized logical functions #2925

jessexknight commented Jul 17, 2023 •

edited by bob-carpenter

Loading

bob-carpenter commented Jul 17, 2023

jessexknight commented Jul 17, 2023

bob-carpenter commented Jul 18, 2023

Vectorized logical functions #2925

Vectorized logical functions #2925

Comments

jessexknight commented Jul 17, 2023 • edited by bob-carpenter Loading

Description

New functions

Vectorized logic

Example & Expected Output

Current Version:

bob-carpenter commented Jul 17, 2023

jessexknight commented Jul 17, 2023

bob-carpenter commented Jul 18, 2023

jessexknight commented Jul 17, 2023 •

edited by bob-carpenter

Loading