MAINT: interpolate: move fpknot wrap from Cython to manual #14
base: bspl_no_fused
Conversation
Overall seems fine for the most part, hope the comments are helpful.
py_fpknot(PyObject* self, PyObject *args)
{
    PyObject *py_x=NULL, *py_t=NULL, *py_residuals=NULL;
    int k; // XXX: python int => "i" in ParseTuple?
This is fine as long as you never expect reasonable values of `k` to fall outside the range of a C int. You also need to check the return value of `PyArg_ParseTuple` for the error case to catch the `OverflowError`, but you do that.
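For reference, a minimal sketch of that pattern (the format string and argument order here are assumptions, not necessarily this PR's actual signature):

```c
/* "i" converts a Python int to a C int and raises OverflowError if the
   value does not fit; PyArg_ParseTuple then returns 0, so checking its
   return value is what catches the error. */
static PyObject *
py_fpknot_sketch(PyObject *self, PyObject *args)
{
    PyObject *py_x = NULL, *py_t = NULL, *py_residuals = NULL;
    int k;

    if (!PyArg_ParseTuple(args, "OOiO", &py_x, &py_t, &k, &py_residuals)) {
        return NULL;  /* OverflowError / TypeError is already set */
    }
    /* ... k is now guaranteed to fit in a C int ... */
    Py_RETURN_NONE;
}
```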
    return NULL;
}
// XXX: Do not need to DECREF py_x etc from PyArg_ParseTuple?
Since it's a borrowed reference, you don't need to, as long as you never create an owned reference. That said, borrowed references are really hard to reason about, and you're creating objects based on `py_x` and `py_t` below, so I'd personally `INCREF` here and then `DECREF` at the end when you're done with them.
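A sketch of what that looks like, assuming the parse call above:

```c
/* Promote the borrowed references from PyArg_ParseTuple to owned ones
   for the duration of the function. */
if (!PyArg_ParseTuple(args, "OOiO", &py_x, &py_t, &k, &py_residuals)) {
    return NULL;
}
Py_INCREF(py_x);   /* take temporary ownership */
Py_INCREF(py_t);

/* ... create objects based on py_x / py_t ... */

Py_DECREF(py_t);   /* release ownership on every exit path */
Py_DECREF(py_x);
```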
Thanks! Point taken about borrowed vs owned references. I could use some advice though: if the whole trampoline function is literally

- `PyArg_ParseTuple` to get python arguments,
- `check_array` to validate them,
- grab pointers from arrays and delegate to numpy-oblivious C code

(this `py_fpknot` is what I have in mind; the shape is sketched below), keeping all references borrowed looks simpler. How does it fare in the nogil context / under free-threading, though, and does switching to owned references change the equation? From reading the free-threading docs (which you wrote?), I have the impression that owned references are generally safer, but not always. I don't have a good gut feeling for what is safe and what is not.
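A sketch of that trampoline shape, with all references kept borrowed (`check_array` is this PR's helper; `do_work` is a hypothetical stand-in for the numpy-oblivious C++ core):

```c
static PyObject *
py_trampoline(PyObject *self, PyObject *args)
{
    PyObject *py_x = NULL;

    /* 1. parse: py_x is a borrowed reference */
    if (!PyArg_ParseTuple(args, "O", &py_x)) {
        return NULL;
    }
    /* 2. validate dtype / ndim / contiguity */
    if (!check_array(py_x, 1, NPY_DOUBLE)) {
        return NULL;
    }
    /* 3. grab raw pointers and delegate */
    PyArrayObject *a_x = (PyArrayObject *)py_x;
    do_work((const double *)PyArray_DATA(a_x), PyArray_DIM(a_x, 0));
    Py_RETURN_NONE;
}
```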
/*
// XXX: overkill? Mimic cython's `double[::1] x` etc --- replaced by check_array stanza below
It doesn't make a ton of difference unless you want the API to be flexible and cast from other dtypes when it's handed something other than a double array, or something that isn't C-contiguous. If you already control all that, then what you have with `check_array` is fine.
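For concreteness, a guess at what such a strict validator looks like (the PR's actual `check_array` may differ; the signature is inferred from its call sites below):

```c
/* Strict check: exact ndim, dtype and C-contiguity; no casting, no copies. */
static int
check_array(PyObject *obj, int ndim, int typenum)
{
    if (!PyArray_Check(obj)) {
        PyErr_SetString(PyExc_TypeError, "expected an ndarray");
        return 0;
    }
    PyArrayObject *arr = (PyArrayObject *)obj;
    if (PyArray_NDIM(arr) != ndim ||
        PyArray_TYPE(arr) != typenum ||
        !PyArray_IS_C_CONTIGUOUS(arr)) {
        PyErr_SetString(PyExc_ValueError,
                        "expected a C-contiguous array of matching dtype/ndim");
        return 0;
    }
    return 1;
}
```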
PyArrayObject *a_t = (PyArrayObject *)py_t;
PyArrayObject *a_residuals = (PyArrayObject *)py_residuals;

// XXX: npy_intp vs Py_ssize_t vs ssize_t (== ptrdiff_t)?
In NumPy 2.0, `npy_intp` and `Py_ssize_t` are the same on all platforms. Before NumPy 2.0, `npy_intp` was 32-bit on Windows. It's not quite the same as `ptrdiff_t` because it's a signed int (to make error checking less awful: negative values always indicate an error).
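If the glue assumes the two types agree, a cheap compile-time guard is possible (a sketch; `static_assert` is available since C11/C++11):

```c
#include <Python.h>
#include <numpy/ndarraytypes.h>

/* Holds everywhere on NumPy >= 2.0; fails on NumPy 1.x + Windows,
   where npy_intp is 32-bit. */
static_assert(sizeof(npy_intp) == sizeof(Py_ssize_t),
              "npy_intp and Py_ssize_t differ in size");
```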
`PY_SSIZE_T_MAX` (which is the same as `sys.maxsize` in python) is also often used as a sigil value for things that are typed as `Py_ssize_t`.
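For instance, a sketch of the sigil pattern for an optional argument (a hypothetical fragment; `"n"` is the `Py_ssize_t` converter):

```c
/* PY_SSIZE_T_MAX as a "not provided" marker; with "|n", startrow keeps
   its initial value when the argument is omitted. */
Py_ssize_t startrow = PY_SSIZE_T_MAX;
if (!PyArg_ParseTuple(args, "|n", &startrow)) {
    return NULL;
}
if (startrow == PY_SSIZE_T_MAX) {
    startrow = 1;  /* apply the default */
}
```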
Ugh. I fully realize I'm asking for more and more, so if you have bandwidth to spare for me, I could use some advice.
For context:

- The goal is to communicate between python and the C++ extension which does the actual computations (https://github.com/ev-br/scipy/blob/bspl_no_fused/scipy/interpolate/src/__fitpack.cc);
- The extension does not use anything from python or numpy; it also does not do any memory allocations and relies on the caller to send in everything, including temp buffers.
- The extension uses `ssize_t` for all indexing where classic C++ would use `size_t`, to explicitly avoid any signed/unsigned mess.
- Under MSVC on Windows, `ssize_t` is not defined (apparently), so I just `#define ssize_t ptrdiff_t`.
- I have to support Windows, and int32 overflows are not impossible.
- I control both the C++ and the python side.

So when I need to return to python an array of integer indices (offsets: for a 2D array `A`, nonzeros in row `j` start at `A[j, offset[j]]`), my thinking was (sketched below):

- allocate `np.empty(..., dtype=np.intp)` on the python side
- use `npy_intp` in the python/C glue
- keep using `ssize_t` in C++

Does this sound reasonable, or am I making things hard for myself?
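A sketch of that plan's glue layer (`cpp_core` is a hypothetical name for the numpy-oblivious C++ entry point):

```c
/* Python side allocates: offsets = np.empty(nrows, dtype=np.intp)
   The glue hands the raw buffer to C++, which only ever sees ssize_t. */
PyArrayObject *a_offs = (PyArrayObject *)py_offs;  /* validated earlier */
npy_intp *offsets = (npy_intp *)PyArray_DATA(a_offs);
npy_intp nrows = PyArray_DIM(a_offs, 0);

/* the cast is only sound while npy_intp == Py_ssize_t == ssize_t,
   i.e. on numpy 2.x (see the resolution below) */
cpp_core((ssize_t *)offsets, (ssize_t)nrows);
```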
Ah, it turns out that `Py_ssize_t` is a typedef for `ssize_t` where available, so I think using `Py_ssize_t` on the python side and then `ssize_t` on the C++ side should be sufficient.
Which still leaves the question of the matching numpy dtype/typecode?
So the resolution (as discussed offline) is:

- under numpy 2.x, `npy_intp` is interchangeable with `Py_ssize_t` (== `ssize_t` == `ptrdiff_t`);
- under numpy 1.x and on Windows, `npy_intp` is 32-bit, but indexing in numpy itself is problematic if indices overflow 32-bit sized ints.

So in the end, the best one can do here is to detect numpy 1.x on Windows, check whether the offsets are too large, and bail out if so.
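A sketch of that bail-out (`max_offset` is a hypothetical bound computed from the input sizes; `NPY_SIZEOF_INTP` and `NPY_MAX_INTP` are numpy's own macros):

```c
/* Only compiled where npy_intp is narrower than 64 bits
   (numpy 1.x on Windows). */
#if NPY_SIZEOF_INTP < 8
    if (max_offset > (ssize_t)NPY_MAX_INTP) {
        PyErr_SetString(PyExc_OverflowError,
                        "offsets do not fit into 32-bit npy_intp");
        return NULL;
    }
#endif
```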
    static_cast<const double *>(PyArray_DATA(a_residuals))
);

// XXX: need to DECREF a_* variables? Apparently not unless PyArray_ContiguousFromObject
`ContiguousFromObject` returns a new reference to an array object. That array might just be a view onto the original array, but it's a new `PyObject` and so needs to be DECREF'd. If you don't go through that function, you're creating a borrowed reference, which doesn't need to be DECREF'd. That said, borrowed references are hard to reason about (as you saw!) since most APIs return new references.
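A sketch of the owned-reference variant:

```c
/* PyArray_ContiguousFromObject returns a new reference (possibly a view
   of py_x), so it must be released when we're done. */
PyArrayObject *a_x = (PyArrayObject *)PyArray_ContiguousFromObject(
        py_x, NPY_DOUBLE, 1, 1);
if (a_x == NULL) {
    return NULL;
}
const double *x = (const double *)PyArray_DATA(a_x);
/* ... use x ... */
Py_DECREF(a_x);  /* we own this reference */
```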
Py_ssize_t nc;
Py_ssize_t startrow=1; // XXX: optional, keeps the value intact if not given?

const char *kwlist[] = {"a", "offset", "nc", "y", "startrow", NULL};
Just an FYI: depending on what you're doing, if this function is called a lot, there's a surprising amount of overhead to creating python strings from these C strings each time the function is called. For internal-only functions implemented in C, I would look at only using arguments positionally. NumPy goes out of its way to avoid using `PyArg_ParseTupleAndKeywords` in hot code paths and also has a whole keyword-argument-name caching system.
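A sketch of the positional-only version of this signature (the function name is hypothetical; `"n"` is the `Py_ssize_t` converter, and `"|"` makes `startrow` optional, keeping its initial value when omitted, which also answers the XXX above):

```c
static PyObject *
py_qr_reduce(PyObject *self, PyObject *args)
{
    PyObject *py_a, *py_offs, *py_y;
    Py_ssize_t nc, startrow = 1;

    /* no kwlist, so no per-call python string creation */
    if (!PyArg_ParseTuple(args, "OOnO|n",
                          &py_a, &py_offs, &nc, &py_y, &startrow)) {
        return NULL;
    }
    /* ... */
    Py_RETURN_NONE;
}
```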
}

if (!(check_array(py_a, 2, NPY_DOUBLE) &&
      check_array(py_offs, 1, NPY_LONG) &&  // XXX: typecode for ssize_t (==ptrdiff_t)?
If you need to support windows and NumPy 1.x, this might be annoying to deal with if you need specifically a `Py_ssize_t`, since it may or may not be the same depending on the architecture. You could always do a conversion, which will be a no-op if it really is the type you want. Also, you probably want `NPY_DEFAULT_INT` instead of `NPY_LONG`, which is wrong on NumPy 2.x.
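A sketch of the conversion approach (`PyArray_FROM_OTF` returns the input itself, as a new reference, when the dtype and flags already match, and a converted copy otherwise, so it costs nothing in the common case):

```c
PyObject *a_offs = PyArray_FROM_OTF(py_offs, NPY_INTP, NPY_ARRAY_IN_ARRAY);
if (a_offs == NULL) {
    return NULL;  /* conversion failed */
}
/* ... use the data; this is a new reference, so Py_DECREF(a_offs)
   when done ... */
```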
);

//Py_DECREF(a_offs);
// XXX: a & y modified in-place: need to incref?
Nope, not with a borrowed reference. You can always temporarily take ownership for the length of the function to make it easier to reason about.
/*
XXX:
1. how to check for refleaks / refcounting errors?
2. goto fail & crosses initialization errors: a better pattern or copy-paste?
You either declare all your variables at the top of the function, or you create a new scope with `{}` and define the temporary variables in the new scope.
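A sketch showing both options around a `goto fail` cleanup path (`do_something` is a hypothetical callee):

```c
static PyObject *
wrapper(PyObject *self, PyObject *args)
{
    PyObject *py_x = NULL, *result = NULL;
    PyArrayObject *a_x = NULL;            /* cleanup targets declared up front */

    if (!PyArg_ParseTuple(args, "O", &py_x)) {
        return NULL;
    }
    a_x = (PyArrayObject *)PyArray_ContiguousFromObject(
            py_x, NPY_DOUBLE, 1, 1);
    if (a_x == NULL) {
        goto fail;
    }
    {   /* new scope: initializations here cannot be jumped over */
        const double *x = (const double *)PyArray_DATA(a_x);
        result = do_something(x);
    }
fail:
    Py_XDECREF(a_x);
    return result;                        /* NULL if an error path was taken */
}
```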
/*
XXX:
1. how to check for refleaks / refcounting errors?
Think really hard. There's also https://pypi.org/project/pytest-leaks/, but it's bitrotting pretty hard at this point. You can make a quick-and-dirty version with a debug python build using `sys.gettotalrefcount()`, though.
Removes an extra `xp.asarray` inside `xp.concat` which was causing errors due to setting an array with a sequence.
- Use `scipy._lib.array_api_compat.numpy` rather than `numpy` as a default when no array is specified
- Remove use of in-place item assignment in `_cartesian_product`
- Remove use of `xp.newaxis` since Torch doesn't support this
- Switch to using positional arguments for `xp.linspace` since Torch uses different keywords
- Modify slow tests

- Use the same style of typing as `_array_api` for `Array`
- Use `np_compat` from `_lib.array_api` rather than `_lib.array_api_compat.numpy`
- Default to `np.linalg` when `xp.linalg` doesn't exist
- Add `xp` argument to test functions rather than calling `array_namespace` every time
- Skip testing `array_like` inputs when the backend is not numpy
- Change how `FixedRule`, `NestedRule` and `NestedFixedRule` determine the array namespace

- Use a `_concat` helper function to pick whether to use `xp.concat` or `xp.concatenate`
- Use `xp_test = array_namespace(a)` for using `atan` since `atan` isn't supported in np < 2
- Compute subregion coordinates and cartesian products without using `np`-specific code
The failing test is testing np.matrix inputs to RGI. This is allowed for backwards compatibility but does not work with xp_assert_* infrastructure. Thus, convert to arrays for assertions. An alternative is to skip the test when running with SCIPY_ARRAY_API env variable (since then we're not bound by backwards compat). But converting to ndarray seems simpler.
The computational complexity is O(1), so there is no reason for this helper to stay in Cython or move to C++.
[skip cirrus] [skip circle]
- Removes an unnecessary `isinstance(a, list)` check, since by default `array_namespace(a_list)` is `np_compat`
- Deletes the function `_concat`, which was used to fall back to `np.concatenate` if `np.concat` wasn't available, and instead makes sure `np_compat` is being used

- Determine the compatible version of the array namespace in each rule

* BUG: special.logsumexp: fix type promotion [skip cirrus] [skip circle]
* MAINT: adjust dtype determination logic in xp_broadcast_promote
* TST: special.logsumexp: add dtype tests
* TST: special.logsumexp: fix tests
* MAINT: define mixed-type promotion rule if underlying library doesn't
* TST: special.logsumexp: update tests
* STY: special.logsumexp: use is_array_api_strict where appropriate [skip ci]

Co-authored-by: Matt Haberland <[email protected]>
MAINT: interpolate: move an internal utility from cython to python
…ic_filter`, etc.) (scipy#20811)

* add axes support to gradient_magnitude and laplace functions; update rank filter axes docstring to match more detailed style of other functions
* axes should not be cast using xp.asarray in test case
* np.empty_like -> xp.empty_like
* use concatenation instead of in-place array modification in test suite for JAX compatibility

* ENH: interpolate: add Floater Hormann weights

Reviewed at scipy#21497
Co-authored-by: Pauli Virtanen <[email protected]>

…21784)

* add tests of basic matmul involving 1D arrays
* fix the bug in A@v

…cipy#21792)

* BUG: sparse: fix setdiag for matrices with missing diagonal entries

We need to first set existing diagonal entries before adding the missing ones, as inserting new values invalidates the computed offsets.

* add tests for fix of csr.setdiag

Co-authored-by: Dan Schult <[email protected]>
From python, it's find_interval, not py_find_interval
Otherwise, 32-bit build on linux in CI fails on ./tools/check_pyext_symbol_hiding.sh
…trix

Reuse _dierckx.data_matrix, which constructs the same object in a "packed" format, with per-row offsets of non-zero elements instead of per-element CSR indices/indptr arrays. Add `extrapolate=False` parameter to data_matrix to account for design_matrix usages.
Co-authored-by: Christian Lorentzen <[email protected]>
use vectors not unique_ptr(malloc) for work arrays