Try using hypothesis.target() to better test BooleanArrays #78

asmeurer · 2020-08-24T23:53:02Z

This should not be merged.

This has a lot of commits from #74 which should go away when it is merged.

See issue #77. I've intentionally added a bug here to see if I can get hypothesis to find it. So far, I've been unsuccessful, even with many examples.

…edError These only work in NumPy as fancy indices. However, there is too much of a risk of API confusion in thinking that Tuple() works like Tuple((1, 2, 3)) instead of Tuple(1, 2, 3). So we make this an unconditional error with helpful error messages. Fancy indices should use either a list or array. c.f. issue Quansight-Labs#17.

The shape an array indexed by an integer array is the product of the array and index shapes. If both are large, this can produce a very large resulting array which makes the test run too slow (hypothesis deadline exceeded) and use too much memory.

…nstructors This also fixes ndindex(boolean_scalar) to correctly give a BooleanArray object instead of an Integer object.

Technically for integer scalars ndindex() will return an Integer(), rather than IntegerArray(), so it could vary in some tests if this is tested of not. However, the behavior should be identical for Integer, IntegerArray(integer scalar), and IntegerArray(shape () integer array) in all cases.

This adds a new helper function operator_index() that works like operator.index() except it disallows booleans. The built-in bool works with operator.index, but np.bool_ gives a deprecation warning, so according to our policy of making errors out of things that give deprecation warnings in NumPy, we make this give an error. This affects Integer(), Slice(), and asshape(). A slight compatibility break here against NumPy is that NumPy actually does allow booleans in slices. However, I'm not too worried about this as it's poor form to do that, and most likely would indicate a bug in user code.

…hat don't return an index Not only is this cleaner, as we don't have to "workaround" the function to pretend that it is testing an index, it fixes an issue where the previous way was not actually testing if a function raised an exception properly, because it would just be indexed anyway after calling the function. This affected newshape() and isempty() tests.

…ray types

If an array index is created from another array index, this should be done. This is especially useful when creating a new array index out after broadcasting an existing array index, since broadcasting creates a memory efficient readonly view.

So far we disallow any other indices in the tuple. This also makes it so that Tuple.expand() broadcasts any arrays together.

…e size 0

Any slices, ellipses, or newaxes must not be between the arrays, or it raises NotImplementedError. This will probably never be implemented, because it's a bit of a wart case in the indexing API, which should raise an error (or at least that's what Travis told me to do). Also makes Tuple.expand() convert Integers into IntegerArrays as arrays when the tuple also contains arrays (so that they will also be broadcast).

…Integer()

This adds a private _axis keyword argument to newshape() methods. This is needed to keep the NumPy 1.19 behavior of not raising IndexError on out of bounds indices on empty arrays in some cases. This is needed because the specific cases when this happens require knowing the entire shape, but previously we just passed through idx.newshape(shape[i]) in Tuple.newshape(). This behavior will be deprecated in NumPy 1.20, at which point we will stop supporting it (breaking support earlier is hard because we cannot explicitly test against NumPy without a deprecation warning to catch). When this happens, the _axis argument to the newshape() methods will be removed and we can go back to just passing through shape[i] as before.

Now that Tuple.newshape() calls expand() instead of reduce(), it does not need to handle ellipses.

The test_ndindex_expand_hypothesis() already tests tuples because the ndindices strategy generates tuples, and it already tests all the same things that the test_tuple_expand_hypothesis test tests.

This removes duplication for the newshape tests. The integer and slice exhaustive tests are still in the respective files for the types.

This is needed because there are some changes in 1.20 that will make testing a lot easier. Right now we have to emulate some broken behavior that gives deprecation warnings in 1.20. Without the deprecation warnings, it is a lot harder to test because we can't catch the warning.

So far only the Tuple constructor an expand() are implemented properly.

There are some weird semantics with them where they sort of act like shape () arrays and sort of don't. It isn't that important to support right now, so we'll just leave it unimplemented.

The logic isn't correct in the case where boolean scalars are mixed with array indices, but this is currently disabled in the Tuple constructor.

…mples

…uple.expand

…reduce()

See issue Quansight-Labs#77. This implementation of checking if the array shapes are subsequences of the test array shape doesn't seem to be fine grained enough for it to find a simple bug, so I think it will need to be improved.

That way we can accurately make a target value any time a boolean array doesn't match, as well as any time it does. This doesn't actually work well yet, but I think it's at least better than what I was doing before.

The hypothesis tests should catch this bug, but so far I haven't been able to make them without using explicit @examples.

asmeurer · 2020-08-26T18:14:44Z

I think this approach doesn't really work. A better way is to make smarter strategies that generate boolean arrays that are more likely to match the test array shape, which I've done in #74.

asmeurer added 30 commits August 7, 2020 16:02

Replace duplicate logic in ndindex() with calls to the array class co…

f1b9092

…nstructors This also fixes ndindex(boolean_scalar) to correctly give a BooleanArray object instead of an Integer object.

Make sure a line is covered by the tests

14124ab

Merge branch 'master' into tuples-of-arrays

6dfc7b7

Update the logic for which boolean arrays should raise IndexError

bba4fdc

Add a note to the type confusion doc about using == when comparing ar…

915f49f

…ray types

Allow creating Tuples of IntegerArrays

2cbdd91

So far we disallow any other indices in the tuple. This also makes it so that Tuple.expand() broadcasts any arrays together.

Make str and repr on Tuple do the right thing with arrays

bbbcb3a

Don't print "IntegerArray" in the Tuple repr if the array doesn't hav…

464830a

…e size 0

Add some basic checks for the _copy flag of ArrayIndex

469936e

Note in the docstring of IntegerArray.reduce() that it can return an …

e813e9e

…Integer()

Fix test_ndindex() test to actually run

2d54bbc

Fix printing of Tuple(...)

cf0d873

Improve some documentation around integer array indices

573acdd

Make warnings in the tests give errors

da006c4

Fix check_same() for warnings being raised as exceptions

052aa06

Fix pyflakes error

38e5aa7

Fix an issue in the tests to avoids array comparisons

7820b59

Improve test coverage

0429ee8

Remove some dead code

1663f85

Now that Tuple.newshape() calls expand() instead of reduce(), it does not need to handle ellipses.

Remove the duplicate test_tuple_expand_hypothesis() test

d3c7d93

The test_ndindex_expand_hypothesis() already tests tuples because the ndindices strategy generates tuples, and it already tests all the same things that the test_tuple_expand_hypothesis test tests.

Move newshape and expand hypothesis tests into their own files

faf719a

This removes duplication for the newshape tests. The integer and slice exhaustive tests are still in the respective files for the types.

asmeurer added 23 commits August 19, 2020 20:03

Start supporting tuples including boolean arrays

1014444

So far only the Tuple constructor an expand() are implemented properly.

Disallow boolean scalars separated by slices, ellipses, or newaxes

06dc0d3

Disallow mixing boolean scalars with arrays in a Tuple

94f8fe8

There are some weird semantics with them where they sort of act like shape () arrays and sort of don't. It isn't that important to support right now, so we'll just leave it unimplemented.

Fix Tuple.newshape for boolean scalars

60f53ee

The logic isn't correct in the case where boolean scalars are mixed with array indices, but this is currently disabled in the Tuple constructor.

Fix Tuple.reduce() to handle boolean arrays properly

2dfd020

Prefer ints and slices to newaxes when shrinking hypothesis Tuple exa…

ff50681

…mples

Fix broadcasting of Integers in Tuple.expand()

b915f85

Test additional properties of Tuple.expand()

4b4b2e3

Add a note about boolean arrays in the Tuple.expand() docstring

9d0162f

Only convert Integers into arrays in Tuple.expand when there are arrays

1a345ff

Fix a test condition

1ba9c55

Add some @example test cases

4c2d7cc

Add some @examples

b9fadb9

Handle a corner case where array bounds are not checked properly in T…

94a6a9d

…uple.expand

Add some @examples for code coverage

c62a065

Add an @example to increase coverage

b8bdc5f

Add an @example for coverage

2d0cf96

Add an @example for coverage

754c4ee

Correctly handle a corner case where bounds are not checked in Tuple.…

a0452ac

…reduce()

Add an @example

bfc6eaa

Do boolean array targetting at the exception level

f082aba

That way we can accurately make a target value any time a boolean array doesn't match, as well as any time it does. This doesn't actually work well yet, but I think it's at least better than what I was doing before.

Manually add a bug to Tuple.newshape to test hypothesis targeting

4a9b5d9

The hypothesis tests should catch this bug, but so far I haven't been able to make them without using explicit @examples.

asmeurer added the Nice-to-have label Aug 24, 2020

asmeurer marked this pull request as draft August 24, 2020 23:57

asmeurer mentioned this pull request Aug 24, 2020

Hypothesis not finding failing examples #77

Open

asmeurer added 2 commits August 24, 2020 18:57

Call target() with float inputs

0f21974

Introduce a more subtle bug for the tests to find

3dd52f4

asmeurer closed this Aug 26, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Try using hypothesis.target() to better test BooleanArrays #78

Try using hypothesis.target() to better test BooleanArrays #78

asmeurer commented Aug 24, 2020

asmeurer commented Aug 26, 2020

Try using hypothesis.target() to better test BooleanArrays #78

Try using hypothesis.target() to better test BooleanArrays #78

Conversation

asmeurer commented Aug 24, 2020

asmeurer commented Aug 26, 2020