0.12.2
Added
- New tool: futhark autotune, for tuning the threshold parameters
  used by incremental flattening. Based on work by Svend Lund
  Breddam, Simon Rotendahl, and Carl Mathias Graae Larsen.
- New tool: futhark dataget, for extracting test input data. Most
  users will probably never need this.
- Programs compiled with the cuda backend now take the options
  --default-group-size, --default-num-groups, and --default-tile-size.
- Segmented reduce_by_index operations are now substantially faster
  for small histograms.
- New functions: f32.lerp and f64.lerp, for linear interpolation.
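
For readers unfamiliar with linear interpolation, the following Python sketch shows the usual formula. It is an illustration only, not Futhark's implementation: the function name `lerp` and the argument order used here (two endpoints, then the interpolation factor `t`) are assumptions made for this example.

```python
def lerp(v0: float, v1: float, t: float) -> float:
    """Linearly interpolate between v0 (at t = 0) and v1 (at t = 1)."""
    return v0 + (v1 - v0) * t


# A quarter of the way from 0.0 to 10.0.
print(lerp(0.0, 10.0, 0.25))  # 2.5
```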
Fixed
- Fixes to aliasing of record updates.
- Fixed unnecessary array duplicates after coalescing optimisations.
- reduce_by_index nested in maps will no longer sometimes require
  huge amounts of memory.
- Source location now correct for unknown infix operators.
- Function parameters are no longer in scope of themselves (#798).
- Fixed a nasty out-of-bounds error in handling of irregular allocations.
- The floor/ceil functions in f32/f64 now handle infinities correctly
  (and are also faster).
- Using % on floats now computes fmod instead of crashing the compiler.
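
As a point of reference for the last item, fmod is the C-style floating-point remainder, whose result takes the sign of the dividend. The Python sketch below illustrates that behaviour with the standard library's `math.fmod`. It only demonstrates what fmod computes and assumes nothing about how the Futhark compiler implements % internally.

```python
import math

# fmod keeps the sign of the first argument (the dividend).
positive = math.fmod(5.5, 2.0)
negative = math.fmod(-5.5, 2.0)

print(positive)  # 1.5
print(negative)  # -1.5
```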