Refine the lowering of onnx.concat (#1778)
* update shape inference and test

Signed-off-by: chentong319 <[email protected]>

* lowering

Signed-off-by: chentong319 <[email protected]>

* lowering

Signed-off-by: chentong319 <[email protected]>

* Update onnx-mlir product version (#1727)

* Update onnx-mlir product version

Signed-off-by: Megan Hampton <[email protected]>

* fix einsum decomposition bug (#1730)

Signed-off-by: Soren Lassen <[email protected]>

Signed-off-by: Soren Lassen <[email protected]>
Signed-off-by: Megan Hampton <[email protected]>

* Update version to 0.3.1 (#1728)

Signed-off-by: Megan Hampton <[email protected]>

* Fix issues to compile Yolov3-12. (#1726)

Stop conversion to enable onnx.LeakyRelu on NNPA when the stickified layouts of input and output are not the same.
Signed-off-by: Yasushi Negishi <[email protected]>
Signed-off-by: Megan Hampton <[email protected]>

* simplified if lowering (#1717)

borrowed ideas from ConvertTrivialIfToSelect from
https://github.com/llvm/llvm-project/blob/main/mlir/lib/Dialect/SCF/IR/SCF.cpp

Signed-off-by: Soren Lassen <[email protected]>

Signed-off-by: Soren Lassen <[email protected]>
Co-authored-by: chentong319 <[email protected]>
Signed-off-by: Megan Hampton <[email protected]>

* Add definitions

Signed-off-by: Megan Hampton <[email protected]>

* parse string type from getTypeMap() (#1735)

added a parse lit test that passes with this fix

Signed-off-by: Soren Lassen <[email protected]>

Signed-off-by: Soren Lassen <[email protected]>
Co-authored-by: chentong319 <[email protected]>
Signed-off-by: Megan Hampton <[email protected]>

* fix 2 compiler warnings (#1736)

* fix sprintf-is-deprecated clang warning

warning from Apple clang version 14.0.0 (clang-1400.0.29.201):
```
.../src/Conversion/ONNXToKrnl/PerfectHash.cpp:47:5: warning: 'sprintf' is deprecated: This function is provided for compatibility reasons only.  Due to security concerns inherent in the design of sprintf(3), it is highly recommended that you use snprintf(3) instead. [-Wdeprecated-declarations]
    sprintf(str, "%lld", (long long)val);
/Library/Developer/CommandLineTools/SDKs/MacOSX13.0.sdk/usr/include/stdio.h:188:1: note: 'sprintf' has been explicitly marked deprecated here
__deprecated_msg("This function is provided for compatibility reasons only.  Due to security concerns inherent in the design of sprintf(3), it is highly recommended that you use snprintf(3) instead.")
```

Signed-off-by: Soren Lassen <[email protected]>
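The warning above recommends `snprintf` over `sprintf`. A minimal sketch of that kind of replacement follows; the helper name `int64ToString` and the buffer size are illustrative assumptions, not the code in PerfectHash.cpp.

```cpp
#include <cassert>
#include <cstdint>
#include <cstdio>
#include <string>

// Hypothetical helper: format a 64-bit integer into a fixed-size buffer.
// snprintf takes the buffer size, so it cannot write past the end of `str`.
std::string int64ToString(int64_t val) {
  char str[32]; // Large enough for any 64-bit decimal value plus sign and NUL.
  int written =
      std::snprintf(str, sizeof(str), "%lld", static_cast<long long>(val));
  // snprintf returns the length it wanted to write; check it fit.
  assert(written > 0 && static_cast<size_t>(written) < sizeof(str));
  (void)written; // Avoid an unused-variable warning when NDEBUG is set.
  return std::string(str);
}
```

The size argument is the only difference at the call site, which is why this class of warning is usually cheap to fix.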

* fix copy constructor warning

copied solution from InstrumentONNXPass.cpp

fixes compiler warning in Linux CI pipelines:
```
.../src/Transform/ONNX/ConvOpt.cpp: In copy constructor '{anonymous}::ConvOptONNXToONNXPass::ConvOptONNXToONNXPass(const {anonymous}::ConvOptONNXToONNXPass&)':
.../src/Transform/ONNX/ConvOpt.cpp:211:3: warning: base class 'class mlir::PassWrapper<{anonymous}::ConvOptONNXToONNXPass, mlir::OperationPass<mlir::func::FuncOp> >' should be explicitly initialized in the copy constructor [-Wextra]
   ConvOptONNXToONNXPass(const ConvOptONNXToONNXPass &pass) {}
```

Signed-off-by: Soren Lassen <[email protected]>
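The `-Wextra` warning above fires when a user-written copy constructor does not name its base class in the initializer list. A minimal sketch of the pattern, using hypothetical `PassBase` and `MyPass` types rather than the real `mlir::PassWrapper` hierarchy:

```cpp
#include <cassert>

// Hypothetical base class standing in for a pass wrapper.
struct PassBase {
  int statistics = 0;
  PassBase() = default;
  PassBase(const PassBase &) = default;
};

struct MyPass : PassBase {
  int option = 1;
  MyPass() = default;
  // Naming the base explicitly in the initializer list silences the warning.
  // Here the base is default-initialized, so the copy starts with fresh
  // base-class state while still copying the derived option.
  MyPass(const MyPass &other) : PassBase(), option(other.option) {}
};

// Hypothetical helper showing what the copy preserves.
MyPass clonePass(const MyPass &p) {
  MyPass copy(p);
  assert(copy.option == p.option);
  return copy;
}
```

Whether the base should be default-initialized or copy-initialized depends on the class; the warning only asks that the choice be written out.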

Signed-off-by: Soren Lassen <[email protected]>
Co-authored-by: chentong319 <[email protected]>
Signed-off-by: Megan Hampton <[email protected]>

* cleanup some include files (#1733)

Signed-off-by: Soren Lassen <[email protected]>

Signed-off-by: Soren Lassen <[email protected]>
Co-authored-by: chentong319 <[email protected]>
Signed-off-by: Megan Hampton <[email protected]>

* Fix remaining warnings on Windows and turn on warning-as-error (#1706)

This change applies the remaining fixes needed for warnings to be treated as errors on Windows and then turns them on on Windows only. Of particular interest are a couple of changes:

* The `check-docs` target needs an update to properly parse the directive even when it contains parentheses in the path. For example: `C:/Program Files (x86)/Microsoft Visual Studio/Shared/Python39_64/python.exe`
* The Windows CI build now builds `onnx` first and then builds all of the targets that are enabled in onnx-mlir. This will make sure that no targets get missed
* The remaining warnings (4927) in `ConstProp.cpp` are fixed as well.

Signed-off-by: Stella Stamenova <[email protected]>
Signed-off-by: Megan Hampton <[email protected]>

* Fix syntax

Signed-off-by: Megan Hampton <[email protected]>

* Simplify shape-related operations (#1695)

* Simplify shape-related operations

Signed-off-by: Tung D. Le <[email protected]>
Signed-off-by: Megan Hampton <[email protected]>

* run-onnx-lib script and docs fixes (#1705)

* update documentation for build-run-onnx-lib.sh
* clarified Mac issue with running statically linked version in the directory where model library was built
* moved usage string up to the top of RunONNXLib.cpp, so it's visible next to the top of file block comment,
* allow llvm-project location to be set with LLVM_PROJECT, otherwise read from $MLIR_DIR if defined

Signed-off-by: Soren Lassen <[email protected]>
Co-authored-by: Alexandre Eichenberger <[email protected]>
Signed-off-by: Megan Hampton <[email protected]>

* remove initializedTensors map from ONNX parser (#1739)

instead insert a constant for each initializer in the frontend_symbols_
symbol map

this is better for several reasons:

1. initializedTensors were incorrectly visible to function bodies (in
   TryImportFunctionCallNode)

2. nested bindings didn't hide initializedTensors within their scope

3. it's more efficient to create a constant for each initializer once
   rather than every time an initializer is accessed

4. it's simpler to look up a single symbol mapping

Signed-off-by: Soren Lassen <[email protected]>

Signed-off-by: Soren Lassen <[email protected]>
Co-authored-by: chentong319 <[email protected]>
Signed-off-by: Megan Hampton <[email protected]>

* Add decompositions for v11 split, squeeze, and unsqueeze (#1702)

* Add decompositions for v11 split, squeeze, and unsqueeze

Co-authored-by: Roberto DiCecco <[email protected]>
Signed-off-by: Philip Lassen <[email protected]>

* Fix lit test

Signed-off-by: Philip Lassen <[email protected]>

* Add lit tests

Signed-off-by: Philip Lassen <[email protected]>

* Delete unnecessary decomp for unsqueeze

Signed-off-by: Philip Lassen <[email protected]>

Signed-off-by: Philip Lassen <[email protected]>
Co-authored-by: Roberto DiCecco <[email protected]>
Signed-off-by: Megan Hampton <[email protected]>

* factored input shaping into ModelInputShaper (#1740)

* factored input shaping into ModelInputShaper

this makes it easier to read and understand the input shaping
logic and also makes the rest of the FrontendDialectTransformer
implementation shorter and easier to read

Signed-off-by: Soren Lassen <[email protected]>

* documented public methods

Signed-off-by: Soren Lassen <[email protected]>

Signed-off-by: Soren Lassen <[email protected]>
Co-authored-by: Tung D. Le <[email protected]>
Co-authored-by: Philip Lassen <[email protected]>
Signed-off-by: Megan Hampton <[email protected]>

* Ingest all of Opset 16  (#1704)

* Ingest all of opset 16

Signed-off-by: Philip Lassen <[email protected]>

* Delete GridSample test which is no longer relevant

Signed-off-by: Philip Lassen <[email protected]>

* Use NullStringAttr instead of "none" string

Signed-off-by: Philip Lassen <[email protected]>

Signed-off-by: Philip Lassen <[email protected]>
Co-authored-by: chentong319 <[email protected]>
Signed-off-by: Megan Hampton <[email protected]>

* Fix onnx-mlir.py docker wrapper (#1734)

* Requiring input files to have known suffix (.onnx, .json, or .mlir) greatly
simplifies the onnx-mlir.py docker wrapper.

Signed-off-by: Gong Su <[email protected]>

* Return exit code

Signed-off-by: Gong Su <[email protected]>

* Add copyright notice

Signed-off-by: Gong Su <[email protected]>
Signed-off-by: Megan Hampton <[email protected]>

* Update protobuf (#1747)

* Update to protobuf 3.20.2

Signed-off-by: Charles Volzka <[email protected]>

* Try 3.18.3

Signed-off-by: Charles Volzka <[email protected]>

Signed-off-by: Charles Volzka <[email protected]>
Signed-off-by: Megan Hampton <[email protected]>

* Add the product version and fix formatting

Signed-off-by: Megan Hampton <[email protected]>

* Make updates to product version

Signed-off-by: Megan Hampton <[email protected]>

* further updates

Signed-off-by: Megan Hampton <[email protected]>

* Add product version text file

Signed-off-by: Megan Hampton <[email protected]>

* Removed the memory leaks directly in execution session (#1746)

* removed the memory leaks directly in execution session
Signed-off-by: Alexandre Eichenberger <[email protected]>
Co-authored-by: chentong319 <[email protected]>
Signed-off-by: Megan Hampton <[email protected]>

* Add support for ArgMin (#1737)

* add support for ArgMin

Signed-off-by: Hengyu Meng <[email protected]>
Signed-off-by: Megan Hampton <[email protected]>

* Inclusive terminology update (#1750)

* Update `master` reference to `main` in onnx-mlir doc
* Update rapidcheck to newer commit with inclusive terminology changes
* Update benchmark to 1.6.2 with inclusive terminology changes
* Update pybind11 to 2.10 with inclusive terminology changes

Signed-off-by: Megan Hampton <[email protected]>

* Warning and instructions that --onnx-op-stats needs more than EmitONNXIR (#1719)

Signed-off-by: Alexandre Eichenberger <[email protected]>
Co-authored-by: chentong319 <[email protected]>
Signed-off-by: Megan Hampton <[email protected]>

* Move product version text file

Signed-off-by: Megan Hampton <[email protected]>

* Remove comment

Signed-off-by: Megan Hampton <[email protected]>

* Output the product version for the compiler/version

Signed-off-by: Megan Hampton <[email protected]>

* Cleaned up commit of the InferTypes change. (#1753)

Signed-off-by: Brad Messer <[email protected]>

Co-authored-by: Soren Lassen <[email protected]>
Signed-off-by: Megan Hampton <[email protected]>

* setup.py install is deprecated, use pip install instead (#1752)

* setup.py install is deprecated, use pip install instead (see https://blog.ganssle.io/articles/2021/10/setup-py-deprecated.html for more details).

Signed-off-by: Gong Su <[email protected]>

* - pip install doesn't work for protobuf, revert back to setup.py install
- use --cpp_implementation for better performance

Signed-off-by: Gong Su <[email protected]>

* Update Windows CI

Signed-off-by: Gong Su <[email protected]>
Signed-off-by: Megan Hampton <[email protected]>

* adding reference to onnx-mlir-serving (#1745)

Signed-off-by: Alexandre Eichenberger <[email protected]>
Co-authored-by: chentong319 <[email protected]>
Signed-off-by: Megan Hampton <[email protected]>

* [TOSA] Update type converter and unary ops (#1553)

Signed-off-by: Philipp Braun <[email protected]>
Co-authored-by: Alexandre Eichenberger <[email protected]>
Signed-off-by: Megan Hampton <[email protected]>

* Fix syntax

Signed-off-by: Megan Hampton <[email protected]>

* Remove strip

Signed-off-by: Megan Hampton <[email protected]>

* Revert "Cleaned up commit of the InferTypes change. (#1753)"

This reverts commit d08c5ac.

Signed-off-by: Megan Hampton <[email protected]>

* Cleaned up commit of the InferTypes change. (#1753)

Signed-off-by: Brad Messer <[email protected]>

Co-authored-by: Soren Lassen <[email protected]>
Signed-off-by: Megan Hampton <[email protected]>

* Fix for GatherND verifier (#1754)

Co-authored-by: Hadi Jooybar <[email protected]>
Signed-off-by: Philip Lassen <[email protected]>
Signed-off-by: Megan Hampton <[email protected]>

* Fix OnnxBuilder::concat where axis was not used (#1759)

Signed-off-by: Tung D. Le <[email protected]>
Signed-off-by: Megan Hampton <[email protected]>

* Simplify GatherOp when its inputs are dimensions (#1755)

Signed-off-by: Tung D. Le <[email protected]>
Signed-off-by: Megan Hampton <[email protected]>

* Assign each question mark a unique negative integer value (#1757)

* Assign each question mark a unique negative integer value

Signed-off-by: Tung D. Le <[email protected]>

* Add a mutex to protect the counter

Signed-off-by: Tung D. Le <[email protected]>

* Using decrement

Signed-off-by: Tung D. Le <[email protected]>

Signed-off-by: Tung D. Le <[email protected]>
Signed-off-by: Megan Hampton <[email protected]>
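The three steps in this entry (a unique negative value per unknown dimension, a mutex around the counter, and a decrement) can be sketched as follows. The function name `nextUnknownDimId` is a hypothetical stand-in, not the identifier used in onnx-mlir.

```cpp
#include <cassert>
#include <cstdint>
#include <mutex>

// Hypothetical sketch: hand out a distinct negative id for each unknown
// ("?") dimension. The mutex makes the decrement safe across threads.
int64_t nextUnknownDimId() {
  static std::mutex counterMutex;
  static int64_t counter = 0;
  std::lock_guard<std::mutex> lock(counterMutex);
  int64_t id = --counter; // First call yields -1, then -2, and so on.
  assert(id < 0 && "ids must stay negative");
  return id;
}
```

Because every id is negative, it cannot be confused with a real dimension size, and because every id is distinct, two unknown dimensions are not treated as equal by accident.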

* Change the regex

Signed-off-by: Megan Hampton <[email protected]>

* Fix regex and display product numbers

Signed-off-by: Megan Hampton <[email protected]>

* normalize axis in ScatterElements verify (#1760)

Signed-off-by: Philip Lassen <[email protected]>

Signed-off-by: Philip Lassen <[email protected]>
Co-authored-by: Alexandre Eichenberger <[email protected]>
Signed-off-by: Megan Hampton <[email protected]>

* redo ArrayRefOrSmallVector with enable_if (#1758)

closes #1729

* redo ArrayRefOrSmallVector with enable_if

attempt to address issue #1729: Build error on Linux

Signed-off-by: Soren Lassen <[email protected]>

* added @qedawkins fix: begin,end -> data,size

Signed-off-by: Soren Lassen <[email protected]>

Signed-off-by: Soren Lassen <[email protected]>
Co-authored-by: Quinn Dawkins [email protected]
Co-authored-by: Philip Lassen <[email protected]>
Signed-off-by: Megan Hampton <[email protected]>

* fix syntax

Signed-off-by: Megan Hampton <[email protected]>

* Handle scalar tensor tensor<type> in SimplifyShapeRelatesOps pass (#1764)

Signed-off-by: Tung D. Le <[email protected]>

Signed-off-by: Tung D. Le <[email protected]>
Signed-off-by: Megan Hampton <[email protected]>

* namespace cleanup (#1763)

* namespace cleanup

removed `using namespace mlir` from ConstPropHelper.hpp and added
requisite `mlir::` and `llvm::` in header files to get everything
to compile again

also put ConstPropHelper definitions in namespace onnx_mlir

Signed-off-by: Soren Lassen <[email protected]>

* added more missing `mlir/llvm::`

Signed-off-by: Soren Lassen <[email protected]>

Signed-off-by: Soren Lassen <[email protected]>
Co-authored-by: Tung D. Le <[email protected]>
Signed-off-by: Megan Hampton <[email protected]>

* Fix Clang format error

Signed-off-by: Megan Hampton <[email protected]>

* Make use of the product vendor flag

Signed-off-by: Megan Hampton <[email protected]>

* Address feedback

Signed-off-by: Megan Hampton <[email protected]>

Signed-off-by: Megan Hampton <[email protected]>
Signed-off-by: Soren Lassen <[email protected]>
Signed-off-by: Stella Stamenova <[email protected]>
Signed-off-by: Tung D. Le <[email protected]>
Signed-off-by: Philip Lassen <[email protected]>
Signed-off-by: Gong Su <[email protected]>
Signed-off-by: Charles Volzka <[email protected]>
Signed-off-by: Hengyu Meng <[email protected]>
Signed-off-by: Alexandre Eichenberger <[email protected]>
Signed-off-by: Philipp Braun <[email protected]>
Co-authored-by: Megan Hampton <[email protected]>
Co-authored-by: Soren Lassen <[email protected]>
Co-authored-by: Charles Volzka <[email protected]>
Co-authored-by: Yasushi Negishi <[email protected]>
Co-authored-by: chentong319 <[email protected]>
Co-authored-by: Stella Stamenova <[email protected]>
Co-authored-by: Tung D. Le <[email protected]>
Co-authored-by: Alexandre Eichenberger <[email protected]>
Co-authored-by: Philip Lassen <[email protected]>
Co-authored-by: Roberto DiCecco <[email protected]>
Co-authored-by: gongsu832 <[email protected]>
Co-authored-by: Meng, Hengyu <[email protected]>
Co-authored-by: Brad Messer <[email protected]>
Co-authored-by: Philipp Braun <[email protected]>
Co-authored-by: Hadi Jooybar <[email protected]>

* different initial

Signed-off-by: chentong319 <[email protected]>

Signed-off-by: chentong319 <[email protected]>
Signed-off-by: Megan Hampton <[email protected]>
Signed-off-by: Soren Lassen <[email protected]>
Signed-off-by: Stella Stamenova <[email protected]>
Signed-off-by: Tung D. Le <[email protected]>
Signed-off-by: Philip Lassen <[email protected]>
Signed-off-by: Gong Su <[email protected]>
Signed-off-by: Charles Volzka <[email protected]>
Signed-off-by: Hengyu Meng <[email protected]>
Signed-off-by: Alexandre Eichenberger <[email protected]>
Signed-off-by: Philipp Braun <[email protected]>
Co-authored-by: hamptonm1 <[email protected]>
Co-authored-by: Megan Hampton <[email protected]>
Co-authored-by: Soren Lassen <[email protected]>
Co-authored-by: Charles Volzka <[email protected]>
Co-authored-by: Yasushi Negishi <[email protected]>
Co-authored-by: Stella Stamenova <[email protected]>
Co-authored-by: Tung D. Le <[email protected]>
Co-authored-by: Alexandre Eichenberger <[email protected]>
Co-authored-by: Philip Lassen <[email protected]>
Co-authored-by: Roberto DiCecco <[email protected]>
Co-authored-by: gongsu832 <[email protected]>
Co-authored-by: Meng, Hengyu <[email protected]>
Co-authored-by: Brad Messer <[email protected]>
Co-authored-by: Philipp Braun <[email protected]>
Co-authored-by: Hadi Jooybar <[email protected]>
16 people authored Oct 13, 2022
1 parent b80c880 commit adf349e
Showing 4 changed files with 256 additions and 15 deletions.
24 changes: 19 additions & 5 deletions src/Conversion/ONNXToKrnl/Tensor/Concat.cpp
@@ -40,6 +40,7 @@ struct ONNXConcatOpLowering : public ConversionPattern {
assert(succeeded(shapecomputed) && "Could not compute output shape");

auto axis = concatOp.axis();
assert(axis >= 0 && "negative axis is supposed to have been normalized");
unsigned int inputNum = operands.size();

// Convert the output type to MemRefType.
@@ -57,16 +58,28 @@ struct ONNXConcatOpLowering : public ConversionPattern {
MultiDialectBuilder<KrnlBuilder> create(rewriter, loc);

// Creates loops, one for each input.
// Since each input should have the same size in every dimension (except
// axis), we try to make the loop upper bounds the same for further
// optimization. Differences may come from constant vs. dynamic sizes, or
// from dynamic dims of different inputs.
KrnlBuilder createKrnl(rewriter, loc);
SmallVector<IndexExpr, 4> commonUB(shapeHelper.dimsForOutput());
// IndexExprScope IEScope(&rewriter, loc);
IndexExpr accumulatedOffset = LiteralIndexExpr(0);
for (unsigned int i = 0; i < inputNum; ++i) {
// Since the accumulatedOffsetValue will be used in a nested IndexExprScope,
// we get the Value of this IndexExpr and pass it as a symbol
Value accumulatedOffsetValue = accumulatedOffset.getValue();
OpBuilder::InsertionGuard insertGuard(rewriter);
// Create loop.
ValueRange loopDef = createKrnl.defineLoops(rank);
SmallVector<IndexExpr, 4> lbs(rank, LiteralIndexExpr(0));
MemRefBoundsIndexCapture bounds(operands[i]);
SmallVector<IndexExpr, 4> ubs;
bounds.getDimList(ubs);
createKrnl.iterateIE(loopDef, loopDef, lbs, ubs,
// For each input, only the dimension 'axis' is different
commonUB[axis] = ubs[axis];
createKrnl.iterateIE(loopDef, loopDef, lbs, commonUB,
[&](KrnlBuilder &createKrnl, ValueRange loopInd) {
// Indices for the read and write.
SmallVector<Value, 4> readIndices, writeIndices;
@@ -76,17 +89,18 @@ struct ONNXConcatOpLowering : public ConversionPattern {
else {
IndexExprScope IEScope(&rewriter, loc);
IndexExpr writeOffset = DimIndexExpr(loopInd[r]);
for (unsigned int j = 0; j < i; j++) {
MemRefBoundsIndexCapture operandJBounds(operands[j]);
writeOffset = writeOffset + operandJBounds.getDim(r);
}
IndexExpr accumulatedOffsetIE =
SymbolIndexExpr(accumulatedOffsetValue);
writeOffset = writeOffset + accumulatedOffsetIE;
writeIndices.emplace_back(writeOffset.getValue());
}
}
// Insert copy.
Value loadData = createKrnl.load(operands[i], loopInd);
createKrnl.store(loadData, alloc, writeIndices);
});
MemRefBoundsIndexCapture operandJBounds(operands[i]);
accumulatedOffset = accumulatedOffset + operandJBounds.getDim(axis);
}
rewriter.replaceOp(op, alloc);
return success();
36 changes: 26 additions & 10 deletions src/Dialect/ONNX/ShapeInference/Concat.cpp
@@ -37,19 +37,35 @@ LogicalResult ONNXConcatOpShapeHelper::computeShape(
if (axisIndex < 0)
axisIndex += commonRank;

IndexExpr cumulativeAxisSize = LiteralIndexExpr(0);
for (unsigned i = 0; i < numInputs; ++i) {
// For Concat Op, the size of each dimension of inputs should be the same,
// except for concatenated dimension. To simplify the result, constant
// size is used if there is one. Otherwise, the dimension of the first
// input tensor (implementation dependent) is used for the output tensor.
DimsExpr outputDims(commonRank);
MemRefBoundsIndexCapture firstInputBounds(operandAdaptor.inputs()[0]);
for (unsigned dim = 0; dim < commonRank; dim++) {
outputDims[dim] = firstInputBounds.getDim(dim);
}
IndexExpr cumulativeAxisSize =
DimIndexExpr(firstInputBounds.getDim(axisIndex));

// Handle the remaining inputs
for (unsigned i = 1; i < numInputs; ++i) {
Value currentInput = operandAdaptor.inputs()[i];
MemRefBoundsIndexCapture currInputBounds(currentInput);
DimIndexExpr currentSize(currInputBounds.getDim(axisIndex));
cumulativeAxisSize = cumulativeAxisSize + currentSize;
for (unsigned dim = 0; dim < commonRank; dim++) {
if (dim == axisIndex) {
DimIndexExpr currentSize(currInputBounds.getDim(axisIndex));
cumulativeAxisSize = cumulativeAxisSize + currentSize;
} else {
if (currInputBounds.getDim(dim).isLiteral()) {
// The size of current dimension of current input is a constant
outputDims[dim] = currInputBounds.getDim(dim);
}
}
}
}

DimsExpr outputDims(commonRank);
MemRefBoundsIndexCapture firstInputBounds(firstInput);
for (unsigned i = 0; i < commonRank; i++)
outputDims[i] =
(i == axisIndex) ? cumulativeAxisSize : firstInputBounds.getDim(i);
outputDims[axisIndex] = cumulativeAxisSize;

setOutputDims(outputDims);
return success();
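The shape helper above starts from the first input's dimensions, prefers a constant size from any later input over a dynamic one, and sums the sizes along the concatenation axis. A minimal sketch of that merge rule follows, with `std::optional` standing in for a dimension that is either constant or dynamic. The `Dim` and `Shape` aliases and `inferConcatShape` are hypothetical names. One simplification to note: the real helper keeps a dynamic axis sum as a symbolic IndexExpr, whereas this sketch just marks it unknown.

```cpp
#include <cassert>
#include <cstdint>
#include <optional>
#include <vector>

// A dimension is either a known constant or dynamic (std::nullopt).
using Dim = std::optional<int64_t>;
using Shape = std::vector<Dim>;

Shape inferConcatShape(const std::vector<Shape> &inputs, int64_t axis) {
  assert(!inputs.empty() && "need at least one input");
  const int64_t rank = static_cast<int64_t>(inputs[0].size());
  if (axis < 0)
    axis += rank; // Normalize a negative axis.
  assert(axis >= 0 && axis < rank && "axis out of range");

  // Start from the first input's dimensions.
  Shape out = inputs[0];
  Dim axisSize = inputs[0][axis];

  // Handle the remaining inputs.
  for (size_t i = 1; i < inputs.size(); ++i) {
    for (int64_t d = 0; d < rank; ++d) {
      const Dim &cur = inputs[i][d];
      if (d == axis) {
        // The sum is a known constant only if every term is known.
        if (axisSize && cur)
          axisSize = *axisSize + *cur;
        else
          axisSize = std::nullopt;
      } else if (cur) {
        // Prefer a constant size over a dynamic one.
        out[d] = cur;
      }
    }
  }
  out[axis] = axisSize;
  return out;
}
```

For inputs of shape `?x3` and `4x5` concatenated along axis 1, the non-axis dimension resolves to the constant 4 and the axis dimension to 8.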