This is a forked version of Tensorflow, with modified XLA profiling estimated costs of each HLO nodes. TODO create profile for batch normalization