Add n-dimensional transpose to core

* Add performance test, write sequentially, and address review comments