v0.15.0
Summary
Release 0.15.0 is a milestone: the dpctl.tensor.usm_ndarray object now implements all special Python operators except __matmul__ and __rmatmul__.
The array-API conformance test suite pass rate for dpctl.tensor rises to 81.8% (passed: 916, failed: 84, skipped: 119).
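As a rough illustration of the operator support described above, the sketch below exercises one operator from each family (arithmetic, reflected, comparison, bitwise, in-place). It assumes dpctl and a SYCL device are available; the fallback to NumPy is an assumption added here so the snippet can be read and run without dpctl, and is not part of the release.

```python
import numpy as np

try:
    import dpctl.tensor as xp   # real target; needs dpctl and a SYCL device
    to_host = xp.asnumpy        # copies a usm_ndarray into a NumPy array
except ImportError:
    xp = np                     # stand-in (assumption) with matching operator semantics
    to_host = np.asarray

x = xp.asarray([1, 2, 3, 4])
y = xp.asarray([4, 3, 2, 1])

total = x + y        # arithmetic: __add__
reflected = 10 - x   # reflected arithmetic with a Python scalar: __rsub__
less = x < y         # comparison: __lt__
masked = x & y       # bitwise: __and__
x += y               # in-place arithmetic: __iadd__

for name, arr in [("x + y", total), ("10 - x", reflected),
                  ("x < y", less), ("x & y", masked), ("x += y", x)]:
    print(name, to_host(arr))
```

Matrix multiplication with the @ operator is the one case the summary lists as not yet implemented, so it is left out of the sketch.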
Details
Added
- Added dpctl.tensor.floor, dpctl.tensor.ceil, dpctl.tensor.trunc elementwise functions.
- Added dpctl.tensor.hypot, dpctl.tensor.logaddexp elementwise functions.
- Added trigonometric (dpctl.tensor.sin, dpctl.tensor.cos, dpctl.tensor.tan) and hyperbolic (dpctl.tensor.sinh, dpctl.tensor.cosh, dpctl.tensor.tanh) elementwise functions and their inverses (dpctl.tensor.asin, dpctl.tensor.asinh, dpctl.tensor.acos, dpctl.tensor.acosh, dpctl.tensor.atan, dpctl.tensor.atanh).
- Added dpctl.tensor.round function.
- Added dpctl.tensor.sign and dpctl.tensor.remainder elementwise functions.
- Added bitwise elementwise functions dpctl.tensor.bitwise_and, dpctl.tensor.bitwise_xor, dpctl.tensor.bitwise_or, dpctl.tensor.bitwise_invert.
- Added bitwise shift functions dpctl.tensor.bitwise_left_shift and dpctl.tensor.bitwise_right_shift.
- Added dpctl.tensor.atan2 and dpctl.tensor.signbit elementwise functions.
- Added dpctl.tensor.minimum and dpctl.tensor.maximum binary elementwise functions.
- Added support for equality checking and hashing of dpctl.SyclPlatform.
- Implemented types property for all unary and binary elementwise functions #1361.
- Added dpctl.tensor.repeat and dpctl.tensor.tile functions.
- Added dpctl.tensor.matrix_transpose function.
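A minimal sketch of a few of the functions listed above, assuming dpctl and a SYCL device are available. The NumPy fallback is an assumption made here for readers without dpctl; the functions used exist under the same names in both namespaces. The contents of the types property depend on the device and are printed without a claimed result.

```python
import numpy as np

try:
    import dpctl.tensor as xp   # real target; needs dpctl and a SYCL device
    to_host = xp.asnumpy        # copies a usm_ndarray into a NumPy array
except ImportError:
    xp = np                     # stand-in (assumption) exposing the same function names
    to_host = np.asarray

x = xp.asarray([-1.5, 0.5, 2.5])
floor_x = xp.floor(x)   # round toward negative infinity
ceil_x = xp.ceil(x)     # round toward positive infinity
trunc_x = xp.trunc(x)   # round toward zero

a = xp.asarray([3.0, 5.0])
b = xp.asarray([4.0, 12.0])
hyp = xp.hypot(a, b)       # sqrt(a**2 + b**2), elementwise
lo = xp.minimum(a, b)      # elementwise minimum
hi = xp.maximum(a, b)      # elementwise maximum

ints = xp.asarray([1, 2])
rep = xp.repeat(ints, 2)      # repeat each element twice
til = xp.tile(ints, (2,))     # repeat the whole array twice

# Supported type signatures of an elementwise function (device-dependent)
print(xp.hypot.types)

for name, arr in [("floor", floor_x), ("ceil", ceil_x), ("trunc", trunc_x),
                  ("hypot", hyp), ("repeat", rep), ("tile", til)]:
    print(name, to_host(arr))
```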
Changed
- Enabled support for Python arithmetic, in-place arithmetic, reflexive arithmetic, comparison, and bitwise operators for dpctl.tensor.usm_ndarray type #1324.
- Removed obsolete dpctl.tensor.numpy_usm_shared class and associated tests, which were being skipped #1310.
- Transitioned dpctl codebase to Cython 3.
- Improved performance of boolean reduction functions dpctl.tensor.all and dpctl.tensor.any.
- Improved performance of summation function dpctl.tensor.sum.
- Improved in-place arithmetic operations for addition, subtraction and multiplication.
- Updated codebase per SYCL-2020 intel/llvm compiler deprecation warnings.
- Improved performance of advanced boolean indexing for arrays whose size fits in 32-bit signed integer type.
- Removed deprecated DPCTLDevice_GetMaxWorkItemSizes function from the SyclInterface library.
- Improved performance of dpctl.tensor.reshape in the case when a copy is being made.
- Improved performance of dpctl.tensor.roll function.