v0.14.0
·
3185 commits
to master
since this release
[0.14.0] - 11/18/2022
Added
- Implemented
dpctl.tensor.linspacefunction from array-API #875. - Implemented
dpctl.tensor.eyefunction from array-API #896. - Implemented
dpctl.tensor.trilanddpctl.tensor.triufunctions from array-API #910. - Added data type objects to
dpctl.tensornamespace,finfo,iinfo,can_cast, andresult_typefunctions #913. - Implemented
dpctl.tensor.meshgridcreation function from array-API #920. - Implemented convenience class to represent output of
dpctl.tensor.usm_ndarray.flagsproperty #921. - Added new device attributes and kernel's device-specific attributes #894.
- Added
dpctl.utils.onetrace_enabledcontext manager for targeted trace collection #903. - Added support for
streamkeyword in__dlpack__method, enabling support for sendingusm_ndarrayusing mpi4py #906. dpctl.tensor.asarraycan now transition data between incompatible devices, #951.- Introduced
"syclinterface/dpctl_sycl_types_casters.hpp"header file with declaration of conversion routines between SYCL type pointers and SyclInterface library opaque pointers #960. - Added C-API to
dpctl.program.SyclKernelanddpctl.program.SyclProgram. Added type casters for new types to "dpctl4pybind11" and added an example demonstrating its use #970. - Introduced "dpctl/sycl.pxd" Cython declaration file to streamline use of SYCL functions from Cython, and added an example demonstrating its use #981.
- Added experimental support for sharing data allocated on sub-devices via dlpack #984.
- Added
dpctl.SyclDevice.sub_group_sizesproperty to retrieve supported sizes of sub-group by the device #985.
Changed
- Improved queue compatibility testing in
dpctl.tensor's implementation module #900. - Added automatic measurement of array-API conformance test suite in CI #901.
- Improved performance of array metadata transfer from host to device #912.
- Used
os.add_dll_directoryon Windows to ensure thatDPCTLSyclInterfacelibrary can be found #918. - Refactored
dpctl.tensor's implementation module #941 to streamline adding new functionality. Streamlineddpctl::tensor::usm_ndarrayclass implementation. - Added debugging messaging in case when
DPCTLDynamicLib::getSymbolencounters errors #956. - Updated code base according to changes in DPC++ compiler #952, #957, #958.
- Changed
dpctlto use pybind11 2.10.1 #967. - Extended
dpctl.tensor.fullto accept 0d and higher dimensional arrays for fill-value parameter #982 and #995.
Fixed
- Improved SyclDevice constructor error message #893.
- Fixed issue gh-890 about
dpctl.tensor.reshapefunction #915. - Fixed unexpected
UnboundLocalErrorexception in #922. - Fixed bugs in
dpctl.tensor.arangein #945. - Fixed issue with type inferencing in
dpctl.tensor.asarrayin #949. - Added missing docstrings for
dpctl.SyclDeviceproperties #964.