adityalj/Tensile

 
 

Tensile is a tool for creating benchmark-driven backend libraries for GEMMs, GEMM-like problems (such as batched GEMM), and general N-dimensional tensor contractions on a GPU. The Tensile library is mainly used as a backend library for rocBLAS. Tensile acts as the performance backbone for a wide variety of compute applications running on AMD GPUs.
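
For readers unfamiliar with the problem classes named above, the following is a minimal pure-Python reference sketch of what a GEMM, a batched GEMM, and a simple tensor contraction compute. It is illustrative only: the function names (`gemm`, `batched_gemm`, `contract`) are hypothetical, run on the CPU, and are not part of Tensile's or rocBLAS's API. The contraction shown is just one example index pattern, chosen as an assumption for illustration.

```python
# Reference definitions of the problem classes, using nested lists.
# Hypothetical helper names; NOT Tensile's API and not GPU code.

def gemm(alpha, a, b, beta, c):
    """Return alpha * (a @ b) + beta * c for matrices stored as nested lists."""
    m, k, n = len(a), len(b), len(b[0])
    return [
        [alpha * sum(a[i][p] * b[p][j] for p in range(k)) + beta * c[i][j]
         for j in range(n)]
        for i in range(m)
    ]


def batched_gemm(alpha, a_batch, b_batch, beta, c_batch):
    """Apply gemm independently to each (a, b, c) triple in the batch."""
    return [
        gemm(alpha, a, b, beta, c)
        for a, b, c in zip(a_batch, b_batch, c_batch)
    ]


def contract(a, b):
    """Example contraction: C[i][j] = sum over k, l of A[i][k][l] * B[k][l][j]."""
    n_i, n_k, n_l, n_j = len(a), len(b), len(b[0]), len(b[0][0])
    return [
        [sum(a[i][k][l] * b[k][l][j] for k in range(n_k) for l in range(n_l))
         for j in range(n_j)]
        for i in range(n_i)
    ]


if __name__ == "__main__":
    a = [[1, 2], [3, 4]]
    b = [[5, 6], [7, 8]]
    c = [[1, 1], [1, 1]]
    print(gemm(1, a, b, 0, c))
    print(batched_gemm(1, [a, a], [b, b], 0, [c, c]))
```

A GEMM is the special case of a contraction with one summed index, and a batched GEMM adds a free index shared by all three operands, which is why a single tool can target all three problem classes.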

See Tensile Wiki for documentation.

About

Stretching GPU performance for GEMMs and tensor contractions.


Languages

  • Python 50.9%
  • C++ 29.8%
  • Assembly 15.1%
  • TeX 1.4%
  • CMake 1.1%
  • Shell 1.1%
  • Other 0.6%