Hidet is an open-source deep learning compiler, written in Python. It supports end-to-end compilation of DNN models from PyTorch and ONNX to efficient CUDA kernels. A series of graph-level and ...
Abstract: Cross-compiler development is the process of translating code written in one programming language into another so that it may be compatible with another ...
A small C wrapper around the parts of the LLVM C++ API we need that are not already exposed by the LLVM C API. A ctypes Python wrapper around the C API. A pure Python implementation of the subset of ...
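As a rough illustration of the "ctypes Python wrapper around the C API" layer described above, the sketch below wraps a single C function with `ctypes`. It is not taken from the project in question: so that it runs without any external library, it binds `Py_GetVersion` from the CPython C API through `ctypes.pythonapi`, and the helper name `c_api_version` is hypothetical.

```python
import ctypes
import sys

# ctypes.pythonapi exposes the C API of the running CPython interpreter,
# so no external shared library has to be located (an assumption made
# purely to keep this sketch self-contained).
_py_get_version = ctypes.pythonapi.Py_GetVersion
_py_get_version.argtypes = []                 # const char *Py_GetVersion(void)
_py_get_version.restype = ctypes.c_char_p     # returned to Python as bytes


def c_api_version() -> str:
    """Hypothetical helper: interpreter version string fetched via the C API."""
    return _py_get_version().decode("ascii")


if __name__ == "__main__":
    # Show the value obtained through the C API next to sys.version.
    print(c_api_version())
    print(sys.version)
```

Declaring `argtypes` and `restype` explicitly is the main design point: without them `ctypes` assumes an `int` return value, which would not give back the C string here. A wrapper around a real C library would follow the same pattern after loading the library with `ctypes.CDLL`.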
Abstract: HPC systems are increasingly complex, with many tunable parameters that affect application metrics such as performance and energy consumption. The main challenges of these systems are finding the ...