4Hardware·5h ago
Hawk: Harnessing Hardware-Aware Knowledge for High-Performance NPU Kernel Generation
Researchers have introduced Hawk, a framework designed to automate the generation of high-performance kernels for Neural Processing Units. By integrating hardware-specific constraints into the development process, the system aims to reduce the manual labor currently required to optimize code for complex memory hierarchies.
Covered by 1 source
- AarXiv CS.AI↗Junyi Wen, Ruiyan Zhuang, Yongjia Xu, Pengtu Li, Rui Zou, Hongyi Chen, Chingman Wan, Puxu Yang, Wuhui Chen, Yanlin Wang5h ago