王兴晟

个人信息Personal Information

教授   博士生导师   硕士生导师  

性别:男

在职信息:在职

所在单位:集成电路学院

学历:研究生(博士)毕业

学位:工学博士学位

毕业院校:格拉斯哥大学

学科:微电子学与固体电子学

论文成果

当前位置: Chinese homepage >> 科学研究 >> 论文成果

ISARA: An Island-style Systolic Array Reconfigurable Accelerator based on Memristors for Deep Neural Networks

点击次数:

论文类型:期刊论文

第一作者:阳帆

通讯作者:王兴晟

合写作者:李楠,王乐天,江品锋,缪向水

发表刊物:IEEE Transactions On Very Large Scale Integration Systems

收录刊物:SCI

所属单位:华中科技大学

刊物所在地:美国

DOI码:10.1109/TVLSI.2024.3521394

发表时间:2025-02-18

摘要:The demand for edge artificial intelligence (AI) is significant, particularly in revolutionary technological areas such as the Internet of Things, autonomous driving, and industrial control. However, reliable and high-performance edge AI is still constrained by computing hardware, and improving the performance and reliability of edge AI accelerators remains a key focus for researchers. This work proposes a memristor/resistive random access memory (RRAM)-based island-style systolic array reconfigurable accelerator (ISARA) that meets the reliability and performance requirements of edge AI. Inspired by the island-style architecture of FPGAs, this work proposes a flexible-tile architecture based on RRAM processing element (PE) islands, optimizing the data flow within the systolic array. The design of network-on-chip reduces data processing latency. In addition, to enhance computational efficiency, this work incorporates a bit-fusion scheme within the flexible tile, which reduces analog-to-digital converter (ADC) power consumption and addresses the conductance variation of RRAM. To date, only a few works have completed the entire process from simulation, design, and fabrication to hardware testing. This work fully realizes the design and validation of a new accelerator based on RRAM chips, demonstrating the reliability of RRAM-based systolic array accelerators for the first time. After deploying algorithms, the hardware accelerator achieved recognition rates comparable to software. Compared to similar works, ISARA's computational efficiency exceeds theirs and has flexible reconfigurability. The same deep neural network (DNN) models are adopted for evaluation and compared to other accelerators, and ISARA's processing latency is reduced by 200 times.