You can also find my publications on my Google Scholar profile.

2025

SSDTrain: An Activation Offloading Framework to SSDs for Faster Large Language Model Training [PDF]

Kun Wu*, Jeongmin Brian Park*, Xiaofan Zhang* , Mert Hidayetoğlu, Vikram Sharma Mailthody, Sitao Huang, Steven Sam Lumetta, Wen-mei Hwu (*equal contributors)
62nd Design Automation Conference (DAC), San Francisco, CA, June 2025

2024

ShiftAddLLM: Accelerating Pretrained LLMs via Post-Training Multiplication-Less Reparameterization [PDF]

Haoran You, Yipin Guo, Yichao Fu, Wei Zhou, Huihong Shi, Xiaofan Zhang, Souvik Kundu, Amir Yazdanbakhsh, Yingyan Lin
38th Annual Conference on Neural Information Processing Systems (NeurIPS), Vancouver, Canada, Dec. 2024

New Solutions on LLM Acceleration, Optimization, and Application [PDF]

[Invited] Yingbing Huang, Jiaxin Wan, Hanchen Ye, Manvi Jha, Jinghua Wang, Yuhong Li, Xiaofan Zhang, Deming Chen
61st Design Automation Conference (DAC), San Francisco, CA, June 2024

AutoAI2C: An Automated Hardware Generator for DNN Acceleration on both FPGA and ASIC [PDF]

​Yongan Zhang, Xiaofan Zhang, Pengfei Xu, Yang Zhao, Cong Hao, Deming Chen, Yingyan Lin
IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems (TCAD), 2024

Software/Hardware Co-design for LLM and Its Application for Design Verification [PDF]

Jiaxin Wan, Yingbing Huang, Yuhong Li, Hanchen Ye, Jinghua Wang, Xiaofan Zhang, Deming Chen
29th Asia and South Pacific Design Automation Conference (ASP-DAC), Jan. 2024

HomeSGN: A Smarter Home with Novel Rule Mining Enabled by a Scorer-Generator GAN [PDF]

Zehua Yuan, Junhao Pan, Xiaofan Zhang, Deming Chen
29th Asia and South Pacific Design Automation Conference (ASP-DAC), Jan. 2024

2023

Compilation and Optimizations for Efficient Machine Learning on Embedded Systems [PDF]

Xiaofan Zhang, Yao Chen, Cong Hao, Sitao Huang, Yuhong Li, Deming Chen
Book chapter in Embedded Machine Learning for Cyber-Physical, IoT, and Edge Computing: Software Optimizations and Hardware/Software Co-design, Springer Nature

Augmenting Hessians with Inter-Layer Dependencies for Mixed-Precision Post-Training Quantization [PDF]

Clemens Schaefer, Navid Lambert-Shirzad, Xiaofan Zhang, Chiachen Chou, Tom Jablin, Jian Li, Elfie Guo, Caitlin Stanton, Siddharth Joshi, Yu Emma Wang
arXiv preprint arXiv:2306.04879, 2023

EH-DNAS: End-to-End Hardware-aware Differentiable Neural Architecture Search [PDF]

Qian Jiang, Xiaofan Zhang*, Deming Chen, Minh N. Do, Raymond A. Yeh (equal contributors)
40th International Conference on Machine Learning (ICML) Workshop on Differentiable Almost Everything, July 2023