Posts by Collection

portfolio

Portfolio item number 1

Short description of portfolio item number 1

Portfolio item number 2

Short description of portfolio item number 2

publications

A Systematic Survey of General Sparse Matrix-Matrix Multiplication

Published in ACM Computing Surveys, 2020

This paper is a systematic survey of SpGEMM.

Recommended citation: Jianhua Gao, Weixing Ji, Fangli Chang, Shiyu Han, Bingxin Wei, Zeming Liu, and Yizhuo Wang. (2023). "A Systematic Survey of General Sparse Matrix-Matrix Multiplication." ACM Computing Surveys. 55(12), 244:1-36. https://doi.org/10.1145/3571157

Fast Piecewise Polynomial Fitting of Time-Series Data for Streaming Computing

Published in IEEE Access, 2020

This paper proposes a fast piecewise Polynomial Fitting method of time-series data.

Recommended citation: Jianhua Gao, Weixing Ji, Lulu Zhang, Senhao Shao, Yizhuo Wang, Feng Shi. (2020). "Fast Piecewise Polynomial Fitting of Time-Series Data for Streaming Computing." IEEE Access. 8:43764-43775. https://ieeexplore.ieee.org/abstract/document/9016024/

Cube-Based Incremental Outlier Detection for Streaming Computing

Published in Information Sciences, 2020

This paper proposes a cube-based outlier detection method CB-ILOF.

Recommended citation: Jianhua Gao, Weixing Ji, Lulu Zhang, Anmin Li, Yizhuo Wang, Zongyu Zhang. (2020). "Cube-Based Incremental Outlier Detection for Streaming Computing." Information Sciences. 517:361-376. https://doi.org/10.1016/j.ins.2019.12.060

AMF-CSR: Adaptive Multi-Row Folding of CSR for SpMV on GPU

Published in 2021 IEEE 27th International Conference on Parallel and Distributed Systems (ICPADS), 2021

This paper proposes a new GPU-based SpMV algorithm AMF-CSR.

Recommended citation: Jianhua Gao, Weixing Ji, Senhao Shao, Yizhuo Wang, Feng Shi. (2021). "AMF-CSR: Adaptive Multi-Row Folding of CSR for SpMV on GPU." 2021 IEEE 27th International Conference on Parallel and Distributed Systems. 418-425. https://ieeexplore.ieee.org/document/9763779

Towards Optimal Fast Matrix Multiplication on CPU-GPU Platforms

Published in International Conference on Parallel and Distributed Computing: Applications and Technologies (PDCAT), 2021

This paper proposes a CPU-GPU heterogenous implementation for the Winograd algorithm.

Recommended citation: Senhao Shao, Yizhuo Wang, Weixing Ji, Jianhua Gao. (2022). "Towards Optimal Fast Matrix Multiplication on CPU-GPU Platforms." International Conference on Parallel and Distributed Computing: Applications and Technologies (PDCAT). 223–236. https://link.springer.com/chapter/10.1007/978-3-030-96772-7_21

MMSparse: 2D Partitioning of Sparse Matrix Based on Mathematical Morphology

Published in Mathematical Morphology, 2022

This paper proposes a 2D partition of sparse matrix based on Mathematical Morphology.

Recommended citation: Zhaonian Tan, Weixing Ji, Jianhua Gao, Yizhuo Wang, Feng Shi. (2020). "MMSparse: 2D Partitioning of Sparse Matrix Based on Mathematical Morphology." Future Generation Computer Systems. 108:521-532. Future Generation Computer Systems

TaiChi: A Hybrid Compression Format for Binary Sparse Matrix-Vector Multiplication on GPU

Published in IEEE Transactions on Parallel and Distributed Systems, 2022

This paper proposes a new compression format for binary sparse matrix.

Recommended citation: Jianhua Gao, Weixing Ji, Zhaonian Tan, Yizhuo Wang, Feng Shi. (2022). "TaiChi: A Hybrid Compression Format for Binary Sparse Matrix-Vector Multiplication on GPU." IEEE Transactions on Parallel and Distributed Systems. 33(12):3732-3745. https://ieeexplore.ieee.org/document/9763312

talks

基于数据分布特征的稀疏矩阵向量乘优化研究

Published: December 18, 2022

本报告所汇报的内容主要覆盖课题组两个已经发表的工作和一个正在进行的工作。两个已发表的工作分别是：“TaiChi: A Hybrid Compression Format for Binary Sparse Matrix-Vector Multiplication on GPU”和“AMF-CSR: Adaptive Multi-Row Folding of CSR for SpMV on GPU”。第一个工作于2022年4月在国际高水平期刊TPDS上出版，第二个工作被国际高水平会议ICPADS2021收录。第三个工作“Revisiting Thread Configuration of SpMV Kernels on GPU: A Machine Learning Based Approach”是课题组目前正在进行的工作。本报告所汇报的内容致力于提升面向GPU的SpMV的计算效率，针对高数据传输开销、低缓存命中率和不均衡负载、单一线程配置方案的局限性这三个问题提出了相应的解决方案。

teaching

Parallel Progamming: Principle and Practice

Undergraduate course, Beijing Normal University, School of Artificial Intelligence, 2023

This course provides students with a comprehensive understanding of the principles, techniques, and best practices involved in developing parallel programs. Parallel programming is a fundamental aspect of high-performance computing and plays a crucial role in harnessing the power of modern computer architectures.

Jianhua Gao (高建花)

Posts by Collection

portfolio

publications

talks

teaching