• Jinhu Jiang, Rongchao Dong, Zhongjun Zhou, Changheng Song, Wenwen Wang, Pen-Chung Yew, Weihua Zhang. More with Less — Deriving More Translation Rules with Less Training Data for DBTs Using Parameterization. Proceedings of the 53rd Annual IEEE/ACM International Symposium on Microarchitecture (Open Source, MICRO 2020, CCF A)
• Sui Chen, Lei Liu, Weihua Zhang, and Lu Peng. Architectural Support for NVRAM Persistence in GPUs. IEEE Transactions on Parallel and Distributed Systems (TPDS 2020, CCF A)
• Weihua Zhang,Zhaofeng Yan,Yuzhe Lin,Chuanlei Zhao,Lu Peng. A High Throughput B+tree for SIMD architectures. IEEE Transactions on Parallel and Distributed Systems (TPDS 2020, CCF A)
2019
• Changheng Song, Wenwen Wang, Pen-Chung Yew, Weihua Zhang. Unleashing the Power of Learning: An Enhanced Learning-based Approach for Dynamic Binary Translation. Proceedings of the 2019 USENIX Annual Technical Conference (ATC 2019, CCF A)
• Zhaofeng Yan, Yuzhe Lin, Lu Peng, Weihua Zhang. Harmonia: A High Throughput B+ Tree for GPUs. Proceedings of the 24th Symposium on Principles and Practice of Parallel Programming (Open Source, PPoPP 2019, CCF A)
• Weihua Zhang, Xin Wang, Shiyu Ji, Ziyun Wei, Zhaoguo Wang, Haibo Chen. Scaling Concurrent Index Structures under Contention Using HTM. IEEE Transactions on Parallel and Distributed Systems (TPDS 2018, CCF A)
• Shaoming Chen, Lu Peng, Samuel Irving, Zhou Zhao, Weihua Zhang and Ashok Srivastava. qSwitch: Dynamical Off-Chip Bandwidth Allocation between Local and Remote Accesses. IEEE Transactions on Computer-Aided Design of Integrated Circuits And System (TCAD 2018, CCF A)
• Samuel Irving,Bin Li,Shaoming Chen,Lu Peng,Weihua Zhang,Lide Duan. Computer comparisons in the presence of performance variation. Frontiers of Computer Science (FCS 2018, CCF B)
• Weihua Zhang, Xiaofeng Ji, Yunping Lu, Haojun Wang, Haibo Chen, Pen-Chung Yew. Prophet: A Parallel Instruction-Oriented Many-Core Simulator. IEEE Transaction on Parallel and Distributed Systems (TPDS 2017, CCF A)
• Weihua Zhang, Xiaofeng Ji, Bo Song, Shiqiang Yu, Haibo Chen, Pen-Chung Yew, Tao Li, Wenyun Zhao. VarCatcher: A Framework for Tackling Performance Variability of Parallel Workloads on Multi-core. IEEE Transaction on Parallel and Distributed Systems (TPDS 2017, CCF A)
• Xin Wang, Weihua Zhang, Zhaoguo Wang, Ziyun Wei, Haibo Chen, Wenyun Zhao. Eunomia: A Scalable, Contention-Conscious HTM-Friendly B+Tree. ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming (PPoPP 2017, CCF A)
• Yunping Lu, Xin Wang, Weihua Zhang, Haibo Chen, Lu Peng, Wenyun Zhao. Performance Analysis of Multimedia Retrieval Workloads Running on Multicore. IEEE Transaction on Parallel and Distributed Systems (TPDS 2016, CCF A)
• Weihua Zhang, Shiqiang Yu, Haojun Wang, Zhuofang Dai, Haibo Chen. Hardware Support for Concurrent Detection of Multiple Concurrency Bugs on Fused CPU-GPU Architectures. IEEE Transactions on Computers (TC 2016, CCF A)
• Weihua Zhang, Haojun Wang, Yunping Lu, Haibo Chen and Wenyun Zhao. A Loosely-Coupled Full-System Multicore Simulation Framework. IEEE Transaction on Parallel and Distributed Systems (JPDC 2016, CCF B)
• Xin Wang, Xiaofeng Ji, Yunping Lu, Yi Li, Weihua Zhang, Wenyun Zhao. Understanding the Architectural Characteristics of EDA Algorithms. The 45th International Conference on Parallel Processing (ICPP 2016, CCF B)
• Yang Yu, Tianyang Lei, Weihua Zhang, Haibo Chen, Binyu Zang. Performance Analysis and Optimization of Full Garbage Collection in a Production JVM. The 12th Annual International Conference on Virtual Execution Environments (VEE 2016, CCF B)