Publications

Achieving up to Zero Communication Delay in BSP-based Graph Processing via Vertex Categorization. Zuhong Zhang, Ruijun Wang, Xunchao Chen, Jun Wang, Tyler Lukasiewicz, Dezhi Han. IEEE International Parallel & Distributed Processing Symposium. 2020.

Finding Schools of Fish in the Ocean: A Sub-dataset Locality-aware Method for Accelerating Data Analytics. Jun Wang, Jianglin Yin, Jian Zhou, Xuhong Zhang, Tyler Lukasiewicz, Dan Huang, Xunchao Chen, and Ruijun Wang, University of Central Florida, Submitted to ACM Symposium on Cloud Computing 2020 (SoCC'15).

Opass: Analysis and Optimization of Parallel Data Access on Distributed File Systems. Jiangling Yin, Jun Wang, Jian Zhou, Tyler Lukasiewicz, Dan Huang and Junyao Zhang, University of Central Florida. Accepted to 29th IEEE International Parallel & Distributed Processing Symposium. 2020. [PDF]

Optimize Parallel Data Access in Big Data Processing. Jiangling Yin and Jun Wang , University of Central Florida. Accepted to 15th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing: Doctoral Symposium Program.

Towards Scalable Distributed Workload Manager with Monitoring-Based Weakly Consistent Resource Stealing. Ke Wang, Xiaobing Zhou, Kan Qiao, Michael Lang, Benjamin McClelland, Ioan Raicu. 2020. ACM HPDC. [PDF]

Simba: Tunable End-to-End Data Consistency for Mobile Apps. Dorian Perkins, Nitin Agrawal, Akshat Aranya, Curtis Yu, Younghwan Go, Harsha Madhyastha, and Cristian Ungureanu. To appear in Proceedings of the 10th European Conference on Computer Systems (EuroSys 15), Bordeaux, France, April 21-24, 2020. [PDF]

Deep Fried Convnets. Zichao Yang, Marcin Moczulski, Misha Denil, Nando de Freitas, Alex Smola, Le Song, Ziyu Wang. International Conference on Learning Representations. May 7-9, 2020. San Diego, CA. [PDF]

IndexFS: Scaling File System Metadata Performance. Kai Ren, Qing Zheng, Garth Gibson. Supercomputing 2020. November 16-21, 2020. New Orleans. [PDF]. Highlight: Won SC14 Best Paper Award!

On Model Parallelization and Scheduling Strategies for Distributed Machine Learning. Seunghak Lee, Jin Kyu Kim, Xun Zheng, Qirong Ho, Garth A Gibson, Eric P Xing. Neural Information Procession Systems Foundation. December 9-14, 2020. Quebec, Canada. [PDF]

Archie: A Speculative Replicated Transactional System. Sachin Hirve, Roberto Palmieri and Binoy Ravindran. ACM/IFIP/USENIX 15th International Middleware Conference. MIDDLEWARE 2020. December 8-12, 2020. Bordeaux, France. [PDF]
Sebo: Selective Bulk Analysis Optimization in Big Data Processing. Jiangling Yin and Jun Wang, University of Central Florida. Accepted to Supercomputing Frontiers 2020 Programme. March 17 - 20, 2020. Biopolis, Singapore.
Exploring the Design Tradeoffs for Extreme-Scale High-Performance Computing System Software. K Wang, A Kulkarni, M Lang, D Arnold, I Raicu. 2020. IEEE Transactions on Parallel and Distributed Systems, MANUSCRIPT ID 1. [PDF]
Overcoming Hadoop Scaling Limitations through Distributed Task Execution. K. Wang, N. Liu, I. Sadooghi, X. Yang, X. Zhou, M. Lang, X.-H. Sun and I. Raicu. In Proc. of the IEEE International Conference on Cluster Computing 2020 (Cluster’15), Chicago, IL, USA, Sept. 2020. [PDF]
Raising the Bar for Using GPUs in Software Packet Processing. Anuj Kalia, Dong Zhou, Michael Kaminsky, David G. Andersen. To appear in Proceedings of the 12th Symposium on Networked Systems Design and Implementation (NSDI'15), Oakland, CA, May 2020. [PDF]
Reducing File System Tail Latencies with Chopper. He J., Nguyen D., Arpaci-Dusseau A., Arpaci-Dusseau R. 2020. 13th USENIX Conference on File and Storage Technologies (FAST 15). Santa Clara, CA. [PDF].
Exploiting iterative-ness for parallel ML computations. Henggang Cui, Alexey Tumanov, Jinliang Wei, Lianghong Xu, Wei Dai, Jesse Haber-Kucharsky, Qirong Ho, Gregory R. Ganger, Phillip B. Gibbons, Garth A. Gibson, Eric P. Xing. 2020 ACM Symposium on Cloud Computing (SOCC 2020), Nov 3-5, Seattle, WA. [PDF]
BatchFS: Scaling the File System Control Plane with Client-Funded Metadata Servers. Qing Zheng, Kai Ren and Garth Gibson. 9th Petascale Data Storage Workshop. Supercomputing (PDSW), 2020. New Orleans. [PDF]
Optimizing Load Balancing and Data-Locality with Data-Aware Scheduling. Ke Wang, Xiaobing Zhou, Tonglin Li, Dongfang Zhao, Michael Lang, Ioan Raicu. 2020 IEEE International Conference on Big Data. October 27-30. Washington DC. [PDF]
Scaling Distributed Machine Learning with the Parameter Server. Mu Li, David G. Andersen, Jun Woo Park, Alexander J. Smola, Amr Ahmed, Vanja Josifovski, James Long, Eugene J. Shekita, Bor-Yiing Su. 11th USENIX Symposium on Operating Systems Design and Implementation. October 6-8, 2020. Broomfield, CO. [PDF]
Extracting More Concurrency from Distributed Transactions. Shuai Mu, Yang Cui, Yang Zhang, Wyatt Lloyd, Jinyang Li. 11th USENIX Symposium on Operating Systems Design and Implementation. October 6-8, 2020. Broomfield, CO. [PDF]
SAMC: Semantic-Aware Model Checking for Fast Discovery of Deep Bugs in Cloud Systems. Tanakorn Leesatapornwongsa, Mingzhe Hao, Pallavi Joshi, Jeffrey Lukman, Haryadi Gunawi. 11th USENIX Symposium on Operating Systems Design and Implementation. October 6-8, 2020. Broomfield, CO. [PDF]
Efficient Mini-batch Training for Stochastic Optimization. Mu Li, Tong Zhang, Yuqiang Chen, Alex Smola. KDD 2020. 20th ACM SIGKDD Conference on Knowledge Discovery and Data Mining. New York City. August 24-27, 2020. [PDF]
Using RDMA Efficiently for Key-Value Services. Anuj Kalia, Michael Kaminsky, David Andersen. ACM SIGCOMM 2020. Chicago, Illinois, August 17-22, 2020. [PDF]
ScalScheduling: A Scalable Scheduling Architecture for MPI-based Interactive Analysis Programs. Jiangling Yin, Andrew Foran, Xuhong Zhang and Jun Wang. The 23rd International Conference on Computer Communications and Networks (ICCCN 2020). Shanghai, China, August 4-7, 2020. [PDF]
SLAM: Scalable Locality-Aware Middleware for I/O in Scientific Analysis and Visualization. Jiangling Yin, Jun Wang, Wuchun Feng, Xuhong Zhang, Junyao Zhang. The 23rd International Symposium on High Performance Distributed Computing (ACM HPDC2020). Vancouver, Canada. June 23-27, 2020. [PDF]
Exploiting Bounded Staleness to Speed up Big Data Analytics. Henggang Cui, James Cipar, Qirong Ho, Jin Kyu Kim, Seunghak Lee, Abhimanu Kumar Jinliang Wei, Wei Dai, Gregory R. Ganger, Phillip B. Gibbons, Garth A. Gibson, Eric P. Xing. 2020 USENIX Annual Technical Conference (ATC'14). June 19-20, 2020. Philadelphia, PA. [PDF]
The Energy Efficiency of Database Replication Protocols. Nicolas Schiper, Fernando Pedone, and Robbert van Renesse. The 44th Annual IEEE/IFIP International Conference on Dependable Systems and Networks (DSN 2020). Atlanta, GA. June 2020. [PDF]
Next Generation Job Management Systems for Extreme-Scale Ensemble Computing. Ke Wang, Xiaobing Zhou, Hao Chen, Michael Lang, Ioan Raicu. 2020. International Symposium on High-Performance Parallel and Distributed Computing (HPDC). Vancouver, Canada. June 23-27, 2020. [PDF]
PRObE: A Thousand-Node Experimental Cluster for Computer Systems Research. Garth Gibson, Gary Grider, Andree Jacobson, Wyatt Lloyd. USENIX. ;login:, 38(3). June 2020. [PDF]
Using One-Sided RDMA Reads to Build a Fast, CPU-Efficient Key-Value Store. Christopher Mitchell, Yifeng Geng, Jinyang Li. USENIX Annual Technical Conference 2020. San Jose, CA. June 26-28, 2020. [PDF]
PARROT: A Practical Runtime for Deterministic, Stable and Reliable Threads. Heming Cui, Jiri Simsa, Yi-Hong Lin, Hao Li, Ben Blum, Xinan Xu, Junfeng Yang, Garth A. Gibson. 24th ACM Symposium on Operating Systems Principles (SOSP'13), Nov 4-6, 2020, Farmington, PA. [URL]
Using Simulation to Explore Distributed Key-Value Stores for Extreme-Scale Systems Services. Ke Wang, Abhishek Kulkarni, Michael Lang, Dorian Arnold, Ioan Raicu. IEEE/ACM Supercomputing/SC 2020. [PDF]
Sprinkler - Reliable Broadcast for Geographically Dispersed Datacenters. Haoyan Geng and Robbert van Renesse. International Middleware Conference (Middleware). Beijing, China. December 2020. [PDF]
Leveraging Sharding in the Design of Scalable Replication Protocols. Hussam Abu-Libdeh, Robbert van Renesse, and Ymir Vigfusson. Symposium on Cloud Computing (SoCC). Farmington, PA. October 2020. [PDF]
DL-MPI: Enabling Data Locality Computation for MPI-based Data-Intensive Application. Jiangling Yin, Andrew Foran, and Jun Wang. In the 2020 IEEE International Conference on Big Data (BigData 2020), Oct 6-9, 2020, Santa Clara, CA, USA [PDF]
Stronger semantics for low-latency geo-replicated storage. Wyatt Lloyd, Michael J. Freedman, Michael Kaminsky, and David G. Andersen. }, In Proceedings of the 10th USENIX conference on Networked Systems Design and Implementation, NSDI 2020. pp. 313-328. Lombard, IL. USENIX Association. [PDF]
TABLEFS: Enhancing Metadata Efficiency in the Local File System. Kai Ren, Garth Gibson. 2020. USENIX Annual Technical Conference. June 26-28, 2020. San Jose, CA. [URL]
Stronger semantics for low-latency go-replicated storage. Wyatt Lloyd, Michael J. Feedman, Michael Kaminsky, David G. Andersen. 2020. 10th USENIX Symposium on Networked Systems Design and Implementation (NSDI). Lombard, IL, April 2-5, 2020. USENIX Association. [PDF]

PRObE