• Proteus: Agile ML Elasticity through Tiered Reliability in Dynamic Resource Markets.  Aaron Harlap, Alexey Tumanov, Andrew Chung, Greg Ganger, Phil Gibbons.  ACM European Conference on Computer Systems, 2017 (EuroSys'17), 23rd-26th April, 2017, Belgrade, Serbia. Supersedes Carnegie Mellon University Parallel Data Lab Technical Report CMU-PDL-16-102. May 2016. [PDF]
    FaSST: Fast, Scalable and Simple Distributed Transactions with Two-Sided (RDMA) Datagram RPCs. Kalia A, Kaminsky M, Andersen DG. 12th USENIX Symposium on Operating Systems Design and Implementation (OSDI 16). October 2016. [PDF]
    Sapprox: Enabling Efficient and Accurate Approximations on Sub-datasets with Distribution-aware Online Sampling. Xuhohg Zhang, Jun Wang, Jiangling Yin. 2017. Accepted to VLDB 17. Munich, Germany. August 8 - September 21, 2017. [PDF]
    Addressing the Straggler Problem for Iterative convergent parallel ML. Aaron Harlap, Henggain Cui, Wei Dai, Jinliang Wei, Gregory R. Ganger, Phillip B. Gibbons, Garth A. Gibson, Eric P. Xing. 2016. ACM Symposium on Cloud Computing. October 5-7, 2016. Santa Clara, CA. [PDF]
  • The SNOW Theorem and Latency-Optimal Read-Only TransactionsHaonan Lu, Christopher Hodsdon, Khiem Ngo, Shuai Mu, Wyatt Lloyd. In Proc. 12th Symposium on Operating Systems Design and Implementation (OSDI 16), October 2016. [PDF]
  • Design Guidelines for High Performance RDMA Systems. Anuj Kalia, Michael Kaminsky, David G. Andersen. 2016 USENIX Annual Techical Conference. June 22-24, 2016. Denver, CO. [PDF]
  • STRADS: A Distributed Framework for Scheduled Model Parallel Machine Learning. Jin Kyu Kim, Qirong Ho, Seunghak Lee, Xun Zheng, Wei Dai, Garth Gibson, Eric P. Xing. ACM European Conference on Computer Systems, 2016. EuroSys’16. April 18-21, 2016, London, UK. [PDF]
  • TetriSched: Global Rescheduling with Adaptive Plan-ahead in Dynamic Heterogeneous Clusters. Alexey Tumanov, Timothy Zhu, Jun Woo Park, Michael A. Kozuch, Mor Harchol-Balter, Gregory R. Ganger. ACM European Conference on Computer Systems, 2016. EuroSys’16. April 2016, London, UK. [PDF]
  • GeePS: Scalable Deep Learning on Distributed GPUs with a GPU-Specialized Parameter Server. Henggang Cui, Hao Zhang, Gregory R. Ganger, Phillip B. Gibbons, and Eric P. Xing. EuroSys'16. ACM European Conference on Computer Systems. April 2016. London, UK.
  • Experiences in using os-level virtualization for block I/O. Huang, D., Wang, J., Liu, Q., Yin, J., Zhang, X., & Chen, X. November 2015. In Proceedings of the 10th Parallel Data Storage Workshop (pp. 13-18). ACM.
  • Achieving up to Zero Communication Delay in BSP-based Graph Processing via Vertex Categorization. Zuhong Zhang, Ruijun Wang, Xunchao Chen, Jun Wang, Tyler Lukasiewicz, Dezhi Han. IEEE International Parallel & Distributed Processing Symposium. 2015. 
  • Finding Schools of Fish in the Ocean: A Sub-dataset Locality-aware Method for Accelerating Data Analytics. Jun Wang, Jianglin Yin, Jian Zhou, Xuhong Zhang, Tyler Lukasiewicz, Dan Huang, Xunchao Chen, and Ruijun Wang, University of Central Florida,  Submitted to ACM Symposium on Cloud Computing 2015 (SoCC'15). 
  • Opass: Analysis and Optimization of Parallel Data Access on Distributed File Systems. Jiangling Yin, Jun Wang, Jian Zhou, Tyler Lukasiewicz, Dan Huang and Junyao Zhang, University of Central Florida. Accepted to 29th IEEE International Parallel & Distributed Processing Symposium. 2015. [PDF]
  • Optimize Parallel Data Access in Big Data Processing. Jiangling Yin and Jun Wang , University of Central Florida. Accepted to 15th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing: Doctoral Symposium Program. 
  • Towards Scalable Distributed Workload Manager with Monitoring-Based Weakly Consistent Resource Stealing.  Ke Wang, Xiaobing Zhou, Kan Qiao, Michael Lang, Benjamin McClelland, Ioan Raicu. 2015. ACM HPDC. [PDF]
  • Simba: Tunable End-to-End Data Consistency for Mobile Apps. Dorian Perkins, Nitin Agrawal, Akshat Aranya, Curtis Yu, Younghwan Go, Harsha Madhyastha, and Cristian Ungureanu. To appear in Proceedings of the 10th European Conference on Computer Systems (EuroSys 15), Bordeaux, France, April 21-24, 2015. [PDF]
  • Deep Fried Convnets. Zichao Yang, Marcin Moczulski, Misha Denil, Nando de Freitas, Alex Smola, Le Song, Ziyu Wang. International Conference on Learning Representations. May 7-9, 2015. San Diego, CA. [PDF]
  • IndexFS: Scaling File System Metadata Performance. Kai Ren, Qing Zheng, Garth Gibson. Supercomputing 2014. November 16-21, 2014. New Orleans. [PDF]. Highlight: Won SC14 Best Paper Award! 
  • On Model Parallelization and Scheduling Strategies for Distributed Machine Learning. Seunghak Lee, Jin Kyu Kim, Xun Zheng, Qirong Ho, Garth A Gibson, Eric P Xing. Neural Information Procession Systems Foundation. December 9-14, 2014. Quebec, Canada. [PDF]
  • Archie: A Speculative Replicated Transactional System. Sachin Hirve, Roberto Palmieri and Binoy Ravindran. ACM/IFIP/USENIX 15th International Middleware Conference. MIDDLEWARE 2014. December 8-12, 2014. Bordeaux, France. [PDF]
  • Sebo: Selective Bulk Analysis Optimization in Big Data Processing. Jiangling Yin and Jun Wang, University of Central Florida. Accepted to Supercomputing Frontiers 2015 Programme. March 17 - 20, 2015. Biopolis, Singapore.
  • Exploring the Design Tradeoffs for Extreme-Scale High-Performance Computing System Software. K Wang, A Kulkarni, M Lang, D Arnold, I Raicu. 2015. IEEE Transactions on Parallel and Distributed Systems, MANUSCRIPT ID 1. [PDF]
  • Overcoming Hadoop Scaling Limitations through Distributed Task Execution. K. Wang, N. Liu, I. Sadooghi, X. Yang, X. Zhou, M. Lang, X.-H. Sun and I. Raicu. In Proc. of the IEEE International Conference on Cluster Computing 2015 (Cluster’15), Chicago, IL, USA, Sept. 2015. [PDF]
  • Raising the Bar for Using GPUs in Software Packet Processing. Anuj Kalia, Dong Zhou, Michael Kaminsky, David G. Andersen. To appear in Proceedings of the 12th Symposium on Networked Systems Design and Implementation (NSDI'15), Oakland, CA, May 2015. [PDF]
  • Reducing File System Tail Latencies with Chopper. He J., Nguyen D., Arpaci-Dusseau A., Arpaci-Dusseau R. 2015. 13th USENIX Conference on File and Storage Technologies (FAST 15). Santa Clara, CA. [PDF]. 
  • Exploiting iterative-ness for parallel ML computations. Henggang Cui, Alexey Tumanov, Jinliang Wei, Lianghong Xu, Wei Dai, Jesse Haber-Kucharsky, Qirong Ho, Gregory R. Ganger, Phillip B. Gibbons, Garth A. Gibson, Eric P. Xing. 2014 ACM Symposium on Cloud Computing (SOCC 2014), Nov 3-5, Seattle, WA. [PDF]
  • BatchFS: Scaling the File System Control Plane with Client-Funded Metadata Servers. Qing Zheng, Kai Ren and Garth Gibson. 9th Petascale Data Storage Workshop. Supercomputing (PDSW), 2014.  New Orleans. [PDF]
  • Optimizing Load Balancing and Data-Locality with Data-Aware Scheduling. Ke Wang, Xiaobing Zhou, Tonglin Li, Dongfang Zhao, Michael Lang, Ioan Raicu. 2014 IEEE International Conference on Big Data. October 27-30. Washington DC. [PDF]
  • Scaling Distributed Machine Learning with the Parameter Server. Mu Li, David G. Andersen, Jun Woo Park, Alexander J. Smola, Amr Ahmed, Vanja Josifovski, James Long, Eugene J. Shekita, Bor-Yiing Su. 11th USENIX Symposium on Operating Systems Design and Implementation. October 6-8, 2014. Broomfield, CO. [PDF]
  • Extracting More Concurrency from Distributed Transactions. Shuai Mu, Yang Cui, Yang Zhang, Wyatt Lloyd, Jinyang Li. 11th USENIX Symposium on Operating Systems Design and Implementation. October 6-8, 2014. Broomfield, CO. [PDF]
  • SAMC: Semantic-Aware Model Checking for Fast Discovery of Deep Bugs in Cloud Systems. Tanakorn Leesatapornwongsa, Mingzhe Hao, Pallavi Joshi, Jeffrey Lukman, Haryadi Gunawi. 11th USENIX Symposium on Operating Systems Design and Implementation. October 6-8, 2014. Broomfield, CO. [PDF]
  • Efficient Mini-batch Training for Stochastic Optimization. Mu Li, Tong Zhang, Yuqiang Chen, Alex Smola. KDD 2014. 20th ACM SIGKDD Conference on Knowledge Discovery and Data Mining. New York City. August 24-27, 2014. [PDF]
  • Using RDMA Efficiently for Key-Value Services. Anuj Kalia, Michael Kaminsky, David Andersen. ACM SIGCOMM 2014. Chicago, Illinois, August 17-22, 2014. [PDF]
  • ScalScheduling: A Scalable Scheduling Architecture for MPI-based Interactive Analysis Programs. Jiangling Yin, Andrew Foran, Xuhong Zhang and Jun Wang. The 23rd International Conference on Computer Communications and Networks (ICCCN 2014). Shanghai, China, August 4-7, 2014. [PDF]
  • SLAM: Scalable Locality-Aware Middleware for I/O in Scientific Analysis and Visualization. Jiangling Yin, Jun Wang, Wuchun Feng, Xuhong Zhang, Junyao Zhang. The 23rd International Symposium on High Performance Distributed Computing (ACM HPDC2014). Vancouver, Canada. June 23-27, 2014. [PDF]
  • Exploiting Bounded Staleness to Speed up Big Data Analytics. Henggang Cui, James Cipar, Qirong Ho, Jin Kyu Kim, Seunghak Lee, Abhimanu Kumar Jinliang Wei, Wei Dai, Gregory R. Ganger, Phillip B. Gibbons, Garth A. Gibson, Eric P. Xing. 2014 USENIX Annual Technical Conference (ATC'14). June 19-20, 2014. Philadelphia, PA. [PDF]
  • The Energy Efficiency of Database Replication Protocols. Nicolas Schiper, Fernando Pedone, and Robbert van Renesse. The 44th Annual IEEE/IFIP International Conference on Dependable Systems and Networks (DSN 2014). Atlanta, GA. June 2014. [PDF]
  • Next Generation Job Management Systems for Extreme-Scale Ensemble Computing. Ke Wang, Xiaobing Zhou, Hao Chen, Michael Lang, Ioan Raicu. 2014. International Symposium on  High-Performance Parallel and Distributed Computing (HPDC). Vancouver, Canada. June 23-27, 2014. [PDF]
  • Using One-Sided RDMA Reads to Build a Fast, CPU-Efficient Key-Value Store. Christopher Mitchell, Yifeng Geng, Jinyang Li. USENIX Annual Technical Conference 2013. San Jose, CA. June 26-28, 2013.  [PDF]
  • PARROT: A Practical Runtime for Deterministic, Stable and Reliable Threads. Heming Cui, Jiri Simsa, Yi-Hong Lin, Hao Li, Ben Blum, Xinan Xu, Junfeng Yang, Garth A. Gibson. 24th ACM Symposium on Operating Systems Principles (SOSP'13), Nov 4-6, 2013, Farmington, PA. [URL]
  • Using Simulation to Explore Distributed Key-Value Stores for Extreme-Scale Systems Services. Ke Wang, Abhishek Kulkarni, Michael Lang, Dorian Arnold, Ioan Raicu.  IEEE/ACM Supercomputing/SC 2013. [PDF]
  • Sprinkler - Reliable Broadcast for Geographically Dispersed Datacenters. Haoyan Geng and Robbert van Renesse. International Middleware Conference (Middleware). Beijing, China. December 2013. [PDF]
  • Leveraging Sharding in the Design of Scalable Replication Protocols. Hussam Abu-Libdeh, Robbert van Renesse, and Ymir Vigfusson. Symposium on Cloud Computing (SoCC). Farmington, PA. October 2013. [PDF]
  • DL-MPI: Enabling Data Locality Computation for MPI-based Data-Intensive Application. Jiangling Yin, Andrew Foran, and Jun Wang. In the 2013 IEEE International Conference on Big Data (BigData 2013), Oct 6-9, 2013, Santa Clara, CA, USA [PDF]
  • Stronger semantics for low-latency geo-replicated storage. Wyatt Lloyd, Michael J. Freedman, Michael Kaminsky, and David G. Andersen., In Proceedings of the 10th USENIX conference on Networked Systems Design and Implementation, NSDI 2013. pp. 313-328. Lombard, IL. USENIX Association. [PDF]
  • TABLEFS: Enhancing Metadata Efficiency in the Local File System. Kai Ren, Garth Gibson. 2013. USENIX Annual Technical Conference. June 26-28, 2013. San Jose, CA. [URL]
  • Stronger semantics for low-latency go-replicated storage. Wyatt Lloyd, Michael J. Feedman, Michael Kaminsky, David G. Andersen. 2013. 10th USENIX Symposium on Networked Systems Design and Implementation (NSDI). Lombard, IL, April 2-5, 2013. USENIX Association. [PDF]

p.p1 {margin: 0.0px 0.0px 0.0px 0.0px; font: 14.0px 'Helvetica Neue'; color: #323333; -webkit-text-stroke: #323333} span.s1 {font-kerning: none} span.s2 {font-kerning: none; color: #606060; -webkit-text-stroke: 0px #606060}