Zhiling Lan, Ph.D.
Parallel and Distributed Systems, High Performance Computing
- FENCE: Fault awareness Enabled Computing Environment,
- Exploring Large-scale Adaptive Applications on Distributed Systems
- Towards Petascale Cosmological Simulations (funded by NSF)
- Recognized for outstanding teaching performance by IIT Dean of college of science and letters, 2006
- "Fate of the Universe Award", HPC Games of SC’1999, Portland, OR (1999)
Z. Lan and Y. Li, “Adaptive Fault Management of Parallel Applications for High Performance Computing”, to appear in IEEE Trans. on Computers.
Z. Lan, V. Taylor and Y. Li, “DistDLB: Improving the Efficiency of Cosmology Simulations in Distributed Computing Environments through Hierarchical Load Balancing”, Journal of Parallel and Distributed Computing, Vol. 66(5), pp. 716-731, 2006.
Z. Lan and P. Deshikachar, “Performance Analysis of a Large-scale Cosmology Application on Three Cluster Systems”, International Journal of High Performance Computing and Networking (IJHPCN), 2006.
Z. Lan, V. Taylor, and G. Bryan, “Exploring Cosmology Applications on Distributed Environments”, Journal of Future Generation Computer Systems, Vol. 19(6), pp. 839-847, August, 2003.
Z. Lan, V. Taylor, and G. Bryan, “A novel dynamic load balancing scheme for parallel systems”, Journal of Parallel and Distributed Computing, Vol . 62/12, pp. 1763 – 1781, 2002.
Z. Lan, Z. Zheng, and Y. Li, "Toward Automated Anomaly Identification in Large-Scale Systems", to appear in IEEE Trans. on Parallel and Distributed Systems, 2009.
Z. Zheng, Z. Lan, B-H. Park, and A. Geist, "System Log Pre-processing to Improve Failure Prediction", Proc. of DSN'09, 2009.
Y. Li and Z. Lan, "A Fast Recovery Mechanism for Checkpointing in Networked Environments", Proc. of DSN'08, 2008.
Z. Lan and Y. Li, "Adaptive Fault Management of Parallel Applications for High Performance Computing", IEEE Trans. on Computers, vol. 57(12), pp. 1647-1660, 2008.
Y. Li, Z. Lan, P. Gujrati, and X. Sun, "Fault-Aware Runtime Strategies for High Performance Computing", IEEE Trans. on Parallel and Distributed Systems, vol. 20(4), pp. 460-473, 2009.
J. Gu, Z. Zheng, Z. Lan, J. White, E. Hocks, and B-H. Park, "Dynamic Meta-Learning for Failure Prediction in Large-scale Systems: A Case Study", Proc. of ICPP'08, 2008.