# Zheng Zhang **AWS Shanghai AI Lab** Email: [zhaz@amazon.com](mailto\:zhaz@amazon.com), [zz@nyu.edu](mailto\:zz@nyu.edu), [zzhang@gmail.com](mailto\:zzhang@gmail.com) Website: [https://zzhang-cn.github.io/](https://zzhang-cn.github.io/) Phone: +86-15810126260 ## Education - **1984-1987**: B.S., Electronical Engineering, Fudan University, Shanghai, P.R.C. - **1990-1992**: M.S., Electrical and Computer Engineering, University of Texas at Dallas - **1993-1996**: Ph.D., Electrical and Computer Engineering, University of Illinois at Urbana-Champaign ## Research Interests - Forward-looking fundamental research on Large Language Models, focusing on their design, computational properties, applications in real-world contexts, and transformative potential in scientific discovery. - Development of practical and innovative frameworks to detect, diagnose, and mitigate hallucinations and reasoning limitations in LLMs, applicable across diverse modalities. - Addressing educational challenges in an AI-driven future by fostering Renaissance-like thinking, achieving exceptional performance with AI tools, and strengthening core abilities without tool dependence—ultimately empowering individuals to surpass pre-AI capabilities. Developed [**LLM4LLM** Project](https://github.com/zzhang-cn/LLM4LLM/), an innovative curriculum using LLMs to explain their own mechanisms through interactive learning. [Recent talk](https://news.qq.com/rain/a/20250125A03RV300?uid%5B0%5D=100130518884&uid%5B1%5D=100130518884&suid=&media_id=) ## Professional Experience ### AWS Shanghai AI Lab (2018-Present) - **Senior Principal Scientist and Lab Director** (Founding Director), built the lab from ground up with focus on machine learning/AI systems, algorithms and applications - Led the lab to strong research and industry impact: - Built and mentored research teams achieving \~100 publications in top-tier venues - Established research directions leading to significant technology transfers to AWS services - Founded and led development of [DGL](https://www.dgl.ai/) (Deep Graph Library): Leading open-source graph neural network platform, widely adopted in academia and industry - Built robust research pipeline through internship program (~200 interns to date, with strong placement record) - Established innovative industry-academia bridge through Consulting Professor Program - Successfully incubated and built two additional high-performing AWS teams ### NYU Shanghai (2014-2018) - Tenured **Global Network Professor** of Computer Science - Founder and advisor to the open-source project [MXNet](https://mxnet.apache.org/versions/1.9.1/), later adopted by Amazon as its official ML platform - Pioneered cross-disciplinary research: Initiated collaboration between neuroscience and machine learning for natural language understanding - Redesigned core computer science curriculum: Developed "Introduction to Computer Science" from ground up, integrating history, core algorithms, machine learning basics, and hands-on system building in one course. ### Microsoft Research Asia (2002-2014) - **Principal Researcher**. Founded and led System Research Group, growing it into a leading systems research organization in Asia - Member of Senior Leadership Team, shaping research strategy and direction of MSRA - Built research communities that bridged Asia and global systems research: - Founded SIGOPS APSys Workshop, now a premier systems venue in Asia-Pacific - Established ChinaSys community, connecting systems researchers across institutions - Recognized with multiple Microsoft Gold Star and Achievement Awards for research excellence and impact ## Publications [Google Scholar Profile](https://scholar.google.com.hk/citations?user=k0KiE4wAAAAJ\&hl=en) | Citations: 19,209 | h-index: 64 | i10-index: 146 (Since 2020: Citations: 10,375 h-index: 37, i10-index: 75) ### Selected Works #### Open-source frameworks for scalable machine learning and graph machine learning. Initiated popular scalable machine learning and graph neural network framework through robust, open-source platforms (\~2.9K and \~1.9K citations, for MXNet and DGL, respectively). - Minjie Wang, Da Zheng, Zihao Ye, Quan Gan, Mufei Li, Xiang Song, Jinjing Zhou, Chao Ma, Lingfan Yu, Yu Gai, Tianjun Xiao, Tong He, George Karypis, Jinyang Li, Zheng Zhang. **Deep Graph Library: A Graph-Centric, Highly-Performant Package for Graph Neural Networks**. arXiv:1909.01315. Sep 2019 - Chen T, Li M, Li Y, Lin M, Wang N, Wang M, Xiao T, Xu B, Zhang C, Zhang Z. **Mxnet: A flexible and efficient machine learning library for heterogeneous distributed systems**. arXiv:1512.01274. Dec 2015. #### Large Language Models Rethinking LLMs as computing devices and addressing their key limitations; mechanisms and benchmarks to perform hallucination detection. - Zhang Z. **Comprehension Without Competence: Architectural Limits of LLMs in Symbolic Computation and Reasoning**. TMLR, Nov 2025. - Ru D, Qiu L, Hu X, Zhang T, Shi P, Chang S, Jiayang C, Wang C, Sun S, Li H, Zhang Z. **Ragchecker: A fine-grained framework for diagnosing retrieval-augmented generation**. *NeurIPS 2024 D&B Track*. 2024 Aug 15. - Hu X, Ru D, Qiu L, Guo Q, Zhang T, Xu Y, Luo Y, Liu P, Zhang Y, Zhang Z. **RefChecker: Reference-based Fine-grained Hallucination Checker and Benchmark for Large Language Models**. *EMNLP 2024*. #### Graph Neural Networks, Algorithms and Applications GNN algorithms and system co-design, application and benchmarking of GNN for relational data, and applications in text and multi-agent trajectory prediction. - Huang K, Jiang H, Wang M, Xiao G, Wipf D, Song X, Gan Q, Huang Z, Zhai J, Zhang Z. **FreshGNN: Reducing Memory Access via Stable Historical Embeddings for Graph Neural Network Training**. VLDB Endowment. 2024 Feb 1;17(6):1473-86. - Wang M, Gan Q, Wipf D, Cai Z, Li N, Tang J, Zhang Z, et al. **4DBInfer: A 4D Benchmarking Toolbox for Graph-Centric Predictive Modeling on Relational DBs**. *arXiv:2404.18209*. 28 Apr 2024. - Guo Q, Jin Z, Wang Z, Qiu X, Zhang W, Zhu J, Zhang Z, Wipf D. **Fork or fail: Cycle-consistent training with many-to-one mappings**. *AISTATS*. 2021 Mar; pp. 1828-36. - Guo Q, Jin Z, Qiu X, Zhang W, Wipf D, Zhang Z. **Cyclegt: Unsupervised graph-to-text and text-to-graph generation via cycle training**. *arXiv:2006.04702*. 2020 Jun 8. - Li L, Yao J, Wenliang L, et al. **Grin: Generative relation and intention network for multi-agent trajectory prediction**. *NeurIPS*. 2021 Dec;34:27107-18. #### Transformer Architecture Optimizations Early solutions for addressing long-range context challenges, influencing subsequent innovations in attention mechanisms. - Guo Q, Qiu X, Liu P, Shao Y, Xue X, Zhang Z. **Star-transformer**. *NeurIPS 2019*. 2019 Feb 25. - Ye Z, Guo Q, Gan Q, Qiu X, Zhang Z. **Bp-transformer: Modelling long-range context via binary partitioning**. *ICLR 2020*. 2019 Nov 11. #### Core Computer Vision Problems Research key challenges in computer vision, object-centric learning and sequential image attention. - Seitzer M, Horn M, Zadaianchuk A, et al. **Bridging the gap to real-world object-centric learning**. *ICLR* 2023. - Welleck S, Mao J, Cho K, Zhang Z. **Saliency-based sequential image attention with multiset prediction**. *NeurIPS*. 2017 - Xiao T, Zhang J, Yang K, Peng Y, Zhang Z. **Error-driven incremental learning in deep convolutional neural network for large-scale image classification**. *ACM Multimedia*. 2014 Nov; pp. 177-86. - Xiao T, Xu Y, Yang K, Zhang J, Peng Y, Zhang Z. **The application of two-level attention models in deep convolutional neural network for fine-grained image classification**. *CVPR*. 2015; pp. 842-50. ## Grants and Research Support - **Major Program Grant (2015-2019)**: Science and Technology Commission Shanghai Municipal (15JC140014), Joint research of brain mechanism of language processing and computational modeling in artificial intelligence, 5M RMB, PI, project initiator and coordinator. - **Research Grant (2015-2018)** Science and Technology Commission Shanghai Municipal (15XD1503000), cross-disciplinary research of cognitive process in bio-system and artificial intelligence, 400K RMB, PI. ## Awards and Honors ### Major Recognition - China's National "Thousand Talent Program" recipient (2016) ### Best Paper Awards - Best Paper, Eurosys (2012) - Best Student Paper, Eurosys (2008) - Best Paper, USENIX LISA (2003) ### Industry Achievement Awards - Excellence in Collaboration, Cloud and China Enterprise, Microsoft (2014) - Microsoft 10 Year's Achievement Award (2012) - Microsoft Gold Star Award (2005, 2007) - HP Diamond Award (2001) - HP Star Award (1997) ## Professional Service ### Leadership in Research Communities - **Founder and Steering Committee Member**, SIGOPS APSys (2009-2014) - **Founder**, ChinaSys (cross-institute systems research community in China) - **General Co-Chair**, ACM SIGOPS APSys Workshop 2011 - **PC Co-Chair**, ACM SIGOPS APSys Workshop 2013 ### Program Committee Service - ACM Eurosys 2017 - ACM OSDI 2016 - ACM/USENIX Middleware 2014 - USENIX FAST 2014 ### Conference Organization - **Organizer**, AI-Brain Workshop, NYU Shanghai (2016) ### Journal and Grant Reviews - **Invited Expert Reviewer**, National Science Foundation of China (NSFC) (2004-2005) ## Patents 46 US Patent awards total