Hello World!
My name is Xiaoyun Li. I’m a machine learning scientist at LinkedIn, developing algorithms and theory for big data and machine learning. Before joining Linkedin, I was a research scientist at Baidu Research.
Education
- B.S. Financial Mathematics, Shanghai Jiao Tong University, 2015
- M.S. Financial Statistics and Risk Management, Rutgers University, 2017
- Ph.D Statistics, Rutgers University, 2021
I was fortunate to work with Prof. Ping Li.
Thesis committee: Prof. Ping Li, Prof. Cun-Hui Zhang, Prof. Min Xu, Prof. William Steiger.
Work Experience
Teaching Assistant, Department of Statistics, Rutgers University
– Introduction to Statistics, Advanced Simulation, Data MiningBaidu Research
LinkedIn
Interests
Statistics, Machine Learning, Randomized Algorithms, Information Retrieval, Privacy, Distributed Optimization, Deep Learning, Basketball, Guitar… and anything interesting!
Academic Services
- Conference Reviewer/PC Member: Since 2020, I have served as the reviewer/PC/SPC for NeurIPS, ICML, ICLR, SODA, AISTATS, KDD, WWW, AAAI, IJCAI conference
- Journal Reviewer: IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), Transactions on Machine Learning Research (TMLR), Machine Learning, IEEE/CAA Journal of Automatica Sinica
Selected Publications
( * : alphabetical order)
News:
- Xinmeng Huang*, Ping Li* and Xiaoyun Li*. Stochastic Controlled Averaging for Federated Learning with Communication Compression, [paper] is accepted to ICLR 2024 as spotlight paper.
- Two papers on sketching and clustering with differential privacy are accepted to NeurIPS 2023.
- Two papers on federated learning and clustering are accepted to ICML 2023.
Dimensionality Reduction & Compression:
- Xiaoyun Li and Ping Li. Generalization Error Analysis of Quantized Compressive Learning, Neural Information Processing Systems (NeurIPS) 2019 (Oral spotlight), [paper].
- Xiaoyun Li and Ping Li. Random Projections with Asymmetric Quantization, Neural Information Processing Systems (NeurIPS) 2019, [paper].
- Xiaoyun Li and Ping Li. Quantization Algorithms for Random Fourier Features, International Conference on Machine Learning (ICML) 2021, [paper].
- Xiaoyun Li and Ping Li. One-Sketch-for-All: Non-linear Random Features from Compressed Linear Measurements, International Conference on Artificial Intelligence and Statistics (AISTATS) 2021, [paper].
- Xiaoyun Li, Chenxi Wu and Ping Li. IVFS: Simple and Efficient Feature Selection for High Dimensional Topology Preservation, AAAI Conference on Artificial Intelligence (AAAI) 2020, [paper].
- Ping Li and Xiaoyun Li. OPORP: One Permutation + One Random Projection, SIGKDD Conference on Knowledge Discovery and Data Mining (KDD) 2023 [paper].
- Zhiqiang Xu, Dong Li, Weijie Zhao, Xing Shen, Tianbo Huang, Xiaoyun Li, and Ping Li. Agile and Accurate CTR Prediction Model Training for Massive-Scale Online Advertising Systems, ACM International Conference on Management of Data (SIGMOD) 2021, [paper].
Randomized Algorithms & Hashing:
- Xiaoyun Li and Ping Li. SignRFF: Sign Random Fourier Features, Neural Information Processing Systems (NeurIPS) 2022, [paper].
- Xiaoyun Li and Ping Li. C-MinHash: Improving Min-wise Hashing with Circulant Permutation, International Conference on Machine Learning (ICML) 2022, [paper].
- Xiaoyun Li and Ping Li. Rejection Sampling for Weighted Jaccard Similarity Revisited, AAAI Conference on Artificial Intelligence (AAAI) 2021, [paper].
- Xiaoyun Li, Jie Gui and Ping Li. Randomized Kernel Multi-view Discriminant Analysis, European Conference on Artificial Intelligence (ECAI) 2020, [paper].
- Ping Li*, Xiaoyun Li* and Cun-Hui Zhang*. Re-randomized Densification for One Permutation Hashing and Bin-wise Consistent Weighted Sampling, Neural Information Processing Systems (NeurIPS) 2019, [paper].
- Ping Li*, Xiaoyun Li*, Gennady Samorodnitsky* and Weijie Zhao*. Consistent Sampling Through Extremal Process, The Web Conference (WWW) 2021, [paper].
- Ping Li*, Xiaoyun Li* and Gennady Samorodnitsky*. P-MinHash Algorithm for Continuous Probability Measures: Theory and Application to Machine Learning, Conference on Information and Knowledge Management (CIKM) 2022, [paper].
- Peng Yang, Xiaoyun Li and Ping Li. Graph-based Adversarial Online Kernel Learning with Adaptive Embedding, International Conference on Data Mining (ICDM) 2021, [paper].
- Xinyi Zheng, Weijie Zhao, Xiaoyun Li and Ping Li. Building K-Anonymous User Cohorts with Consecutive Consistent Weighted Sampling (CCWS), International ACM SIGIR Conference on Information Retrieval (SIGIR) 2023.
Distributed/Federated Optimization:
- Xiaoyun Li and Ping Li. Analysis of Error Feedback in Federated Non-convex Optimization with Biased Compression: Fast Convergence and Partial Participation, International Conference on Machine Learning (ICML) 2023 (Oral) [paper].
- Xiaoyun Li, Belhal Karimi and Ping Li. On Distributed Adaptive Optimization with Gradient Compression, International Conference on Learning Representations (ICLR) 2022, [paper].
- Belhal Karimi*, Ping Li*, and Xiaoyun Li*. Fed-LAMB: Layer-wise and Dimension-wise Locally Adaptive Federated Learning, Conference on Uncertainty in Artificial Intelligence (UAI) 2023 [paper].
- Farzin Haddadpour*, Belhal Karimi*, Ping Li* and Xiaoyun Li*. FedSKETCH: Communication-Efficient and Private Federated Learning via Sketching, [Arxiv].
- Xiangyi Chen, Xiaoyun Li and Ping Li. Towards Communication-Efficient Adaptive Gradient Method, ACM-IMS Foundations of Data Science Conference (FODS) 2020, [paper].
- Jun-Kun Wang, Xiaoyun Li, Belhal Karimi and Ping Li. An Optimistic Acceleration of AMSGrad for Nonconvex Optimization, Asian Conference on Machine Learning (ACML) 2021, [paper].
- Weijie Zhao, Xuewu Jiao, Mingqing Hu, Xiaoyun Li, Xiangyu Zhang, and Ping Li. Communication-Efficient Terabyte-Scale Model Training Framework for Online Advertising, IEEE International Conference on Big Data (IEEE BigData) 2023, to appear.
Privacy, Clustering, Statistics:
- Long Feng*, Tiefeng Jiang*, Xiaoyun Li* and Binghui Liu*. Asymptotic Independence of the Sum and Maximum of Dependent Random Variables with Applications to High-dimensional Tests, Statistica Sinica (2023), [paper].
- Ping Li* and Xiaoyun Li*. Smooth Flipping Probability for Differential Private Sign Random Projection Methods, Neural Information Processing Systems (NeurIPS) 2023 [paper].
- Chenglin Fan*, Ping Li* and Xiaoyun Li*. k-Median Clustering via Metric Embedding: Towards Better Initialization with Differential Privacy, Neural Information Processing Systems (NeurIPS) 2023 [paper].
- Chenglin Fan*, Ping Li* and Xiaoyun Li*. LSDS++: Dual Sampling for Accelerated k-means++, International Conference on Machine Learning (ICML) 2023 [paper].
- Chenglin Fan*, Ping Li* and Xiaoyun Li*. Private Graph All-Pairwise-Shortest-Path Distance Release with Improved Error Rate, Neural Information Processing Systems (NeurIPS) 2022, [paper].
- Huang Fang, Xiaoyun Li, Chenglin Fan and Ping Li. Improved Convergence of Differential Private SGD with Gradient Clipping, International Conference on Learning Representations (ICLR) 2023, [paper].
- Xueyan Niu, Xiaoyun Li and Ping Li. Learning Cluster Causal Diagrams: An Information-Theoretic Approach, International Joint Conference on Artificial Intelligence (IJCAI) 2022, [paper].
Computer Vision:
Tan Yu, Xiaoyun Li and Ping Li. Fast and Compact Bilinear Pooling by Shifted Random Maclaurin, AAAI Conference on Artificial Intelligence (AAAI) 2021, [paper].