Publications
publications by categories in reversed chronological order. * denotes equal contribution; † denotes corresponding authorship.
2024
-
Journal
-
preprintTraining-time Neuron Alignment for Improving Linear Mode Connectivity and Model Fusionpreprint 2024
-
preprint
-
preprintPathMMU: A Massive Multimodal Expert-Level Benchmark for Understanding and Reasoning in Pathologypreprint (arXiv:2401.16355) 2024
-
preprintSwitch EMA: A Free Lunch for Better Flatness and Sharpnesspreprint (arXiv:2402.09240) 2024
-
preprintCollabEdit: Towards Non-destructive Collaborative Knowledge EditingICLR workshop SeT LLM, ICLR workshop DPFM 2024
-
preprintDEFT: Flash Tree-Attention with IO-Awareness for Efficient Tree-search-based LLM InferenceICLR workshop AGI (Oral) 2024
2023
-
NeurIPS 2023DELTA: Diverse Client Sampling for Fasting Federated LearningIn Advances in Neural Information Processing Systems (NeurIPS) 2023
-
preprintFedRC: Tackling Diverse Distribution Shifts Challenge in Federated Learning by Robust Clusteringpreprint (arXiv:2301.12379), appeared in ICLR Workshop ML4IoT 2023
-
preprintRevisiting Implicit Models: Sparsity Trade-offs Capability in Weight-tied Modelappeared in ICLR Workshop SNN 2023
-
preprint
2022
-
OPT 2022Decentralized Stochastic Optimization with Client SamplingOptimization and machine learning (OPT) workshop, in NeurIPS 2022
2021
-
OPT 2021Understanding Memorization from the Perspective of Optimization via Efficient Influence EstimationOptimization and machine learning (OPT) workshop, in NeurIPS 2021
-
NeurIPS 2021An Improved Analysis of Gradient Tracking for Decentralized Machine LearningIn Advances in Neural Information Processing Systems (NeurIPS) 2021
-
ICML 2021Consensus Control for Decentralized Deep LearningIn International Conference on Machine Learning (ICML) 2021
-
preprintTowards Federated Learning on Time-Evolving Heterogeneous Datapreprint (arXiv:2112.13246), appeared in FL-ICML workshop 2021
2020
-
ICML 2020Extrapolation for Large-batch Training in Deep LearningIn International Conference on Machine Learning (ICML) 2020
-
CLVision 2020Generalized Class Incremental LearningIn Conference on Computer Vision and Pattern Recognition (CVPR) Workshop on Continual Learning 2020
2019
-
TPDS 2019Ga-par: Dependable Microservice Orchestration Framework for Geo-distributed CloudsIEEE Transactions on Parallel and Distributed Systems (TPDS) 2019
2018
2017
-
IJCAI 2017Hybrid Neural Networks for Learning the Trend in Time SeriesIn Proceedings of the twenty-sixth international joint conference on artificial intelligence (IJCAI) 2017
-
IEEE 2017Fog Orchestration for IoT Services: Issues, Challenges and DirectionsIEEE Internet Computing 2017