Publications

publications by categories in reversed chronological order. * denotes equal contribution.

preprint

  1. Baichuan-Omni Technical Report
    Li, Yadong, Sun, Haoze, Lin, Mingan, Li, Tianpeng, Dong, Guosheng, Zhang, Tao, Ding, Bowen, Song, Wei, Cheng, Zhenglin, Huo, Yuqi, Chen, Song, Lu, Xu, Pan, Da, Zhang, Shusen, Wu, Xin, Liang, Zheng, Liu, Jun, Zhang, Tao, Lu, Keer, Zhao, Yaqi, Shen, Yanjun, Yang, Fan, Yu, KaichengLin, Tao, Xu, Jianhua, Zhou, Zenan, and Chen, Weipeng
    (arXiv:2410.08565) preprint
  2. Co-Evolution Learning
    Long, Zhenwei*, Sun, Peng*Cheng, ZhenglinLiu, Deyuan, and Lin, Tao
    preprint
  3. Dynamic Mixture of Experts: An Auto-tuning Approach for Efficient Transformer Models
    (arXiv:2405.14297) preprint
  4. DeFT: Decoding with Flash Tree-Attention for Efficient Tree-structured LLM Inference
    (arXiv:2404.00242), abridged in ICLR workshop AGI (Oral). preprint
  5. ELICIT: LLM Augmentation Via External In-context Capability
    Wang, Futing*, Yan, Jianhao*, Zhang, Yue, and Lin, Tao
    (arXiv:2410.09343) preprint
  6. PathGen-1.6M: 1.6 Million Pathology Image-text Pairs Generation through Multi-agent Collaboration
    Sun, Yuxuan, Zhang, Yunlong, Si, Yixuan, Zhu, Chenglu, Shui, Zhongyi, Zhang, Kai, Li, Jingxiong, Xinheng, Lyu, Lin, Tao†, and Yang, Lin†
    (arXiv:2407.00203) preprint
  7. Cognitive Insights and Stable Coalition Matching for Fostering Multi-Agent Cooperation
    Shao, Jiaqi, Yuan, Tianjun, Lin, TaoCao, Xuanyu, and Luo, Bing
    (arXiv:2405.18044) preprint
  8. MorphAgent: Empowering Agents through Self-Evolving Profiles and Decentralized Collaboration
    (2410.15048) preprint
  9. GIFT: Unlocking Full Potential of Labels in Distilled Dataset at Near-zero Cost
    (arXiv:2405.14736) preprint
  10. Information Compensation: A Fix for Any-scale Dataset Distillation
    ICLR workshop DMLR preprint
  11. Privacy as a Free Lunch: Crafting Initial Distilled Datasets through the Kaleidoscope
    Shi, Shuo, Sun, PengShang, XinyiDu, Tianyu, Zhang, Xuhong, Yin, Jianwei, and Lin, Tao
    preprint
  12. Client2Vec: Improving Federated Learning by Distribution Shifts Aware Client Indexing
    Guo, Yongxin, Wang, Lin, Tang, Xiaoying, and Lin, Tao
    (arXiv:2405.16233) preprint
  13. Find Your Optimal Assignments On-the-fly: A Holistic Framework for Clustered Federated Learning
    (arXiv:2310.05397) preprint
  14. Federated Unlearning: a Perspective of Stability and Fairness
    ICLR workshop PML preprint
  15. Improving Group Connectivity for Generalization of Federated Deep Learning
    Li, Zexi*, Lin, Jie*, Li, Zhiqi*, Zhu, Didi, Ye, Rui, Shen, Tao, Lin, Tao†, and Wu, Chao†
    (arXiv:2402.18949), abridged in FL@FM-NeurIPS’24 (Oral). preprint
  16. CollabEdit: Towards Non-destructive Collaborative Knowledge Editing
    Zheng, Jiamu, Zhang, JinghuaiDu, Tianyu, Zhang, Xuhong, Yin, Jianwei, and Lin, Tao
    (arXiv:2410.09508), ICLR 2024 workshop SeT LLM and ICLR 2024 workshop DPFM. preprint
  17. Switch EMA: A Free Lunch for Better Flatness and Sharpness
    Li, SiyuanLiu, Zicheng, Juanxi, Tian, Wang, Ge, Wang, Zedong, Jin, Weiyang, Wu, Di, Tan, ChengLin, Tao, Liu, Yang, Sun, Baigui, and Li, Stan Z.
    (arXiv:2402.09240) preprint

2024

  1. NeurIPS 2024
    Efficiency for Free: Ideal Data Are Transportable Representations
    Sun, Peng, Jiang, Yi, and Lin, Tao
    In Advances in Neural Information Processing Systems (NeurIPS) 2024
  2. NeurIPS 2024
    Cooperative Hardware-Prompt Learning for Snapshot Compressive Imaging
    In Advances in Neural Information Processing Systems (NeurIPS) 2024
  3. ECCV 2024 (Oral)
    PathMMU: A Massive Multimodal Expert-Level Benchmark for Understanding and Reasoning in Pathology
    Sun, Yuxuan, Wu, Hao, Zhu, Chenglu, Zheng, Sunyi, Chen, Qizi, Zhang, Kai, Zhang, Yunlong, Wan, Dan, Lan, Xiaoxiao, Zheng, Mengyue, Li, Jingxiong, Lyu, Xinheng, Lin, Tao†, and Yang, Lin†
    In European Conference on Computer Vision (ECCV), Best Paper Candidate @ ECCV 2024
  4. ICML 2024
    FedRC: Tackling Diverse Distribution Shifts Challenge in Federated Learning by Robust Clustering
    In International Conference on Machine Learning (ICML), abridged in ICLR Workshop ML4IoT 2024
  5. CVPR 2024
    On the Diversity and Realism of Distilled Dataset: An Efficient Dataset Distillation Paradigm
    Sun, PengShi, Bei, Yu, Daiwei, and Lin, Tao
    In Computer Vision and Pattern Recognition Conference (CVPR) 2024
  6. ICLR 2024
    Towards Robust Multi-Modal Reasoning via Model Selection
    Liu, Xiangyan*, Li, Rongxue*, Ji, Wei, and Lin, Tao
    In International Conference on Learning Representations (ICLR) 2024
  7. ICLR 2024
    Increasing Model Capacity for Free: A Simple Strategy for Parameter Efficient Fine-tuning
    Song, Haobo*, Zhao, Hao*, Majumder, Soumajit, and Lin, Tao
    In International Conference on Learning Representations (ICLR) 2024
  8. Journal
    Decentralized Gradient Tracking with Local Steps
    Optimization Methods and Software 2024

2023

  1. NeurIPS 2023
    DELTA: Diverse Client Sampling for Fasting Federated Learning
    Wang, Lin, Guo, YongxinLin, Tao, and Tang, Xiaoying
    In Advances in Neural Information Processing Systems (NeurIPS) 2023
  2. ICCV 2023
    No Fear of Classifier Biases: Neural Collapse Inspired Federated Learning with Synthetic and Fixed Classifier
    In International Conference on Computer Vision (ICCV) 2023
  3. ICML 2023
    On Pitfalls of Test-time Adaptation
    In International Conference on Machine Learning (ICML), abridged in ICLR Workshop on Trustworthy ML 2023 and ICLR Workshop on DG (spotlight) 2023
  4. ICML 2023
    FedBR: Improving Federated Learning on Heterogeneous Data via Local Learning Bias Reduction
    In International Conference on Machine Learning (ICML) 2023
  5. ICML 2023
    Revisiting Weighted Aggregation in Federated Learning with Neural Networks
    In International Conference on Machine Learning (ICML) 2023
  6. ICLR 2023
    Test-Time Robust Personalization for Federated Learning
    In International Conference on Learning Representations (ICLR) 2023

2022

  1. IMWUT 2022
    Learning Disentangled Behaviour Patterns for Wearable-based Human Activity Recognition
    Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies (IMWUT) 2022
  2. OPT 2022
    Decentralized Stochastic Optimization with Client Sampling
    Optimization and machine learning (OPT) workshop, in NeurIPS 2022

2021

  1. OPT 2021
    Understanding Memorization from the Perspective of Optimization via Efficient Influence Estimation
    Liu, Futong, Lin, Tao†, and Jaggi, Martin
    Optimization and machine learning (OPT) workshop, in NeurIPS 2021
  2. NeurIPS 2021
    RelaySum for Decentralized Deep Learning on Heterogeneous Data
    In Advances in Neural Information Processing Systems (NeurIPS) 2021
  3. NeurIPS 2021
    An Improved Analysis of Gradient Tracking for Decentralized Machine Learning
    In Advances in Neural Information Processing Systems (NeurIPS) 2021
  4. ICML 2021
    Quasi-global Momentum: Accelerating Decentralized Deep Learning on Heterogeneous Data
    In International Conference on Machine Learning (ICML) 2021
  5. ICML 2021
    Consensus Control for Decentralized Deep Learning
    In International Conference on Machine Learning (ICML) 2021
  6. preprint
    Towards Federated Learning on Time-Evolving Heterogeneous Data
    preprint (arXiv:2112.13246), appeared in FL-ICML workshop 2021

2020

  1. NeurIPS 2020
    Ensemble Distillation for Robust Model Fusion in Federated Learning
    In Advances in Neural Information Processing Systems (NeurIPS) 2020
  2. NeurIPS 2020
    On the Loss Landscape of Adversarial Training: Identifying Challenges and How to Overcome Them
    In Advances in Neural Information Processing Systems (NeurIPS) 2020
  3. ICML 2020
    Extrapolation for Large-batch Training in Deep Learning
    In International Conference on Machine Learning (ICML) 2020
  4. EMNLP 2020
    Masking as an Efficient Alternative to Finetuning for Pretrained Language Models
    In Empirical Methods in Natural Language Processing (EMNLP) 2020
  5. ICLR 2020
    Decentralized Deep Learning with Arbitrary Communication Compression
    In International Conference on Learning Representations (ICLR) 2020
  6. ICLR 2020
    Don’t Use Large Mini-batches, Use Local SGD
    In International Conference on Learning Representations (ICLR) 2020
  7. ICLR 2020
    Dynamic Model Pruning with Feedback
    In International Conference on Learning Representations (ICLR) 2020
  8. CLVision 2020
    Generalized Class Incremental Learning
    Mi, Fei*Kong, Lingjing*Lin, TaoYu, Kaicheng, and Faltings, Boi
    In Conference on Computer Vision and Pattern Recognition (CVPR) Workshop on Continual Learning 2020

2019

  1. ICML 2019
    Exploring Interpretable LSTM Neural Networks over Multi-variable Data
    In International Conference on Machine Learning (ICML) 2019
  2. TPDS 2019
    Ga-par: Dependable Microservice Orchestration Framework for Geo-distributed Clouds
    Wen, ZhenyuLin, TaoYang, Renyu, Ji, Shouling, Ranjan, Rajiv, Romanovsky, Alexander, Lin, Changting, and Xu, Jie
    IEEE Transactions on Parallel and Distributed Systems (TPDS) 2019

2018

  1. NeurIPS 2018
    Training DNNs with Hybrid Block Floating Point
    In Advances in Neural Information Processing Systems (NeurIPS) 2018

2017

  1. IJCAI 2017
    Hybrid Neural Networks for Learning the Trend in Time Series
    In Proceedings of the twenty-sixth international joint conference on artificial intelligence (IJCAI) 2017
  2. IEEE 2017
    Fog Orchestration for IoT Services: Issues, Challenges and Directions
    Wen, ZhenyuYang, Renyu, Garraghan, Peter, Lin, TaoXu, Jie, and Rovatsos, Michael
    IEEE Internet Computing 2017
  3. VLDB 2017
    Heterogeneous Recommendations: What You Might Like to Read after Watching Interstellar
    Proceedings of the Very Large Data Base Endowment (VLDB) 2017