Publications

You can also find my articles on my Google Scholar profile.

Conference Papers


Rethinking Learning Rate Tuning in the Era of Large Language Models

Published in 2023 IEEE 5th International Conference on Cognitive Machine Intelligence (CogMI), 2023

This paper examines the challenges of learning rate tuning for Large Language Models (LLMs) and introduces LRBench++, a benchmarking framework for learning rate tuning.

Recommended citation: Jin, H., Wei, W., Wang, X., Zhang, W., & Wu, Y. (2023). "Rethinking Learning Rate Tuning in the Era of Large Language Models." 2023 IEEE 5th International Conference on Cognitive Machine Intelligence (CogMI), 112-121.
Download Paper

Preprints


DA-MoE: Towards Dynamic Expert Allocation for Mixture-of-Experts Models

Published in arXiv Preprint, 2024

This paper proposes DA-MoE, a novel dynamic routing mechanism for Mixture-of-Experts (MoE) models that allocates experts efficiently based on token importance.

Recommended citation: Aghdam, M. A., Jin, H., & Wu, Y. (2024). "DA-MoE: Towards Dynamic Expert Allocation for Mixture-of-Experts Models." arXiv Preprint. arXiv:2409.06669.
Download Paper