Scaling Laws Meet Model Architecture: Toward Inference-Efficient LLMs
Song Bian, Tao Yu, Shivaram Venkataraman, Youngsuk Park.
PC/Reviewer: NeurIPS, ICML, AISTATS, ICLR, KDD, SDM
Song Bian, Tao Yu, Shivaram Venkataraman, Youngsuk Park.
Ahmed Khaled, Kaan Ozkara, Tao Yu, Mingyi Hong, Youngsuk Park.
Albert Tseng, Tao Yu, Youngsuk Park.
Kaan Ozkara, Tao Yu, Youngsuk Park.
Tao Yu, Gaurav Gupta, Karthick Gopalswamy, Amith R Mamidala, et al.
Tao Yu, Congzheng Song, Jianyu Wang, Mona Chitnis.
Tao Yu*, Toni J.B. Liu*, Albert Tseng, Christopher De Sa.
Albert Tseng, Tao Yu, Toni J.B. Liu, Christopher De Sa.
Tao Yu, Christopher De Sa.
Tao Yu*, Yichi Zhang*, Zhiru Zhang, Christopher De Sa.
Tao Yu, Wentao Guo, Jianan Canal Li, Tiancheng Yuan, Christopher De Sa.
Tao Yu, Christopher De Sa.
Tao Yu, Eugene Bagdasaryan, Vitaly Shmatikov.
Tao Yu, Christopher De Sa.
Tao Yu*, Shengyuan Hu*, Chuan Guo, Weilun Chao, Kilian Q. Weinberger.
Felix Wu*, Tianyi Zhang*, Amauri Holanda de Souza Jr.*, Christopher Fifty, Tao Yu, Kilian Q. Weinberger.
Tao Yu, Huan Long, John Hopcroft.
Talk at IJCAI 2025, Palais des congrès Montréal, QC, Canada
Talk at SANDS Seminar Series, S3S 2024, KAUST
Workshop talk at Workshop on Privacy Preserving Machine Learning, PPML 2024, Online
Talk at VSAONLINE 2022, Online
Spotlight talk at NeurIPS 2019, Vancouver, BC, Canada