Xi Lin

林熙

Hangzhou, Zhejiang · erix0-remove-25@outlook.com

🏫 I’m a senior student at Zhejiang University, majoring in Computer Science and Technology.

🔭 I’m interested in Efficient AI through algorithm–system co-design, focusing on hardware-friendly sparse and quantized module design as well as efficient inference strategies.

🚀 I’m currently a leader of ZJUSCT, the supercomputing team at Zhejiang University, which has won several international supercomputing competitions.

News

📢 New Preprint: TriAttention

2026-04-07

Excited to share our latest preprint on TriAttention, an efficient KV cache compression method for long reasoning in large language models. By leveraging Q/K concentration in pre-RoPE space and a trigonometric series to score key importance, TriAttention matches Full Attention reasoning accuracy on AIME25 while achieving 2.5x higher throughput or 10.7x KV memory reduction. Check out the project page and code!

📢 New Preprint: Pyramid Sparse Attention (PSA)

2025-12-03

Excited to share our latest preprint on Pyramid Sparse Attention (PSA), a novel attention mechanism designed to improve efficiency in video understanding and generation tasks. PSA uses a multi-level sparse attention strategy to allocate computational resources effectively, letting the model focus on critical tokens while reducing redundancy. Check out the paper on the project page!

Publications

A collection of my research publications and academic papers.

TriAttention: Efficient Long Reasoning with Trigonometric KV Compression

Weian Mao, Xi Lin, Wei Huang, Yuxin Xie, Tianfu Fu, Bohan Zhuang, Song Han, Yukang Chen

Preprint · April 2026

TriAttention proposes a novel KV cache compression approach for long reasoning in LLMs. It leverages trigonometric series based on fixed centers in pre-RoPE space to score key importance, achieving 2.5x higher throughput or 10.7x KV memory reduction while matching Full Attention reasoning accuracy.

PSA: Pyramid Sparse Attention for Efficient Video Understanding and Generation

Xiaolong Li*, Youping Gu*, Xi Lin*, Weijie Wang, Bohan Zhuang

* Equal contribution

Preprint · December 2025

PSA introduces an efficient attention mechanism to accelerate video understanding and generation. It uses a multi-level sparse attention strategy that mitigates information loss while preserving computational efficiency under a low compute budget.

Experience

Team Leader

Lead a team of students competing in supercomputing competitions, focusing on machine learning systems (MLSys) and high-performance computing (HPC) solutions.

May 2025 - Present

Teaching Assistant

HPC101, Zhejiang University

Assisted in the HPC101 course, focusing on high-performance computing concepts and practical applications.

  • Contributed to the design and implementation of the deep learning experiments.
  • Gave a lecture on deep learning systems.
  • Collaborated with other TAs on scheduling the course, refining course materials, and supporting students.

Summer 2025

Member

Contributed to the development of HPC solutions and participated in various supercomputing competitions both domestically and internationally.

October 2023 - May 2025

Teaching Assistant

Computer Architecture Course, Zhejiang University

Assisted Prof. Chang in teaching the Computer Architecture course.

  • Provided support in grading assignments and exams.
  • Refined course and experiment materials.
  • Reviewed students’ lab work and provided feedback.

Spring 2025

Teaching Assistant

HPC101, Zhejiang University

Assisted in the HPC101 course, focusing on high-performance computing concepts and practical applications.

  • Contributed to the design and implementation of the MPI/OpenMP experiments.
  • Gave a lecture on the MPI and OpenMP parallel programming models with co-lecturers Chenxiao Li and Jiarui Guo.

Summer 2024

Awards

5th Linear Solver Algorithm and Performance Optimization Competition

Third Prize

Aug 2025

Zhejiang University Scholarship

First Prize

Oct 2024

4th Linear Solver Algorithm and Performance Optimization Competition

Second Prize

Jul 2024

ISC 2024 Student Cluster Competition (online)

4th Place

May 2024

ASC24 Student Supercomputer Challenge

Second Prize

Feb 2024

Zhejiang University Scholarship

Third Prize

Oct 2023

Education

Zhejiang University

Bachelor of Science
Computer Science and Technology

2022 - Present

Blog

Visit my personal blog to read about my thoughts, experiences, and technical insights.
