ASPLOS’24 – Lightning Talks – Session 2D – SpecInfer: Accelerating Large Language Model Serving with Channel: ACM SIGARCH493 views • 28/06/2024