ASPLOS’24 – Lightning Talks – Session 2D – SpecInfer: Accelerating Large Language Model Serving with

Channel: ACM SIGARCH
493 views • 28/06/2024