A Selective Survey of Efficient Speculative Decoding Techniques for LLM Inference
(blog.codingconfessions.com)