About
I'm currently working on inference engines for LLMs.
You can find more about my journey here.
Recent Posts
TensorRT LLM first impressions
Published: at 02:22 PM
Learnings from building a TensorRT engine for serving Llama-3 on an RTX 4090.