About
Welcome to my personal blog, where I share insights from my experiences in programming and AI. I’m currently building cutting-edge inference engines for large language models (LLMs) at Nvidia.
You can find more about me here.
Recent Posts
Why Numeric Responses Alone Fall Short for Function Calling in LLMs
Published: at 04:33 PM
TensorRT LLM first impressions
Published: at 02:22 PM
Learnings from building a TensorRT engine for serving Llama-3 on an RTX 4090.