Eldar Kurtić

Eldar Kurtić

Principal Research Scientist

Find Eldar here:

Eldar is a research scientist specializing in efficient inference techniques for large machine learning models, with a particular focus on sparsity, quantization, and speculative decoding. His work focuses on developing methods to accelerate inference within the vLLM engine, bridging cutting-edge research with practical deployment solutions. In his spare time, Eldar enjoys making GPUs go brrr.