I'm a CS student at UT Austin, passionate about building solutions to real problems.
I'm currently designing a resource allocation mechanism that reduces the variability in encoding time across input modalities in multimodal AI models, with the goal of shrinking pipeline bubbles during inference.