I'm a CS student at UT Austin, passionate about building solutions to real problems.
Here are some notes to myself.
Designing a resource allocation mechanism to reduce the variability in encoding time for input modalities in multimodal AI models, reducing pipeline bubbles during inference