My biggest areas of expertise are generative AI and video understanding with AI.
My early works include:
These works investigate the statistical aspects of styles for diffusion, ways to speed up the optimization process, the generation of motion sequences, and the creation of bokeh effects with time-varying prompts.
After SD models became widely available, we focused on various customization and consistency methods for applications. We were among the first to investigate inference-time customization, which has since become a major area in the field.
In these more recent works, my focus has been on generation consistency, video generation, and leveraging agent abilities to generate video data.
In these works, we processed videos with distributed pipelines. We applied transformers, state-of-the-art multimodal models, detection models, and other tools to process soccer videos.