Fuwen Tan

I work on making image/video generation models and large language models faster, with recent work on distillation, quantization, and efficient inference. I am a main developer of MAI-Image-2.5-Flash, an efficiency-optimized variant of MAI-Image-2.5, and received a Best Paper Finalist at CVPR 2019.

Research

ICME 2013Depth filtering

High-Quality Kinect Depth Filtering For Real-time 3D Telepresence

Mengyao Zhao, Fuwen Tan, Chi-Wing Fu, Chi-Keung Tang, Jianfei Cai, Tat Jen Cham

SIGGRAPH Asia 2012Shape composition

Field-guided Registration for Feature-conforming Shape Composition

Hui Huang, Minglun Gong, Daniel Cohen-Or, Yaobin Ouyang, Fuwen Tan, Hao Zhang

Thesis

University of Virginia mark

Learning Local Representations of Images and Text

Images and text exhibit hierarchical structures: scenes are built from objects, sentences from words. This thesis develops techniques for learning local representations of images and text, with applications in visual recognition, retrieval, and synthesis.