Fuwen Tan

fuwen.tan@gmail.com

About me

I'm a researcher in efficient machine learning, working on making LLMs and diffusion models faster.

Research

Progressive Mixed-Precision Decoding for Efficient LLM Inference

Hao Mark Chen, Fuwen Tan, Alexandros Kouris, Royson Lee, Hongxiang Fan, Stylianos I. Venieris

International Conference on Learning Representations (ICLR), 2025.

MobileQuant: Mobile-friendly Quantization for On-device Language Models

Fuwen Tan, Royson Lee, Lukasz Dudziak, Shell Xu Hu, Sourav Bhattacharya, Timothy Hospedales, Georgios Tzimiropoulos, Brais Martinez

Conf. on Empirical Methods in Natural Language Processing, EMNLP Findings, 2024

Effective Self-supervised Pre-training on Low-compute Networks without Distillation

Fuwen Tan, Fatemeh Saleh, Brais Martinez

International Conference on Learning Representations (ICLR), 2023.

iBoot: Image-bootstrapped Self-Supervised Video Representation Learning

Fatemeh Saleh, Fuwen Tan, Adrian Bulat, Georgios Tzimiropoulos, Brais Martinez

EdgeViTs: Competing Light-weight CNNs on Mobile Devices with Vision Transformers

Junting Pan, Adrian Bulat, Fuwen Tan, Xiatian Zhu, Lukasz Dudziak, Hongsheng Li, Georgios Tzimiropoulos, Brais Martinez

European Conference on Computer Vision (ECCV), 2022.

Instance-level Image Retrieval using Reranking Transformers

Fuwen Tan, Jiangbo Yuan, Vicente Ordonez

International Conference on Computer Vision (ICCV), 2021.

Curriculum Labeling: Self-paced Pseudo-Labeling for Semi-Supervised Learning

Paola Cascante-Bonilla, Fuwen Tan, Yanjun Qi, Vicente Ordonez

AAAI Conference on Artificial Intelligence (AAAI), 2021.

Drill-down: Interactive Retrieval of Complex Scenes using Natural Language Queries

Fuwen Tan, Paola Cascante-Bonilla, Xiaoxiao Guo, Hui Wu, Song Feng, Vicente Ordonez

Conf. on Neural Information Processing Systems (NeurIPS), 2019

Text2Scene: Generating Compositional Scenes from Textual Descriptions

Fuwen Tan, Song Feng, Vicente Ordonez

Conf. on Computer Vision and Pattern Recognition (CVPR), 2019 (Oral presentation + Best Paper Finalist)

Posts from NVIDIA Developer News, IBM Research Blog

Where and Who? Automatic Semantic-Aware Person Composition

Fuwen Tan, Crispin Bernier, Benjamin Cohen, Vicente Ordonez, Connelly Barnes

Winter Conf. on Applications of Computer Vision (WACV), 2018

FaceCollage: A Rapidly Deployable System for Real-time Head Reconstruction for On-The-Go 3D Telepresence

Fuwen Tan, Chi-Wing Fu, Teng Deng, Jianfei Cai, Tat Jen Cham

ACM Multimedia (ACM MM, full paper), 2017

High-Quality Kinect Depth Filtering For Real-time 3D Telepresence

Mengyao Zhao, Fuwen Tan, Chi-Wing Fu, Chi-Keung Tang, Jianfei Cai, Tat Jen Cham

Conf. on Multimedia and Expo (ICME), 2013

Field-guided Registration for Feature-conforming Shape Composition

Hui Huang, Minglun Gong, Daniel Cohen-Or, Yaobin Ouyang, Fuwen Tan, Hao Zhang

ACM Transactions on Graphics (Proc. SIGGRAPH Asia), 2012

Thesis

PhD Dissertation: Learning Local Representations of Images and Text

Images and text inherently exhibit hierarchical structures, e.g. scenes built from objects, sentences built from words. In many computer vision and natural language processing tasks, learning accurate prediction models requires analyzing the correlation of the local primitives of both the input and output data. In this thesis, we develop techniques for learning local representations of images and text and demonstrate their effectiveness on visual recognition, retrieval, and synthesis. ...
