Mingxiao Huo

I'm an ece master student at Carnegie Mellon University in Pittsburgh, my research interests lie in ai, robotics and system.

At CMU I work with Prof. Katerina Fragkiadaki on 3d vision and robotics. Prevously, I was working in UC Berkeley with Prof. Masayoshi Tomizuka(distinguished member of National Academy of Engineering) and Prof. Mingyu Ding on robotics.

Email  /  Github

profile photo

Selected Publications

I'm interested in robotics, deep learning, 3d vision, and computer system. Some papers are highlighted.

Human-oriented Representation Learning for Robotic Manipulation
Mingxiao Huo, Mingyu Ding, Chenfeng Xu, Thomas Tian, Xinghao Zhu, Yao Mu, Lingfeng Sun, Masayoshi Tomizuka, Wei Zhan
RSS, 2024 (Oral)
project page / arXiv / paper

Keywords: Manipulation, Representation Learning, Computer Vision

Sparse Diffusion Policy: A Sparse, Reusable, and Flexible Policy for Robot Learning
Mingxiao Huo*, Yifei Zhang*, Yixiao Wang*, Thomas Tian, Xiang Zhang, Yichen Xie, Chenfeng Xu, Pengliang Ji, Wei Zhan, Mingyu Ding, Masayoshi Tomizuka
CoRL, 2024
project page / arXiv / paper / code

Keywords: Manipulation, Mixure of Experts, Policy Learning

Joint Pedestrian Trajectory Prediction through Posterior Sampling
Haotian Lin, Yixiao Wang, Mingxiao Huo, Chensheng Peng, Zhiyuan Liu, Masayoshi Tomizuka
IROS, 2024 (Oral Pitch)
project page / arXiv

Keywords: Diffusion Model, Motion Prediction

AbHE: All Attention-Based Homography Estimation
Mingxiao Huo, Zhihao Zhang, Xinyang Ren, Xianqiang Yang, Chao Ye
IEEE Transactions on Instrumentation and Measurement(TIM), 2023
code / arXiv

Keywords: 3d Vision, Generative Model

An Empirical Study on Tree-based Speculative Decoding
Mingxiao Huo
2024 (in submission)
code(coming soon)

Keywords: Computer system, LLM serving

Robust Multi-Object 4D Generation for In-the-wild Videos
Wen-Hsuan Chu*, Lei Ke*, Jianmeng Liu*, Mingxiao Huo, Pavel Tokmakov, Katerina Fragkiadaki
CVPR, 2025
arxiv(coming soon)

Keywords: 3d Vision, 4d Reconstruction

Interesting Projects

I have done some interesting projects in computer system, ai and mechanics design.

Self-driving trash bin car

Keywords: Mechanics Design, Computer vision, Control System

video

TL;DR: A self driving car build from scratch, with motors to control speed and direction. Using computer vision algorithm to detect trash and react with controller.

A parallel render

Keywords: Cuda, Computer grapics, Parallel processing


code

TL;DR: A cuda based image rending with parallel processing in circle base.

Tactile audio imitation learning

Keywords: Tactile sensor, Audio processing

TL;DR: An imitation learing pipeline with tactile sensor and audio feedback.

UDP transport layer

Keywords: Computer network


code(coming soon)

TL;DR: A reliable transport layer based on UDP, which can transfer content files between mulJple peers.