First-Author Research
My research lies in Computer Vision and Artificial Intelligence. Aims to explore the potential of generative models for AIGC and Trajectory Prediction.
I worked on Diffusion Models, AIGC, VLM, Video Synthesis/Editing, Image Editing, Multimodal Learning, Trajectory Prediction, NeRF, GANs.
|
|
Token Dynamics as Long Video Representation for Video Understanding
(ongoing with Amazon)
|
|
OOSTraj: Out-of-Sight Trajectory Prediction With Vision-Positioning Denoising
Haichao Zhang, Yi Xu,
Hongsheng Lu,
Takayuki Shimizu,
Yun Fu
(First work on out-of-sight trajectory prediction.)
IEEE/CVF Conference on Computer Vision and Pattern Recognition 2024 (CVPR'24)
[to be appear soon] [arxiv] [project page] [code]
|
|
Layout Sequence Prediction From Noisy Mobile Modality
(See Beyond Vision: Denoising Diffusion Model for Layout Trajectory Prediction from Noisy Mobile Modality)
Haichao Zhang, Yi Xu,
Hongsheng Lu,
Takayuki Shimizu,
Yun Fu
31st ACM International Conference on Multimedia (ACM MM'23)[arxiv][project page][video][code(coming soon)] |
|
Camouflaged Image Synthesis Is All You Need to Boost Camouflaged Detection
Haichao Zhang, Can Qin, Yu Yin, Yun Fu
In subission [arxiv] |
|
Sketch Me A Video
Haichao Zhang, Gang Yu, Tao Chen, Guozhong Luo
preprint[arxiv] |
|
Fine-grained Identity Preserving Landmark Synthesis for Face Reenactment
Haichao Zhang, Youcheng Ben, Weixi Zhang, Tao Chen, Gang Yu, Bin Fu
preprint[arxiv] |
|
Restore DeepFakes Video Frames by Identifying Individual Motion Styles
Haichao Zhang, Zhe-Ming Lu, Hao Luo, Ya-Pei Feng
Electronic Letters [page]
|
Some Very Old Projects
Several years ago, I delved into the fascinating world of sensor modalities and signal processing, sparking a keen interest in embedded platforms. That experience led me to explore further into artificial intelligence and computer vision.
|
|
Wheelchair Control System via analysis eye-blinking EMG and EEG
Provincial Grand Prize at the Challenge Cup Competition of Science Achievement in China
project video (in Chinese)
Mar. 2017
Proposed to detect eye blink EMG noise mixed in EEG signal, which uses the intense eye blink signal to control the direction of wheelchairs, while analysis EEG to predict tension and relaxation degree to control the speed of the wheelchair.
An affordable solution for paralyzed patients to control their wheelchairs and move independently.
|
|
Low power abnormal ECG detection system based on MSP430
National First Prize at National Biomedical Engineering Innovative Design Competition
project video (in Chinese)
Nov. 2016
Responsible for developing upper computer software which received and filtered signals in the spectral domain from the MSP430 PCB board and developing an algorithm to detect the abnormal ECG.
|
|
Sign language recognition system of wearable bending sensor gloves
First Prize at Mobile Application Innovation Contest of North China
Jul. 2016
Responsible for programming the embedding microprocessor to sample the analog signal of the bending sensor on the gloves, which is used to predict the sign language, and showing prediction results on the app.
|
|
Vision-based paper money and coin sorting machine
Summer 2015
Responsible for programming the embedding microprocessors to control the mechanical structure and developing upper machine software to detect the kind of paper money in traditional image processing method, then sort them. |
|
Multimedia Information Hiding Technology of Unstructured Data
Alibaba-ZJU Joint Research Institute of Frontier Technologies Research Project
Summer 2018
Responsible for developing C++ software "Shared Memory Based Code Hiding Platform.
Particpated in video stream watermarking algorithm.
|
|
Academic Service
Reviewer
Conference:
Conference on Neural Information Processing Systems (NeurIPS)
ACM International Conference on Multimedia (ACM MM)
ICCV 2023 Workshop on Analysis and Modeling of Faces and Gestures (ICCVW)
CVPR 2024 AI for Content Creation workshop (CVPRW)
Journal:
Multimedia Tools and Applications (MTA)
ACM Transactions on Knowledge Discovery from Data (TKDD)
Teaching Assistant
DS5020 Fundamentals of Linear Algebra and Probability for Data Science
DS5110 Introduction to Data Management
|