Multimedia Processing
The explosive growth of multimedia information over the Internet makes it an urgent task to develop intelligent systems for automatic multimedia processing. Our multimedia research focuses on intelligent processing of image, video, and computer graphics. Current topics range from image and video indexing and retrieval, image and video based biometrics, video based character recognition, to 3D graphics.
Research Projects
1. Chinese News Video Retrieval Engine
As digital video libraries and archives of immense size are becoming available over data networks, efficient video retrieval and browsing have become crucially important. In this project, we construct a Chinese news video retrieval engine using a range of newly developed technology, including automatic news video story parsing, video caption.

2. IDface: Identification and Detection of Face
Automatic face identification and detection has great potential in a large array of application areas, including banking and security system access authentication, video surveillance, mugshot matching for law enforcement, duplicate ID card identification, face-based video compression for video conferencing, and information retrieval. In this project, we develop various algorithms and tools for applications related to face recognition.


3. Video-based Chinese Character Recognition
We develop a handwritten Chinese character recognition system using a video camera. It allows users to write on any regular paper just like using an off-line system. At the same time, using a video camera attached on the computer, the system can capture the stroke temporal information similar to an on-line system.




4. 3D Object Reconstruction from Single 2D Line Drawings
Multimedia applications extensively use 3D models. A 2D line drawing is the simplest and most straightforward way of illustrating a 3D object. It is very helpful if such a drawing can be used for generating a 3D model directly. We have developed a tool that can reconstruct the 3D model with planar faces from a single 2D line drawing. Our approach is finding face topology first and then doing 3D geometry reconstruction.




