Lecture Series in Pattern Recognition
题 目 (TITLE)：Cross-media retrieval, hashing and ranking
讲 座 人 (SPEAKER)：Prof. Fei Wu, Zhejiang University
主 持 人 (CHAIR)： Dr. Jitao Sang
时 间 (TIME)： August 15(Thursday), 2013, 11:05 - 11:30 AM
地 点 (VENUE)：No.1 Conference Room (3rd floor), Intelligence Building
Nowadays, many real-world applications involve multimodal data. Cross-media retrieval is imperative to many applications of practical interest, such as finding relevant textual documents of a tourist spot that best match a given image of the spot or finding a set of images that visually best illustrate a given text description. However, the heterogeneity-gap between multi-modal data has been widely understood as a fundamental barrier to successful cross-media retrieval. In this talk, I will introduce three recent works in cross-media retrieval: a) Supervised coupleddictionary learning with group structures for Multi-Modal retrieval: this method utilizes the class information to jointly learn discriminative multi-modal dictionaries as well as mapping functions between different modalities for cross-media retrieval; b)Sparse Multi-Modal Hashing: this method obtains the sparse codesets for the data objects across different modalities via joint multi-modal dictionary learning and therefore expedites the ANN search for cross-media data;c) Cross-Media Semantic Representation via Bi-directional Learning to Rank: this method considers learning a cross-media representation model from the perspective of optimizing a listwise ranking problem while taking advantage of bi-directional ranking examples.
Fei Wu received his B.Sc., M.Sc. and Ph.D. degrees in computer science from Lanzhou University, University of Macau and Zhejiang University in 1996, 1999 and 2002 respectively. From October, 2009 to August 2010, Fei Wu was a visiting scholar at Prof. Bin Yu's group, University of California, Berkeley. Currently, He is a full professor at the college of computer science, Zhejiang University. He is the vice-director of institute of artificial intelligence of Zhejiang University and the vice-director of Key Laboratory of Visual Perception (Zhejiang University) , Ministry of Education and Microsoft. He serves as the PC member of ACM Multimedia 2012, 2013. He was awarded the Program for New Century Excellent Talents in University (NCET) by the Ministry of Education in 2012. His research interests mainly include multimedia retrieval, sparse representation and machine learning.