
2013-9-24 Document Image Classification and Retrieval

Lecture Series in Pattern Recognition

 

(TITLE): Document Image Classification and Retrieval

(SPEAKER): Prof. David Doermann (University of Maryland, College Park, US)

(CHAIR): Prof. Chenglin Liu

(TIME): September 24 (Tuesday), 2013, 14:30-16:30

(VENUE): No.1 Conference Room (3rd floor), Intelligence Building

ABSTRACT:

Traditional approaches to document retrieval focus on conversion to electronic text followed by indexing of the text content. Recently, some work in the community has focused on indexing document image content directly. In this talk, we will give an overview of work at Maryland on classification and indexing that scales to millions of documents.

First, we present a learning-based approach for computing structural similarities among document images for unsupervised exploration of large document collections. The approach is based on multiple levels of content and structure. At the local level, a bag-of-visual-words representation based on SURF features provides an effective way of computing content similarity. The document is then recursively partitioned, and a histogram of codewords is computed for each partition. Structural similarity is computed using a random forest classifier trained on these histogram features. We experiment with three diverse datasets of document images that vary in size, degree of structural similarity, and type of document image.

Second, we present a scalable algorithm for segmentation-free content retrieval in document images. The contributions of this work include the use of the SURF feature for image passage retrieval, a novel indexing algorithm for efficient retrieval of SURF features, and a method to filter results using the orientation of local features and geometric constraints. Results demonstrate that logo, signature block, and stamp retrieval can be performed with high accuracy and scaled efficiently to large datasets.
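The recursive partitioning step of the first approach can be illustrated with a minimal sketch. It is not the speaker's implementation: it assumes that SURF keypoints have already been extracted and quantized to codeword indices, and that each region is split into four equal quadrants (the abstract does not say how the partitions are chosen). The names `codeword_histogram` and `partition_histograms` are hypothetical.

```python
from typing import List, Tuple

# A quantized keypoint: (x, y, codeword index). Extraction of SURF features
# and their assignment to codewords are assumed to have happened upstream.
Keypoint = Tuple[float, float, int]


def codeword_histogram(points: List[Keypoint], vocab_size: int) -> List[int]:
    """Count how often each codeword occurs among the given keypoints."""
    hist = [0] * vocab_size
    for _, _, word in points:
        hist[word] += 1
    return hist


def partition_histograms(points: List[Keypoint],
                         x0: float, y0: float, x1: float, y1: float,
                         vocab_size: int, depth: int) -> List[int]:
    """Histogram for the region [x0, x1) x [y0, y1), followed by the
    histograms of its four quadrants, recursively, down to `depth` levels.

    The quadrant split is an assumption made for illustration only.
    """
    inside = [p for p in points if x0 <= p[0] < x1 and y0 <= p[1] < y1]
    features = codeword_histogram(inside, vocab_size)
    if depth > 0:
        xm, ym = (x0 + x1) / 2, (y0 + y1) / 2
        quadrants = ((x0, y0, xm, ym), (xm, y0, x1, ym),
                     (x0, ym, xm, y1), (xm, ym, x1, y1))
        for qx0, qy0, qx1, qy1 in quadrants:
            features += partition_histograms(inside, qx0, qy0, qx1, qy1,
                                             vocab_size, depth - 1)
    return features


# Toy page of 100 x 100 units with four keypoints and a 3-word vocabulary.
toy_points = [(10, 10, 0), (90, 10, 1), (10, 90, 1), (90, 90, 2)]
toy_features = partition_histograms(toy_points, 0, 0, 100, 100,
                                    vocab_size=3, depth=1)
```

With a vocabulary of V codewords and d levels of quadrant splitting, the concatenated vector has V(1 + 4 + ... + 4^d) entries. A vector of this kind is what a classifier such as the random forest mentioned in the abstract would take as input; the training step is not shown here.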

Dr. Doermann will be available to meet with students. He will also highlight the University of Maryland graduate program as part of his talk, so students considering graduate school in the US are encouraged to attend.

 

BIOGRAPHY:

Dr. David Doermann is a senior research scientist in UMIACS. He received a B.Sc. degree in Computer Science and Mathematics from Bloomsburg University in 1987, and an M.Sc. degree in 1989 in the Department of Computer Science at the University of Maryland, College Park. He continued his studies in the Computer Vision Laboratory, where he earned a Ph.D. in 1993. Since 1993, he has served as co-director of the Laboratory for Language and Media Processing in the University of Maryland's Institute for Advanced Computer Studies and as an adjunct member of the graduate faculty.

His team of researchers focuses on topics related to document image analysis and multimedia information processing. In 2002 he received an Honorary Doctorate of Technology Sciences from the University of Oulu for his contributions to digital media processing and document analysis research. He is a founding co-editor of the International Journal on Document Analysis and Recognition, has been the General Chair or Co-Chair of over half a dozen international conferences and workshops, and was the General Chair of the International Conference on Document Analysis and Recognition (ICDAR) held in Washington DC in 2013. He has over 30 journal publications and over 160 refereed conference papers.

NLPR, INSTITUTE OF AUTOMATION, CHINESE ACADEMY OF SCIENCES