TITLE: Automatic Ontology Construction
SPEAKER: Prof. Lu Qin, Department of Computing, The Hong Kong Polytechnic University
CHAIR: Prof. Chengqing Zong
TIME: 10:00 am, March 18, 2010
VENUE: Room 1115, 11th floor
An ontology representing a domain-specific knowledge space is constructed from domain-specific terms. The concepts behind these terms are described by their attributes and by the relations among the concepts. The discovery of new terms is most useful when it extends the knowledge of a domain. This requires both identifying new terms and finding the relationships between these new terms and the existing concepts in the ontology.
This talk is composed of two parts. In the first part, a new terminology extraction method will be presented. Rather than modeling the features of the terms themselves, the algorithm identifies features of term delimiters, which are relatively stable and domain-independent. For term verification, a method based on link analysis is proposed to calculate the relevance between term candidates and the sentences in the domain-specific corpus from which the candidates are extracted.
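The delimiter idea can be illustrated with a minimal sketch. It is not the speaker's algorithm: the fixed DELIMITERS set and the extract_candidates function are hypothetical stand-ins, and the method described in the talk identifies delimiters from their features instead of reading them from a hand-written list. The sketch only shows how text between delimiters yields term candidates, which would still need verification.

```python
import re

# Hypothetical delimiter list, for illustration only. The method in the talk
# identifies delimiters from their features rather than from a fixed list.
DELIMITERS = {"the", "a", "an", "of", "is", "are", "by", "in", "and", "to", "for", "with"}


def extract_candidates(sentence, delimiters=DELIMITERS):
    """Split a sentence at delimiter words and punctuation.

    The token spans left between delimiters are returned as term candidates.
    """
    # Words and single punctuation marks become separate tokens.
    tokens = re.findall(r"\w+|[^\w\s]", sentence.lower())
    candidates, current = [], []
    for tok in tokens:
        if tok in delimiters or not tok[0].isalnum():
            # A delimiter closes the candidate collected so far, if any.
            if current:
                candidates.append(" ".join(current))
                current = []
        else:
            current.append(tok)
    if current:
        candidates.append(" ".join(current))
    return candidates
```

For the sentence "The support vector machine is trained by gradient descent.", this sketch returns the candidates "support vector machine", "trained", and "gradient descent". A non-term such as "trained" is the kind of candidate a later verification step is meant to filter out.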
In the second part, a clustering-based method is presented for domain-relevant relation extraction, covering both relation type discovery and relation instance extraction. Given two raw corpora, one in the general domain and one in an application domain, domain-specific verbs connecting different instances are extracted based on syntactic dependencies and a small set of seed instances of domain concepts. Relation types are then discovered through verb clustering, followed by relation instance extraction.
Prof. Lu has over 20 years of experience in both academic research and industrial applications, covering open systems design, standardization, and natural language processing. Her earlier work on codeset announcement has been widely adopted in modern operating systems and programming languages to handle different encodings for different language environments. She helped establish a structured encoding of Chinese character components, which led to a much faster encoding process for Chinese characters. She spearheaded the standardization of the Hong Kong Supplementary Character Set, the first and only commonly adopted character set for Hong Kong. She is the Rapporteur of ISO/IEC JTC1/SC2/WG2/IRG.
Prof. Lu's research focuses mainly on applying natural language processing methods to information extraction and text mining. She has conducted extensive work on Chinese collocation extraction, terminology extraction, and ontology construction. Her research has received over 2 million in funding from the CERG and over 10 million in funding from the ITF. Her leadership also led to the completion of ASAB98, a project funded by the Hong Kong Jockey Club that uses text-to-speech technology to help visually impaired people access computers and the internet. Prof. Lu received her B.S. in E.E. from Beijing Normal University, and her M.S. and Ph.D. in computer science from the University of Illinois at Urbana-Champaign.