The First International Joint Conference on Natural Language Processing (IJCNLP-04)

CONFERENCE PROGRAM

The conference program is also available in MS Word and PDF formats.

FIRST DAY: 22 March 2004 (Monday)

9:00 – 9:15	Opening Ceremony
9:15 – 9:55	Opening Address: Prof. Makoto Nagao
9:55 – 10:00	Break
10:00 – 10:50	Invited Speech: Probabilistic Models for Rich Linguistic Information (Prof. Mark Johnson)
10:50 – 11:10	Refreshment Break
	Taggers, Chunkers, Shallow Parsers – I		Text and Sentence Generation		Machine Translation and Multilinguality – I
11:10 – 11:35	High Speed Unknown Word Prediction Using Support Vector Machine For Chinese Text-to-Speech Systems Juhong Ha, Yu Zheng and Gary Geunbae Lee		Detection of Incorrect Case Assignments in Automatically Generated Paraphrases of Japanese Sentences Atsushi Fujita, Kentaro Inui and Yuji Matsumoto		Automatic Learning of Parallel Dependency Treelet Pairs Yuan Ding and Martha Palmer
11:35 – 12:00	Chinese Chunk Identification Using SVMs plus Sigmoid Yongmei Tan, Tianshun Yao, Qing Chen and Jingbo Zhu		Feature Selection and Machine Learning for Pronominalization Ji-Eun Roh and Jong-Hyeok Lee		Example-based Machine Translation without Saying Inferable Predicate Eiji Aramaki, Sadao Kurohashi, Hideki Kashioka and Hideki Tanaka
12:00 – 13:35	Lunch Break
	Statistical Models and Machine Learning for NLP – I		NLP Software and Application – I		Panel Discussion
13:35 – 14:00	A Three Level Cache-based Adaptive Chinese Language Model Junlin Zhang, Weimin Qu, Le Sun, Lin Du and Yufang Sun		Automatic Genre Detection of Web Documents Chul Su Lim, Kong Joo Lee and Gil Chang Kim		Panel on Emerging Asian Language Processing Efforts
14:00 – 14:25	Capturing Long Distance Dependency in Language Modeling: An Empirical Study Jianfeng Gao and Hisami Suzuki		Statistical Substring Reduction in Linear Time Xueqiang Lü, Le Zhang and Junfeng Hu
14:25 – 14:50	Word Folding: Taking the Snapshot of Words Instead of the Whole Jin-Dong Kim and Jun'ichi Tsujii		You don’t have to think twice if you carefully tokenize Stefan Klatt and Bernd Bohnet
14:50 – 15:00	Break
	Information Retrieval – I	Theories and Formalisms for Morphology, Syntax and Semantics – I		Poster Presentation – I
15:00 – 15:25	Information Flow Analysis with Chinese Text Paulo Cheong, Dawei Song, Peter Bruza and Kam-Fai Wong	The Automatic Acquisition of Verb Subcategorisations and their Impact on the Performance of an HPSG Parser John Carroll and Alex C. Fang		Using a Smoothing Maximum Entropy Model for Chinese Nominal Entity Tagging – Jinying Chen, Nianwen Xue and Martha Palmer
				Robust Speaker Identification System Based on Wavelet Transform and Gaussian Mixture Model – Wan-Chen Chen, Ching-Tang Hsieh and Eugene Lai
				Deterministic dependency structure analyzer for Chinese – Yuchang Cheng, Masayuki Asahara and Yuji Matsumoto
				Building a parallel bilingual syntactically annotated corpus – Jan Cuřín, Martin Čmejrek, Jiří Havelka and Vladislav Kuboň
				Fast Reinforcement Learning of Dialogue Policies using Stable Function Approximation – Matthias Denecke, Kohji Dohsaka and Mikio Nakano
				Selecting Prosody Parameters for Unit Selection Based Chinese TTS – Minghui Dong, Kim-Teng Lua and Jun Xu
				Making Use of Furigana – Gary Kacmarcik
				Stochastic Word-Spacing System with Dynamic Increase of Word List – Mi-young Kang, Sung-ja Choi, Ae-sun Yoon and Hyuk-chul Kwon
				Resolution of Modifier-Head Relation Gaps using Automatically Extracted Metonymic Expressions – Yoji Kiyota, Sadao Kurohashi and Fuyuko Kido
				Headword Percolation in a Multi-Parser Architecture for Natural Language Understanding – Po Chui Luk, Kui Xu and Helen Meng
				Recognition of HTML Table Structure – Hidetaka Masuda, Shuichi Tsukamoto and Hiroshi Nakagawa
				Improving Relevance Feedback in the Language Modeling Approach: Maximum a Posteriori Probability Criterion and Three-component Mixture Model – Seung-Hoon Na, In-Su Kang and Jong-Hyeok Lee
				A Persistent Feature-Object Database for Intelligent Text Archive Systems – Takashi Ninomiya, Jun'ichi Tsujii and Yusuke Miyao
				Improving Quality of the Web Corpus – Youichi Sekiguchi and Kazuhide Yamamoto
				Detecting sentence boundaries in Japanese speech transcriptions using a morphological analyzer – Sachie Tajima, Hidetsugu Nanba and Manabu Okumura
				Improving PinYin to Chinese Conversion with a Whole Sentence Maximum Entropy Model – Le Zhang and Tianshun Yao
				How Effective is Query Expansion for Finding Novel Information? – Min Zhang and Shaoping Ma
15:25 – 15:50	BBS Based Hot Topic Retrieval Using Back-Propagation Neural Network Lan You, Yongping Du, Jiayin Ge, Xuanjing Huang and Lide Wu	FML-Based SCF Predefinition Learning for Chinese Verbs Xiwu Han, Tiejun Zhao and Muyun Yang
15:50 – 16:20	Refreshment Break
	Semantic Disambiguation – I	Text Mining in Biomedicine – I
16:20 – 16:45	Influence of WSD on Cross-Language Information Retrieval In-Su Kang, Seung-Hoon Na and Jong-Hyeok Lee	SVM-based Biological Named Entity Recognition using Minimum Edit-Distance Feature Boosted by Virtual Examples Eunji Yi, Gary Geunbae Lee and Soo-Jun Park
16:45 – 17:10	Improving Word Sense Disambiguation by Pseudo Samples Xiaojie Wang and Yuji Matsumoto	Mining Biomedical Abstracts: What’s in a Term? Goran Nenadić, Irena Spasić and Sophia Ananiadou
17:10 – 17:20	Break
	Word Segmentation – I	Lexical Semantics, Ontology and Linguistic Resource – I		Information Extraction, Q/A – I
17:20 – 17:45	Unsupervised Segmentation of Chinese Corpus Using Accessor Variety Haodi Feng, Kang Chen, Chunyu Kit and Xiaotie Deng	Acquiring Bilingual Named Entity Translations from Content-aligned Corpora Tadashi Kumano, Hideki Kashioka, Hideki Tanaka and Takahiro Fukusima		A Novel Pattern Learning Method for Open Domain Question Answering Yongping Du, Xuanjing Huang, Xin Li and Lide Wu
17:45 – 18:10	Chinese Unknown Word Identification Using Class-based LM Guohong Fu and Kang-Kwong Luke	Visual Semantics and Ontology of Eventive Verbs Minhua Ma and Paul Mc Kevitt		Chinese Named Entity Recognition Based on Multilevel Linguistic Features Honglei Guo, Jianmin Jiang, Gang Hu and Tong Zhang
19:00	Banquet

SECOND DAY: 23 March 2004 (Tuesday)

9:00 – 9:50	Invited Speech: The Impact of Information Technology on Communication and Linguistics (Prof. Ching-Chun Hsieh)
9:50 – 10:10	Refreshment Break
	Text Mining	Dialogue and Discourse	Natural Language Technology in Mobile IR and Text Processing User Interfaces
10:10 – 10:35	A Study of Semi-Discrete Matrix Decomposition for LSI in Automated Text Categorization Qiang Wang, XiaoLong Wang and Yi Guan	Improving Noun Phrase Coreference Resolution by Matching Strings Xiaofeng Yang, Guodong Zhou, Jian Su and Chew Lim Tan	Dit4dah: Predictive Pruning For Morse Code Text Entry: Towards Entry Systems For the Seriously Impaired Kumiko Tanaka-Ishii and Ian Frank
10:35 – 11:00	Systematic Construction of Hierarchical Classifier in SVM-based Text Categorization Yongwook Yoon, Changki Lee and Gary Geunbae Lee	Zero Pronoun Resolution based on Automatically Constructed Case Frames and Structural Preference of Antecedents Daisuke Kawahara and Sadao Kurohashi	Spoken versus Written Queries for Mobile Information Access: an Experiment on Mandarin Chinese Heather Du and Fabio Crestani
11:00 – 11:25	Categorizing Unknown Text Patterns for Information Extraction Using a Search Result Mining Approach Chien-Chung Huang, Shui-Lung Chuang and Lee-Feng Chien	Combining Labeled and Unlabeled Data for Learning Cross-document Structural Relationships Zhu Zhang and Dragomir Radev	An Interactive Proofreading System for Inappropriately Selected Words on Using Predictive Text Entry Hideya Iwasaki and Kumiko Tanaka-Ishii
11:25 – 11:35	Break
	Information Retrieval – II	Theories and Formalisms for Morphology, Syntax and Semantics – II	FSA, Parsing Algorithms
11:35 – 12:00	Phoneme-based Transliteration of Foreign Names for OOV Problem Wei Gao, Kam-Fai Wong and Wai Lam	Corpus-oriented Grammar Development for Acquiring a Head-driven Phrase Structure Grammar from the Penn Treebank Yusuke Miyao, Takashi Ninomiya and Jun'ichi Tsujii	Data-Oriented Parsing and the Penn Chinese Treebank Mary Hearne and Andy Way
12:00 – 12:25	Window-based Method for Information Retrieval Qianli Jin, Jun Zhao and Bo Xu	Implementing the Syntax of Japanese Numeral Classifiers Emily M. Bender and Melanie Siegel	Iterative CKY parsing for Probabilistic Context-Free Grammars Yoshimasa Tsuruoka and Jun'ichi Tsujii
12:25 – 13:30	Lunch Break
13:30	Excursion

THIRD DAY: 24 March 2004 (Wednesday)

9:00 – 9:50	Invited Speech: Language Technology for E-Memory Applications (Prof. Hans Uszkoreit)
9:50 – 10:10	Refreshment Break
	Taggers, Chunkers, Shallow Parsers – II	Information Extraction, Q/A – II	Interactive Poster / Demo Session
10:10 – 10:35	Syntactic Analysis of Long Sentences Based on S-clauses Mi-Young Kim and Jong-Hyeok Lee	Causal Relation Extraction Using Cue Phrase and Lexical Pair Probabilities Du-Seong Chang and Key-Sun Choi
10:35 – 11:00	A Nearest-Neighbor Method for Resolving PP-Attachment Ambiguity Shaojun Zhao and Dekang Lin	A re-examination of IR techniques in QA system Yi Chang, Hongbo Xu and Shuo Bai
11:00 – 11:10	Break
	Semantic Disambiguation – II
11:10 – 11:35	The Role of Semantic Information in Learning Question Classifiers Xin Li, Dan Roth and Kevin Small
11:35 – 12:00	Concept-Based Sense Disambiguation for Korean Nouns You-Jin Chung, Kyonghi Moon and Jong-Hyeok Lee
12:00 – 13:35	Lunch Break
	Statistical Models and Machine Learning for NLP – II	Word Segmentation – II	Panel Discussion
13:35 – 14:00	Flexible Margin Selection for Reranking with Full Pairwise Samples Libin Shen and Aravind K. Joshi	Chinese New Word Identification Based on Character Parsing Model Yao Meng, Hao Yu and Fumihito Nishino	Panel on Multilingual NLP for Public Information Services (2008 Digital Olympics)
14:00 – 14:25	Comparing Entropies within the Chinese Language Benjamin K Tsou, Tom B Y Lai and Ka-po Chow	The Use of SVM for Chinese New Word Identification Hongqiao Li, Chang-Ning Huang, Jianfeng Gao and Xiaozhong Fan
14:25 – 14:50	Bilingual Chunk Alignment Based on Interactional Matching and Probabilistic Latent Semantic Indexing Feifan Liu, Qianli Jin, Jun Zhao and Bo Xu	An Example-based Study on Chinese Word Segmentation Using Critical Fragments Qinan Hu, Haihua Pan and Chunyu Kit
14:50 – 15:00	Break

	NLP Software and Application – II	Text Mining in Biomedicine – II	Poster Presentation – II
15:00 – 15:25	Natural Language Database Access using Semi-Automatically Constructed Translation Knowledge In-Su Kang, Jae-Hak J. Bae and Jong-Hyeok Lee	Annotation of Gene Products in the Literature with Gene Ontology Terms using Syntactic Dependencies Jung-jae Kim and Jong C. Park	Improving Back-Transliteration by Combining Information Sources – Slaven Bilac and Hozumi Tanaka
			A Graph Grammar Approach to Map between Dependency Trees and Topological Models – Bernd Bohnet
			The Hinoki Treebank: A Treebank for Text Understanding – Francis Bond, Sanae Fujita, Chikara Hashimoto, Kaname Kasahara, Shigeko Nariyama, Eric Nichols, Akira Ohtani, Takaaki Tanaka and Shigeaki Amano
			Chinese Treebanks and Grammar Extraction – Keh-Jiann Chen and Yu-Ming Hsieh
			Using a Paraphraser to Improve Machine Translation Evaluation – Andrew Finch, Yasuhiro Akiba and Eiichiro Sumita
			Mining Table Information on the Internet – Sung-won Jung, Gi-deuk Han and Hyuk-chul Kwon
			Parsing Mixed Constructions in a Type Feature Structure Grammar – Jong-Bok Kim and Jaehyung Yang
			Collecting Evaluative Expressions for Opinion Extraction – Nozomi Kobayashi, Kentaro Inui, Yuji Matsumoto, Kenji Tateishi and Toshikazu Fukushima
			Deep Analysis of Modern Greek – Valia Kordoni and Julia Neu
			User Adaptation in MT-mediated Communication – Kentaro Ogura, Yoshihiko Hayashi, Saeko Nomura and Toru Ishida
			Learning to Filter Junk E-Mail from Positive and Unlabeled Examples – Karl-Michael Schneider
			A Collaborative Ability Measurement for Co-Training – Dan Shen, Jie Zhang, Jian Su, Guodong Zhou and Chew Lim Tan
			Word Sense Disambiguation using Heterogeneous Language Resources – Kiyoaki Shirai and Takayuki Tamagaki
			A Comparative Study on the Use of Labeled and Unlabeled Data for Large Margin Classifiers – Hiroya Takamura and Manabu Okumura
			An English-Hindi Statistical Machine Translation System – Raghavendra Udupa U and Tanveer A Faruquie
			N-fold Templated Piped Correction – Dekai Wu, Grace Ngai and Marine Carpuat
			Tagging Complex NEs with Maxent Models: Layered Structures versus Extended Tagset – Deyi Xiong, Hongkui Yu and Qun Liu
15:25 – 15:50	Specification Retrieval – How to Find Attribute-Value Information on the Web Minoru Yoshida and Hiroshi Nakagawa	Unsupervised Event Extraction from Biomedical Literature using Co-occurrence Information and Basic Patterns Hong-woo Chun, Young-sook Hwang and Hae-chang Rim
15:50 – 16:20	Refreshment Break
	Machine Translation and Multilinguality – II	Lexical Semantics, Ontology and Linguistic Resource – II
16:20 – 16:45	Bilingual Sentence Alignment Based on Punctuation Statistics and Lexicon Thomas C. Chuang, Jian-Cheng Wu, Tracy Lin, Wen-Chie Shei and Jason S. Chang	A Novel Approach to Improve Word Translations Extraction from Non-Parallel, Comparable Corpora Yun-Chuang Chiao, Jean-David Sta and Pierre Zweigenbaum
16:45 – 17:10	Practical Translation Pattern Acquisition from Combined Language Resources Mihoko Kitamura and Yuji Matsumoto	Acquiring Selectional Preferences in a Thai Lexical Database Canasai Kruengkrai, Thatsanee Charoenporn, Virach Sornlertlamvanich and Hitoshi Isahara
17:10 – 17:20	Break
17:20 – 17:40	Best Paper Award and Closing Session