The First International Joint Conference on Natural Language Processing (IJCNLP-04)
Home People Conference Program Registration Links
Workshops Satellite Symposium Tutorials Travel Info Archives

  Conference Program

CONFERENCE PROGRAM

Conference Program is also available in MS Word and PDF format.

Click the following shortcuts to jump to the respective date:

FIRST DAY: 22 March 2004 (Monday)

SECOND DAY: 23 March 2004 (Tuesday)

THIRD DAY: 24 March 2004 (Wednesday)


 

FIRST DAY: 22 March 2004 (Monday) top

9:00 – 9:15

Opening Ceremony

9:15 – 9:55

Opening Address: Prof. Makoto Nagao

9:55 – 10:00

Break

10:00 – 10:50

Invited Speech: Probabilistic Models for Rich Linguistic Information (Prof. Mark Johnson)

10:50 – 11:10

Refreshment Break

 

Taggers, Chunkers, Shallow Parsers – I

Text and Sentence Generation

Machine Translation and Multilinguality – I

11:10 – 11:35

High Speed Unknown Word Prediction Using Support Vector Machine For Chinese Text-to-Speech Systems

Juhong Ha, Yu Zheng and Gary Geunbae Lee

Detection of Incorrect Case Assignments in Automatically Generated Paraphrases of Japanese Sentences

Atsushi Fujita, Kentaro Inui and Yuji Matsumoto

Automatic Learning of Parallel Dependency Treelet Pairs

Yuan Ding and Martha Palmer

11:35 – 12:00

Chinese Chunk Identification Using SVMs plus Sigmoid

Yongmei Tan, Tianshun Yao, Qing Chen and Jingbo Zhu

Feature Selection and Machine Learning for Pronominalization

Ji-Eun Roh and Jong-Hyeok Lee

Example-based Machine Translation without Saying Inferable Predicate

Eiji Aramaki, Sadao Kurohashi, Hideki Kashioka and Hideki Tanaka

12:00 – 13:35

Lunch Break

 

Statistical Models and Machine Learning for NLP – I

NLP Software and Application – I

Panel Discussion

13:35 – 14:00

A Three Level Cache-based Adaptive Chinese Language Model

Junlin Zhang, Weimin Qu, Le Sun, Lin Du and Yufang Sun

Automatic Genre Detection of Web Documents

Chul Su Lim, Kong Joo Lee and Gil Chang Kim

Panel on Emerging Asian Language Processing Efforts

14:00 – 14:25

Capturing Long Distance Dependency in Language Modeling: An Empirical Study

Jianfeng Gao and Hisami Suzuki

Statistical Substring Reduction in Linear Time

Xueqiang Lü, Le Zhang and Junfeng Hu

14:25 – 14:50

Word Folding: Taking the Snapshot of Words Instead of the Whole

Jin-Dong Kim and Jun'ichi Tsujii

You don’t have to think twice if you carefully tokenize

Stefan Klatt and Bernd Bohnet

14:50 – 15:00

Break

 

Information Retrieval – I

Theories and Formalisms for Morphology, Syntax and Semantics – I

Poster Presentation – I

15:00 – 15:25

Information Flow Analysis with Chinese Text

Paulo Cheong, Dawei Song, Peter Bruza and Kam-Fai Wong

The Automatic Acquisition of Verb Subcategorisations and their Impact on the Performance of an HPSG Parser

John Carroll and Alex C. Fang

Using a Smoothing Maximum Entropy Model for Chinese Nominal Entity Tagging –Jinying Chen, Nianwen Xue and Martha Palmer

Robust Speaker Identification System Based on Wavelet Transform and Gaussian Mixture Model –Wan-Chen Chen, Ching-Tang Hsieh and Eugene Lai

Deterministic dependency structure analyzer for Chinese –Yuchang Cheng, Masayuki Asahara and Yuji Matsumoto

Building a parallel bilingual syntactically annotated corpus –Jan Cuřín, Martin Čmejrek, Jiří Havelka and Vladislav Kuboň

Fast Reinforcement Learning of Dialogue Policies using Stable Function Approximation –Matthias Denecke, Kohji Dohsaka and Mikio Nakano

Selecting Prosody Parameters for Unit Selection Based Chinese TTS –Minghui Dong, Kim-Teng Lua and Jun Xu

Making Use of Furigana –Gary Kacmarcik

Stochastic Word-Spacing System with Dynamic Increase of Word List –Mi-young Kang, Sung-ja Choi, Ae-sun Yoon and Hyuk-chul Kwon

Resolution of Modifier-Head Relation Gaps using Automatically Extracted Metonymic Expressions –Yoji Kiyota, Sadao Kurohashi and Fuyuko Kido

Headword Percolation in a Multi-Parser Architecture for Natural Language Understanding –Po Chui Luk, Kui Xu and Helen Meng

Recognition of HTML Table Structure –Hidetaka Masuda, Shuichi Tsukamoto and Hiroshi Nakagawa

Improving Relevance Feedback in the Language Modeling Approach: Maximum a Posteriori Probability Criterion and Three-component Mixture Model –Seung-Hoon Na, In-Su Kang and Jong-Hyeok Lee

A Persistent Feature-Object Database for Intelligent Text Archive Systems –Takashi Ninomiya, Jun'ichi Tsujii and Yusuke Miyao

Improving Quality of the Web Corpus –Youichi Sekiguchi and Kazuhide Yamamoto

Detecting sentence boundaries in Japanese speech transcriptions using a morphological analyzer –Sachie Tajima, Hidetsugu Nanba and Manabu Okumura

Improving PinYin to Chinese Conversion with a Whole Sentence Maximum Entropy Model –Le Zhang and Tianshun Yao

How Effective is Query Expansion for Finding Novel Information? –Min Zhang and Shaoping Ma

15:25 – 15:50

BBS Based Hot Topic Retrieval Using Back-Propagation Neural Network

Lan You, Yongping Du, Jiayin Ge, Xuanjing Huang and Lide Wu

FML-Based SCF Predefinition Learning for Chinese Verbs

Xiwu Han, Tiejun Zhao and Muyun Yang

15:50 – 16:20

Refreshment Break

 

Semantic Disambiguation – I

Text Mining in Biomedicine – I

16:20 – 16:45

Influence of WSD on Cross-Language Information Retrieval

In-Su Kang, Seung-Hoon Na and Jong-Hyeok Lee

SVM-based Biological Named Entity Recognition using Minimum Edit-Distance Feature Boosted by Virtual Examples

Eunji Yi, Gary Geunbae Lee and Soo-Jun Park

16:45 – 17:10

Improving Word Sense Disambiguation by Pseudo Samples

Xiaojie Wang and Yuji Matsumoto

Mining Biomedical Abstracts: What’s in a Term?

Goran Nenadić, Irena Spasić and Sophia Ananiadou

17:10 – 17:20

Break

 

Word Segmentation – I

Lexical Semantics, Ontology and Linguistic Resource – I

Information Extraction, Q/A – I

17:20 – 17:45

Unsupervised Segmentation of Chinese Corpus Using Accessor Variety

Haodi Feng, Kang Chen, Chunyu Kit and Xiaotie Deng

Acquiring Bilingual Named Entity Translations from Content-aligned Corpora

Tadashi Kumano, Hideki Kashioka, Hideki Tanaka and Takahiro Fukusima

A Novel Pattern Learning Method for Open Domain Question Answering

Yongping Du, Xuanjing Huang, Xin Li and Lide Wu

17:45 – 18:10

Chinese Unknown Word Identification Using Class-based LM

Guohong Fu and Kang-Kwong Luke

Visual Semantics and Ontology of Eventive Verbs

Minhua Ma and Paul Mc Kevitt

Chinese Named Entity Recognition Based on Multilevel Linguistic Features

Honglei Guo, Jianmin Jiang, Gang Hu and Tong Zhang

19:00

Banquet


SECOND DAY: 23 March 2004 (Tuesday) top

9:00 – 9:50

Invited Speech: The Impact of Information Technology on Communication and Linguistics (Prof. Ching-Chun Hsieh)

9:50 – 10:10

Refreshment Break

 

Text Mining

Dialogue and Discourse

Natural Language Technology in Mobile IR and Text Processing User Interfaces

10:10 – 10:35

A Study of Semi-Discrete Matrix Decomposition for LSI in Automated Text Categorization

Qiang Wang, XiaoLong Wang and Yi Guan

Improving Noun Phrase Coreference Resolution by Matching Strings

Xiaofeng Yang, Guodong Zhou, Jian Su and Chew Lim Tan

Dit4dah: Predictive Pruning For Morse Code Text Entry: Towards Entry Systems For the Seriously Impaired

Kumiko Tanaka-Ishii and Ian Frank

10:35 – 11:00

Systematic Construction of Hierarchical Classifier in SVM-based Text Categorization

Yongwook Yoon, Changki Lee and Gary Geunbae Lee

Zero Pronoun Resolution based on Automatically Constructed Case Frames and Structural Preference of Antecedents

Daisuke Kawahara and Sadao Kurohashi

Spoken versus Written Queries for Mobile Information Access: an Experiment on Mandarin Chinese

Heather Du and Fabio Crestani

11:00 – 11:25

Categorizing Unknown Text Patterns for Information Extraction Using a Search Result Mining Approach

Chien-Chung Huang, Shui-Lung Chuang and Lee-Feng Chien

Combining Labeled and Unlabeled Data for Learning Cross-document Structural Relationships

Zhu Zhang and Dragomir Radev

An Interactive Proofreading System for Inappropriately Selected Words on Using Predictive Text Entry

Hideya Iwasaki and Kumiko Tanaka-Ishii

11:25 – 11:35

Break

 

Information Retrieval – II

Theories and Formalisms for Morphology, Syntax and Semantics – II

FSA, Parsing Algorithms

11:35 – 12:00

Phoneme-based Transliteration of Foreign Names for OOV Problem

Wei Gao, Kam-Fai Wong and Wai Lam

Corpus-oriented Grammar Development for Acquiring a Head-driven Phrase Structure Grammar from the Penn Treebank

Yusuke Miyao, Takashi Ninomiya and Jun'ichi Tsujii

Data-Oriented Parsing and the Penn Chinese Treebank

Mary Hearne and Andy Way

12:00 – 12:25

Window-based Method for Information Retrieval

Qianli Jin, Jun Zhao and Bo Xu

Implementing the Syntax of Japanese Numeral Classifiers

Emily M. Bender and Melanie Siegel

Iterative CKY parsing for Probabilistic Context-Free Grammars

Yoshimasa Tsuruoka and Jun'ichi Tsujii

12:25 – 13:30

Lunch Break

13:30

Excursion


THIRD DAY: 24 March 2004 (Wednesday) top

9:00 – 9:50

Invited Speech: Language Technology for E-Memory Applications (Prof. Hans Uszkoreit)

9:50 – 10:10

Refreshment Break

 

Taggers, Chunkers, Shallow Parsers – II

Information Extraction, Q/A – II

Interactive Poster / Demo Session

10:10 – 10:35

Syntactic Analysis of Long Sentences Based on S-clauses

Mi-Young Kim and Jong-Hyeok Lee

Causal Relation Extraction Using Cue Phrase and Lexical Pair Probabilities

Du-Seong Chang and Key-Sun Choi

 

10:35 – 11:00

A Nearest-Neighbor Method for Resolving PP-Attachment Ambiguity

Shaojun Zhao and Dekang Lin

A re-examination of IR techniques in QA system

Yi Chang, Hongbo Xu and Shuo Bai

11:00 – 11:10

Break

 

Semantic Disambiguation – II

 

11:10 – 11:35

The Role of Semantic Information in Learning Question Classifiers

Xin Li, Dan Roth and Kevin Small

 

11:35 – 12:00

Concept-Based Sense Disambiguation for Korean Nouns

You-Jin Chung, Kyonghi Moon and Jong-Hyeok Lee

 

12:00 – 13:35

Lunch Break

 

Statistical Models and Machine Learning for NLP - II

Word Segmentation – II

Panel Discussion

13:35 – 14:00

Flexible Margin Selection for Reranking with Full Pairwise Samples

Libin Shen and Aravind K. Joshi

Chinese New Word Identification Based on Character Parsing Model

Yao Meng, Hao Yu and Fumihito Nishino

Panel on Multilingual NLP for Public Information Services (2008 Digital Olympics)

14:00 – 14:25

Comparing Entropies within the Chinese Language

Benjamin K Tsou, Tom B Y Lai and Ka-po Chow

The Use of SVM for Chinese New Word Identification

Hongqiao Li, Chang-Ning Huang, Jianfeng Gao and Xiaozhong Fan

14:25 – 14:50

Bilingual Chunk Alignment Based on Interactional Matching and Probabilistic Latent Semantic Indexing

Feifan Liu, Qianli Jin, Jun Zhao and Bo Xu

An Example-based Study on Chinese Word Segmentation Using Critical Fragments

Qinan Hu, Haihua Pan and Chunyu Kit

14:50 – 15:00

Break


 

NLP Software and Application – II

Text Mining in Biomedicine – II

Poster Presentation – II

15:00 – 15:25

Natural Language Database Access using Semi-Automatically Constructed Translation Knowledge

In-Su Kang, Jae-Hak J. Bae and Jong-Hyeok Lee

Annotation of Gene Products in the Literature with Gene Ontology Terms using Syntactic Dependencies

Jung-jae Kim and Jong C. Park

Improving Back-Transliteration by Combining Information Sources –Slaven Bilac and Hozumi Tanaka

A Graph Grammar Approach to Map between Dependency Trees and Topological Models –Bernd Bohnet

The Hinoki Treebank: A Treebank for Text Understanding –Francis Bond, Sanae Fujita, Chikara Hashimoto, Kaname Kasahara, Shigeko Nariyama, Eric Nichols, Akira Ohtani, Takaaki Tanaka and Shigeaki Amano

Chinese Treebanks and Grammar Extraction –Keh-Jiann Chen and Yu-Ming Hsieh

Using a Paraphraser to Improve Machine Translation Evaluation –Andrew Finch, Yasuhiro Akiba and Eiichiro Sumita

Mining Table Information on the Internet –Sung-won Jung, Gi-deuk Han and Hyuk-chul Kwon

Parsing Mixed Constructions in a Type Feature Structure Grammar –Jong-Bok Kim and Jaehyung Yang

Collecting Evaluative Expressions for Opinion Extraction –Nozomi Kobayashi, Kentaro Inui, Yuji Matsumoto, Kenji Tateishi and Toshikazu Fukushima

Deep Analysis of Modern Greek –Valia Kordoni and Julia Neu

User Adaptation in MT-mediated Communication –Kentaro Ogura, Yoshihiko Hayashi, Saeko Nomura and Toru Ishida

Learning to Filter Junk E-Mail from Positive and Unlabeled Examples –Karl-Michael Schneider

A Collaborative Ability Measurement for Co-Training –Dan Shen, Jie Zhang, Jian Su, Guodong Zhou and Chew Lim Tan

Word Sense Disambiguation using Heterogeneous Language Resources –Kiyoaki Shirai and Takayuki Tamagaki

A Comparative Study on the Use of Labeled and Unlabeled Data for Large Margin Classifiers –Hiroya Takamura and Manabu Okumura

An English-Hindi Statistical Machine Translation System –Raghavendra Udupa U and Tanveer A Faruquie

N-fold Templated Piped Correction –Dekai Wu, Grace Ngai and Marine Carpuat

Tagging Complex NEs with Maxent Models: Layered Structures versus Extended Tagset –Deyi Xiong, Hongkui Yu and Qun Liu

15:25 – 15:50

Specification Retrieval – How to Find Attribute-Value Information on the Web

Minoru Yoshida and Hiroshi Nakagawa

Unsupervised Event Extraction from Biomedical Literature using Co-occurrence Information and Basic Patterns

Hong-woo Chun, Young-sook Hwang and Hae-chang Rim

15:50 – 16:20

Refreshment Break

 

Machine Translation and Multilinguality – II

Lexical Semantics, Ontology and Linguistic Resource – II

16:20 – 16:45

Bilingual Sentence Alignment Based on Punctuation Statistics and Lexicon

Thomas C. Chuang, Jian-Cheng Wu, Tracy Lin, Wen-Chie Shei and Jason S. Chang

A Novel Approach to Improve Word Translations Extraction from Non-Parallel, Comparable Corpora

Yun-Chuang Chiao, Jean-David Sta and Pierre Zweigenbaum

16:45 – 17:10

Practical Translation Pattern Acquisition from Combined Language Resources

Mihoko Kitamura and Yuji Matsumoto

Acquiring Selectional Preferences in a Thai Lexical Database

Canasai Kruengkrai, Thatsanee Charoenporn, Virach Sornlertlamvanich and Hitoshi Isahara

17:10 – 17:20

Break

17:20 – 17:40

Best Paper Award and Closing Session