CONFERENCE PROGRAM
FIRST DAY: 22 March 2004 (Monday)
9:00 – 9:15 |
Opening Ceremony |
9:15 – 9:55 |
Opening Address: Prof. Makoto Nagao |
9:55 – 10:00 |
Break |
10:00 – 10:50 |
Invited Speech: Probabilistic Models for Rich Linguistic Information (Prof. Mark Johnson) |
10:50 – 11:10 |
Refreshment Break |
|
Taggers, Chunkers, Shallow Parsers – I |
Text and Sentence Generation |
Machine Translation and Multilinguality – I |
11:10 – 11:35 |
High Speed Unknown Word Prediction Using Support Vector Machine For Chinese Text-to-Speech Systems
Juhong Ha, Yu Zheng and Gary Geunbae Lee |
Detection of Incorrect Case Assignments in Automatically Generated Paraphrases of Japanese Sentences
Atsushi Fujita, Kentaro Inui and Yuji Matsumoto |
Automatic Learning of Parallel Dependency Treelet Pairs
Yuan Ding and Martha Palmer |
11:35 – 12:00 |
Chinese Chunk Identification Using SVMs plus Sigmoid
Yongmei Tan, Tianshun Yao, Qing Chen and Jingbo Zhu |
Feature Selection and Machine Learning for Pronominalization
Ji-Eun Roh and Jong-Hyeok Lee |
Example-based Machine Translation without Saying Inferable Predicate
Eiji Aramaki, Sadao Kurohashi, Hideki Kashioka and Hideki Tanaka |
12:00 – 13:35 |
Lunch Break |
|
Statistical Models and Machine Learning for NLP – I |
NLP Software and Application – I |
Panel Discussion |
13:35 – 14:00 |
A Three Level Cache-based Adaptive Chinese Language Model
Junlin Zhang, Weimin Qu, Le Sun, Lin Du and Yufang Sun |
Automatic Genre Detection of Web Documents
Chul Su Lim, Kong Joo Lee and Gil Chang Kim |
Panel on Emerging Asian Language Processing Efforts |
14:00 – 14:25 |
Capturing Long Distance Dependency in Language Modeling: An Empirical Study
Jianfeng Gao and Hisami Suzuki |
Statistical Substring Reduction in Linear Time
Xueqiang Lü, Le Zhang and Junfeng Hu |
14:25 – 14:50 |
Word Folding: Taking the Snapshot of Words Instead of the Whole
Jin-Dong Kim and Jun'ichi Tsujii |
You don’t have to think twice if you carefully tokenize
Stefan Klatt and Bernd Bohnet |
14:50 – 15:00 |
Break |
|
Information Retrieval – I |
Theories and Formalisms for Morphology, Syntax and Semantics – I |
Poster Presentation – I |
15:00 – 15:25 |
Information Flow Analysis with Chinese Text
Paulo Cheong, Dawei Song, Peter Bruza and Kam-Fai Wong |
The Automatic Acquisition of Verb Subcategorisations and their Impact on the Performance of an HPSG Parser
John Carroll and Alex C. Fang |
Using a Smoothing Maximum Entropy Model for Chinese Nominal Entity Tagging – Jinying Chen, Nianwen Xue and Martha Palmer
Robust Speaker Identification System Based on Wavelet Transform and Gaussian Mixture Model – Wan-Chen Chen, Ching-Tang Hsieh and Eugene Lai
Deterministic dependency structure analyzer for Chinese – Yuchang Cheng, Masayuki Asahara and Yuji Matsumoto
Building a parallel bilingual syntactically annotated corpus – Jan Cuřín, Martin Čmejrek, Jiří Havelka and Vladislav Kuboň
Fast Reinforcement Learning of Dialogue Policies using Stable Function Approximation – Matthias Denecke, Kohji Dohsaka and Mikio Nakano
Selecting Prosody Parameters for Unit Selection Based Chinese TTS – Minghui Dong, Kim-Teng Lua and Jun Xu
Making Use of Furigana – Gary Kacmarcik
Stochastic Word-Spacing System with Dynamic Increase of Word List – Mi-young Kang, Sung-ja Choi, Ae-sun Yoon and Hyuk-chul Kwon
Resolution of Modifier-Head Relation Gaps using Automatically Extracted Metonymic Expressions – Yoji Kiyota, Sadao Kurohashi and Fuyuko Kido
Headword Percolation in a Multi-Parser Architecture for Natural Language Understanding – Po Chui Luk, Kui Xu and Helen Meng
Recognition of HTML Table Structure – Hidetaka Masuda, Shuichi Tsukamoto and Hiroshi Nakagawa
Improving Relevance Feedback in the Language Modeling Approach: Maximum a Posteriori Probability Criterion and Three-component Mixture Model – Seung-Hoon Na, In-Su Kang and Jong-Hyeok Lee
A Persistent Feature-Object Database for Intelligent Text Archive Systems – Takashi Ninomiya, Jun'ichi Tsujii and Yusuke Miyao
Improving Quality of the Web Corpus – Youichi Sekiguchi and Kazuhide Yamamoto
Detecting sentence boundaries in Japanese speech transcriptions using a morphological analyzer – Sachie Tajima, Hidetsugu Nanba and Manabu Okumura
Improving PinYin to Chinese Conversion with a Whole Sentence Maximum Entropy Model – Le Zhang and Tianshun Yao
How Effective is Query Expansion for Finding Novel Information? – Min Zhang and Shaoping Ma |
15:25 – 15:50 |
BBS Based Hot Topic Retrieval Using Back-Propagation Neural Network
Lan You, Yongping Du, Jiayin Ge, Xuanjing Huang and Lide Wu |
FML-Based SCF Predefinition Learning for Chinese Verbs
Xiwu Han, Tiejun Zhao and Muyun Yang |
15:50 – 16:20 |
Refreshment Break |
|
Semantic Disambiguation – I |
Text Mining in Biomedicine – I |
16:20 – 16:45 |
Influence of WSD on Cross-Language Information Retrieval
In-Su Kang, Seung-Hoon Na and Jong-Hyeok Lee |
SVM-based Biological Named Entity Recognition using Minimum Edit-Distance Feature Boosted by Virtual Examples
Eunji Yi, Gary Geunbae Lee and Soo-Jun Park |
16:45 – 17:10 |
Improving Word Sense Disambiguation by Pseudo Samples
Xiaojie Wang and Yuji Matsumoto |
Mining Biomedical Abstracts: What’s in a Term?
Goran Nenadić, Irena Spasić and Sophia Ananiadou |
17:10 – 17:20 |
Break |
|
Word Segmentation – I |
Lexical Semantics, Ontology and Linguistic Resource – I |
Information Extraction, Q/A – I |
17:20 – 17:45 |
Unsupervised Segmentation of Chinese Corpus Using Accessor Variety
Haodi Feng, Kang Chen, Chunyu Kit and Xiaotie Deng |
Acquiring Bilingual Named Entity Translations from Content-aligned Corpora
Tadashi Kumano, Hideki Kashioka, Hideki Tanaka and Takahiro Fukusima |
A Novel Pattern Learning Method for Open Domain Question Answering
Yongping Du, Xuanjing Huang, Xin Li and Lide Wu |
17:45 – 18:10 |
Chinese Unknown Word Identification Using Class-based LM
Guohong Fu and Kang-Kwong Luke |
Visual Semantics and Ontology of Eventive Verbs
Minhua Ma and Paul Mc Kevitt |
Chinese Named Entity Recognition Based on Multilevel Linguistic Features
Honglei Guo, Jianmin Jiang, Gang Hu and Tong Zhang |
19:00 |
Banquet |
SECOND DAY: 23 March 2004 (Tuesday)
9:00 – 9:50 |
Invited Speech: The Impact of Information Technology on Communication and Linguistics (Prof. Ching-Chun Hsieh) |
9:50 – 10:10 |
Refreshment Break |
|
Text Mining |
Dialogue and Discourse |
Natural Language Technology in Mobile IR and Text Processing User Interfaces |
10:10 – 10:35 |
A Study of Semi-Discrete Matrix Decomposition for LSI in Automated Text Categorization
Qiang Wang, XiaoLong Wang and Yi Guan |
Improving Noun Phrase Coreference Resolution by Matching Strings
Xiaofeng Yang, Guodong Zhou, Jian Su and Chew Lim Tan |
Dit4dah: Predictive Pruning For Morse Code Text Entry: Towards Entry Systems For the Seriously Impaired
Kumiko Tanaka-Ishii and Ian Frank |
10:35 – 11:00 |
Systematic Construction of Hierarchical Classifier in SVM-based Text Categorization
Yongwook Yoon, Changki Lee and Gary Geunbae Lee |
Zero Pronoun Resolution based on Automatically Constructed Case Frames and Structural Preference of Antecedents
Daisuke Kawahara and Sadao Kurohashi |
Spoken versus Written Queries for Mobile Information Access: an Experiment on Mandarin Chinese
Heather Du and Fabio Crestani |
11:00 – 11:25 |
Categorizing Unknown Text Patterns for Information Extraction Using a Search Result Mining Approach
Chien-Chung Huang, Shui-Lung Chuang and Lee-Feng Chien |
Combining Labeled and Unlabeled Data for Learning Cross-document Structural Relationships
Zhu Zhang and Dragomir Radev |
An Interactive Proofreading System for Inappropriately Selected Words on Using Predictive Text Entry
Hideya Iwasaki and Kumiko Tanaka-Ishii |
11:25 – 11:35 |
Break |
|
Information Retrieval – II |
Theories and Formalisms for Morphology, Syntax and Semantics – II |
FSA, Parsing Algorithms |
11:35 – 12:00 |
Phoneme-based Transliteration of Foreign Names for OOV Problem
Wei Gao, Kam-Fai Wong and Wai Lam |
Corpus-oriented Grammar Development for Acquiring a Head-driven Phrase Structure Grammar from the Penn Treebank
Yusuke Miyao, Takashi Ninomiya and Jun'ichi Tsujii |
Data-Oriented Parsing and the Penn Chinese Treebank
Mary Hearne and Andy Way |
12:00 – 12:25 |
Window-based Method for Information Retrieval
Qianli Jin, Jun Zhao and Bo Xu |
Implementing the Syntax of Japanese Numeral Classifiers
Emily M. Bender and Melanie Siegel |
Iterative CKY parsing for Probabilistic Context-Free Grammars
Yoshimasa Tsuruoka and Jun'ichi Tsujii |
12:25 – 13:30 |
Lunch Break |
13:30 |
Excursion |
THIRD DAY: 24 March 2004 (Wednesday)
9:00 – 9:50 |
Invited Speech: Language Technology for E-Memory Applications (Prof. Hans Uszkoreit) |
9:50 – 10:10 |
Refreshment Break |
|
Taggers, Chunkers, Shallow Parsers – II |
Information Extraction, Q/A – II |
Interactive Poster / Demo Session |
10:10 – 10:35 |
Syntactic Analysis of Long Sentences Based on S-clauses
Mi-Young Kim and Jong-Hyeok Lee |
Causal Relation Extraction Using Cue Phrase and Lexical Pair Probabilities
Du-Seong Chang and Key-Sun Choi |
|
10:35 – 11:00 |
A Nearest-Neighbor Method for Resolving PP-Attachment Ambiguity
Shaojun Zhao and Dekang Lin |
A re-examination of IR techniques in QA system
Yi Chang, Hongbo Xu and Shuo Bai |
11:00 – 11:10 |
Break |
|
Semantic Disambiguation – II |
|
11:10 – 11:35 |
The Role of Semantic Information in Learning Question Classifiers
Xin Li, Dan Roth and Kevin Small |
|
11:35 – 12:00 |
Concept-Based Sense Disambiguation for Korean Nouns
You-Jin Chung, Kyonghi Moon and Jong-Hyeok Lee |
|
12:00 – 13:35 |
Lunch Break |
|
Statistical Models and Machine Learning for NLP – II |
Word Segmentation – II |
Panel Discussion |
13:35 – 14:00 |
Flexible Margin Selection for Reranking with Full Pairwise Samples
Libin Shen and Aravind K. Joshi |
Chinese New Word Identification Based on Character Parsing Model
Yao Meng, Hao Yu and Fumihito Nishino |
Panel on Multilingual NLP for Public Information Services (2008 Digital Olympics) |
14:00 – 14:25 |
Comparing Entropies within the Chinese Language
Benjamin K Tsou, Tom B Y Lai and Ka-po Chow |
The Use of SVM for Chinese New Word Identification
Hongqiao Li, Chang-Ning Huang, Jianfeng Gao and Xiaozhong Fan |
14:25 – 14:50 |
Bilingual Chunk Alignment Based on Interactional Matching and Probabilistic Latent Semantic Indexing
Feifan Liu, Qianli Jin, Jun Zhao and Bo Xu |
An Example-based Study on Chinese Word Segmentation Using Critical Fragments
Qinan Hu, Haihua Pan and Chunyu Kit |
14:50 – 15:00 |
Break |
|
NLP Software and Application – II |
Text Mining in Biomedicine – II |
Poster Presentation – II |
15:00 – 15:25 |
Natural Language Database Access using Semi-Automatically Constructed Translation Knowledge
In-Su Kang, Jae-Hak J. Bae and Jong-Hyeok Lee |
Annotation of Gene Products in the Literature with Gene Ontology Terms using Syntactic Dependencies
Jung-jae Kim and Jong C. Park |
Improving Back-Transliteration by Combining Information Sources – Slaven Bilac and Hozumi Tanaka
A Graph Grammar Approach to Map between Dependency Trees and Topological Models – Bernd Bohnet
The Hinoki Treebank: A Treebank for Text Understanding – Francis Bond, Sanae Fujita, Chikara Hashimoto, Kaname Kasahara, Shigeko Nariyama, Eric Nichols, Akira Ohtani, Takaaki Tanaka and Shigeaki Amano
Chinese Treebanks and Grammar Extraction – Keh-Jiann Chen and Yu-Ming Hsieh
Using a Paraphraser to Improve Machine Translation Evaluation – Andrew Finch, Yasuhiro Akiba and Eiichiro Sumita
Mining Table Information on the Internet – Sung-won Jung, Gi-deuk Han and Hyuk-chul Kwon
Parsing Mixed Constructions in a Type Feature Structure Grammar – Jong-Bok Kim and Jaehyung Yang
Collecting Evaluative Expressions for Opinion Extraction – Nozomi Kobayashi, Kentaro Inui, Yuji Matsumoto, Kenji Tateishi and Toshikazu Fukushima
Deep Analysis of Modern Greek – Valia Kordoni and Julia Neu
User Adaptation in MT-mediated Communication – Kentaro Ogura, Yoshihiko Hayashi, Saeko Nomura and Toru Ishida
Learning to Filter Junk E-Mail from Positive and Unlabeled Examples – Karl-Michael Schneider
A Collaborative Ability Measurement for Co-Training – Dan Shen, Jie Zhang, Jian Su, Guodong Zhou and Chew Lim Tan
Word Sense Disambiguation using Heterogeneous Language Resources – Kiyoaki Shirai and Takayuki Tamagaki
A Comparative Study on the Use of Labeled and Unlabeled Data for Large Margin Classifiers – Hiroya Takamura and Manabu Okumura
An English-Hindi Statistical Machine Translation System – Raghavendra Udupa U and Tanveer A Faruquie
N-fold Templated Piped Correction – Dekai Wu, Grace Ngai and Marine Carpuat
Tagging Complex NEs with Maxent Models: Layered Structures versus Extended Tagset – Deyi Xiong, Hongkui Yu and Qun Liu |
15:25 – 15:50 |
Specification Retrieval – How to Find Attribute-Value Information on the Web
Minoru Yoshida and Hiroshi Nakagawa |
Unsupervised Event Extraction from Biomedical Literature using Co-occurrence Information and Basic Patterns
Hong-woo Chun, Young-sook Hwang and Hae-chang Rim |
15:50 – 16:20 |
Refreshment Break |
|
Machine Translation and Multilinguality – II |
Lexical Semantics, Ontology and Linguistic Resource – II |
16:20 – 16:45 |
Bilingual Sentence Alignment Based on Punctuation Statistics and Lexicon
Thomas C. Chuang, Jian-Cheng Wu, Tracy Lin, Wen-Chie Shei and Jason S. Chang |
A Novel Approach to Improve Word Translations Extraction from Non-Parallel, Comparable Corpora
Yun-Chuang Chiao, Jean-David Sta and Pierre Zweigenbaum |
16:45 – 17:10 |
Practical Translation Pattern Acquisition from Combined Language Resources
Mihoko Kitamura and Yuji Matsumoto |
Acquiring Selectional Preferences in a Thai Lexical Database
Canasai Kruengkrai, Thatsanee Charoenporn, Virach Sornlertlamvanich and Hitoshi Isahara |
17:10 – 17:20 |
Break |
17:20 – 17:40 |
Best Paper Award and Closing Session |
|