Conference Program

Overview of the Conference Program

Tutorials

7 Jan (Mon)

1000-1300

T2. How to Add a New Language on the NLP Map: Building Resources and Tools for Languages with Scarce Resources

1400-1700

T1. Social Network inspired Models of NLP and Language Evolution

1400-1700

T3. Introduction to Text Summarization and Other Information Access Technologies

Main Conference
8 Jan (Tue)	9 Jan (Wed)	10 Jan (Thu)
0900-0930 Inauguration 0930-1030 Keynote Prof Aravind Joshi, Univ of Pennsylvania	0900-1100 TS4 (A2-1,B2-1,C2-1)	0900-1030 TS6 (A3-1,B3-1,C3-1)
1100-1230 TS1 (A1-1,B1-1,C1-1)	1100-1230 PS1(30) Poster Session 1	1100-1230 TS7 (A3-2,B3-2,C3-2)
LUNCH	LUNCH	LUNCH
1400-1530 TS2 (A1-2,B1-2,C1-2)	1400-1500 Invited Talk 1 Prof Hyopil Shin, Seoul National University	1400-1500 Invited Talk 2 Dr S.H. Srinivasan, Yahoo
1600-1800 TS3 (A1-3,B1-3,C1-3)	1500-1630 PS2(30) Poster Session 2	1530-1700 TS8 (A3-3,B3-3,C3-3)
1900 Banquet Dinner (Optional)	1630-1730 TS5 (A2-2,B2-2,C2-2)	1700-1800 Panel and Closing Session

Workshops

11 Jan (Fri)

0900-1800

W1: NLP for Less Privileged Languages (NLPLPL)

W2: The Sixth SIGHAN Workshop on Chinese Language Processing (SIGHAN6)

W4: Cross Language Information Access (CLIA)

W5: The 6th Workshop on Asian Language Resources (ALR6)

W6: Technologies and Corpora for Asia-Pacific Speech Translation (TCAST)

12 Jan (Sat)

0900-1230

W2: The Sixth SIGHAN Workshop on Chinese Language Processing (SIGHAN6)

W5: The 6th Workshop on Asian Language Resources (ALR6)

0900-1800

W3: Named Entity Recognition for South and South East Asian Languages (NERSSEAL)

Notes:

Morning Tea/Coffee Break: 10:30 – 11:00
Lunch: 12:30 – 14:00
Afternoon Tea/Coffee Break: 15:30 – 16:00 (Exception on Jan 10th: 15:00 – 15:30)

Session Index of Main Conference

Day 1: January 8th (Tue)

A1-1: Word Segmentation/POS tagging
B1-1: Text Mining/Information Extraction (1)
C1-1: Asian Language Processing and Linguistics Issues
A1-2: Parsing/Grammar
B1-2: Text Mining/Information Extraction (2)
C1-2: Summarization
A1-3: Multiple Document Processing
B1-3: Information Retrieval
C1-3: Spoken Language Processing

Day 2: January 9th (Wed)

A2-1: Machine Translation (1)
B2-1: Text Mining/Information Extraction (3)
C2-1: Emotion/Sentiment Analysis
Poster session 1
Poster session 2
A2-2: Machine Translation (2)
B2-2: Named Entity Recognition
C2-2: Relation Extraction

Day 3: January 10th (Thu)

A3-1: Statistical Models/Machine Learning for NLP (1)
B3-1: Ontologies and Linguistic Resources (1)
C3-1: Question Answering
A3-2: Statistical Models/Machine Learning for NLP (2)
B3-2: Ontologies and Linguistic Resources (2)
C3-2: Event/Sentence Relation
A3-3: Statistical Machine Translation
B3-3: Ontologies and Linguistic Resources (3)
C3-3: Semantic Similarity

Details of Main Conference Schedule

	Day 1: January 8th (Tue)
09:00-09:30	Opening
09:30-10:30	Keynote Speech: Penn Discourse Treebank: Complexity of Dependencies at The Discourse Level and at the Sentence Level
	Prof. Aravind Joshi, University of Pennsylvania
10:30-11:00	Break
11:00-12:30	Oral session: (3 + 3 + 3)

A1-1: Word Segmentation/POS tagging
A Lemmatization Method for Modern Mongolian and its Application to Information Retrieval
	Badam-Osor Khaltar and Atsushi Fujii
An Empirical Comparison of Goodness Measures for Unsupervised Chinese Word Segmentation with a Unified Framework
	Hai Zhao and Chunyu Kit
A Hybrid Approach to the Induction of Underlying Morphology
	Michael Tepper and Fei Xia

B1-1: Text Mining/Information Extraction (1)
Context-Sensitive Convolution Tree Kernel for Pronoun Resolution
	GuoDong ZHOU and Fang KONG
Semi-Supervised Learning for Relation Extraction
	GuoDong ZHOU, LongHua QIAN and QiaoMing ZHU
Story Link Detection based on Dynamic Information Extending
	Xiaoyan Zhang, Ting Wang and Huowang Chen

C1-1: Asian Language Processing and Linguistics Issues
Orthographic Disambiguation Incorporating Transliterated Probability
	Eiji ARAMAKI, Takeshi IMAI, Kengo MIYO and Kazuhiko OHE
Name Origin Recognition Using Maximum Entropy Model and Diverse Features
	Min ZHANG, Chengjie SUN, Aiti AW and Chew Lim Tan
A More Discerning and Adaptable Multilingual Transliteration Mechanism for Indian Languages
	Harshit Surana and Anil Kumar Singh
12:30-14:00	Lunch
14:00-15:30	Oral session: (3 + 3 + 3)

A1-2: Parsing/Grammar
UCSG: A Wide Coverage Shallow Parsing System
	Bharadwaja Kumar G and Narayana Murthy K
Memory-Inductive Categorial Grammar: An Approach to Gap Resolution in Analytic-Language Translation
	Prachya Boonkwan and Thepchai Supnithi
Dependency Parsing with Short Dependency Relations in Unlabeled Data
	Wenliang Chen, Daisuke Kawahara, Kiyotaka Uchimoto, Yujie Zhang and Hitoshi Isahara

B1-2: Text Mining/Information Extraction (2)
Effective Compositional Model for Lexical Alignment
	Beatrice Daille and Emmanuel Morin
Determining the Unithood of Word Sequences using a Probabilistic Approach
	Wilson Wong, Wei Liu and Mohammed Bennamoun
Lexical Chains as Document Features
	Dinakar Jayarajan, Dipti Deodhare and Ravindran Balaraman

C1-2: Summarization
Entity-driven Rewrite for Multi-document Summarization
	Ani Nenkova
A New Approach to Automatic Document Summarization
	Xiaofeng Wu and Chengqing Zong
Generic Text Summarization Using Probabilistic Latent Semantic Indexing
	Harendra Bhandari, Masashi Shimbo, Takahiko Ito and Yuji Matsumoto
15:30-16:00	Break
16:00-18:00	Oral session: (4 + 4 + 3)

A1-3: Multiple Document Processing
Identifying Cross-Document Relations between Sentences
	Yasunari Miyabe, Hiroya Takamura and Manabu Okumura
Experiments on Semantic-based Clustering for Cross-document Coreference
	Horacio Saggion
Modeling Context in Scenario Template Creation
	Long Qiu, Min-Yen Kan and Tat-Seng Chua
Cross Language Text Categorization Using a Bilingual Lexicon
	Ke Wu, Xiaolin Wang and Bao-Liang Lu

B1-3: Information Retrieval
Identify Temporal Websites Based on User Behavior Analysis
	Yong Wang, Yiqun Liu, Min Zhang, Shaoping Ma and Liyun Ru
A Comparative Study for Query Translation using Linear Combination and Confidence Measure
	Youssef Kadri and Jian-Yun Nie
TSUBAKI: An Open Search Engine Infrastructure for Developing New Information Access Methodology
	Keiji Shinzato, Tomohide Shibata, Daisuke Kawahara, Chikara Hashimoto and Sadao Kurohashi
A Study on Effectiveness of Syntactic Relationship in Dependence Retrieval Model
	Ding Fan and Wang Bin

C1-3: Spoken Language Processing
Automatic Estimation of Word Significance oriented for Speech-based Information Retrieval
	Takashi Shichiri, Hiroaki Nanjo and Takehiko Yoshimi
Rapid Prototyping of Robust Language Understanding Modules for Spoken Dialogue Systems
	Yuichiro Fukubayashi, Kazunori Komatani, Mikio Nakano, Kotaro Funakoshi, Hiroshi Tsujino, Tetsuya Ogata and Hiroshi G. Okuno
Automatic Prosodic Labeling with Conditional Random Fields and Rich Acoustic Features
	Gina-Anne Levow
	Day 2: January 9th (Wed)
09:00-11:00	Oral session: (4 + 4 + 4)

A2-1: Machine Translation (1)
Chinese Unknown Word Translation by Subword Re-segmentation
	Ruiqiang Zhang and Eiichiro Sumita
Hypothesis Selection in Machine Transliteration: A Web Mining Approach
	Jong-Hoon Oh and Hitoshi Isahara
What Prompts Translators to Modify Draft Translations? An Analysis of Basic Modification Patterns for Use in the Automatic Notification of Awkwardly Translated Text
	Takeshi Abekawa and Kyo Kageura
Improving Word Alignment by Adjusting Chinese Word Segmentation
	Ming-Hong Bai, Keh-Jiann Chen and Jason S. Chang

B2-1: Text Mining/Information Extraction (3)
The Telling Tail: Signals of Success in Electronic Negotiation Texts
	Marina Sokolova, Vivi Nastase and Stan Szpakowicz
Automatic Extraction of Briefing Templates
	Dipanjan Das, Mohit Kumar and Alexander I. Rudnicky
Mining the Web for Relations between Digital Devices using a Probabilistic Maximum Margin Model
	Oksana Yakhnenko and Barbara Rosario
Learning Patterns from the Web to Translate Named Entities for Cross Language Information Retrieval
	Yu-Chun Wang, Richard Tzong-Han Tsai and Wen-Lian Hsu

C2-1: Emotion/Sentiment Analysis
Bootstrapping Both Product Features and Opinion Words from Chinese Reviews with Cross-Inducing
	Bo Wang and Houfeng Wang
Learning to Shift the Polarity of Words for Sentiment Classification
	Daisuke Ikeda, Hiroya Takamura, Lev-Arie Ratinov and Manabu Okumura
Unsupervised Classification of Sentiment and Objectivity in Chinese Text
	Taras Zagibalov and John Carroll
Using Roget's Thesaurus for Fine-grained Emotion Recognition
	Saima Aman and Stan Szpakowicz
11:00-12:30	Poster session 1 (with coffee break): (30)

Poster papers (Poster session 1):
Vaakkriti: Sanskrit Tokenizer
	Aasish Pappu and Ratna Sanyal
A Bottom Up approach to Persian Stemming
	Amir Azim Sharifloo and Mehrnoush Shamsfard
Named Entity Recognition in Bengali: A Conditional Random Field Approach
	Asif Ekbal, Rejwanul Haque and Sivaji Bandyopadhyay
An Online Cascaded Approach to Biomedical Named Entity Recognition
	Shing-Kit Chan, Wai Lam and Xiaofeng Yu
Automatic rule acquisition for Chinese intra-chunk relations
	Qiang Zhou
Japanese Named Entity Recognition Using Structural Natural Language Processing
	Ryohei Sasano and Sadao Kurohashi
Dimensionality Reduction with Multilingual Resource
	YingJu Xia, Hao Yu and Gang Zou
A Web-based English Proofing System for English as a Second Language Users
	XING YI, Jianfeng Gao and William B. Dolan
Analysis of Intention in Dialogues Using Category Trees and Its Application to Advertisement Recommendation
	Hung-Chi Huang and Hsin-Hsi Chen
Term extraction through unithood and termhood unification
	Thuy Vu, Ai Ti Aw and Zhang Min
Search Result Clustering Using Label Language model
	Yeha Lee, Seung-Hoon Na and Jong-Hyeok Lee
Effects of Related Term Extraction in Transliteration into Chinese
	HaiXiang Huang and Atsushi Fujii
A Structured Prediction Approach for Statistical Machine Translation
	Dakun Zhang, Le Sun and Wenbo Li
Method of Selecting Training Data to Build a Compact and Efficient Translation Model
	Keiji Yasuda, Ruiqiang Zhang, Hirofumi Yamamoto and Eiichiro Sumita
Large and Diverse Language Models for Statistical Machine Translation
	Holger Schwenk and Philipp Koehn
A linguistic and navigational knowledge approach to text navigation
	Javier Couto and Jean-Luc Minel
Synset Assignment for Bi-lingual Dictionary with Limited Resource
	Virach Sornlertlamvanich, Thatsanee Charoenporn, Chumpol Mokarat, Hitoshi Isahara, Hammam Riza and Purev Jaimai
Ranking words for building a Japanese defining vocabulary
	Tomoya Noro and Takehiro Tokuda
Automatically Identifying Computationally Relevant Typological Features
	William Lewis and Fei Xia
Automatic Paraphrasing of Japanese Functional Expressions Using a Hierarchically Organized Dictionary
	Suguru Matsuyoshi and Satoshi Sato
Generation of Referring Expression Using Prefix Tree Structure
	Sibabrata Paladhi and Sivaji Bandyopadhyay
Coverage-based Evaluation of Parser Generalizability
	Tuomo Kakkonen and Erkki Sutinen
Learning Reliability of Parses for Domain Adaptation of Dependency Parsing
	Daisuke Kawahara and Kiyotaka Uchimoto
Resolving Ambiguities of Chinese Conjunctive Structures by Divide-and-conquer Approaches
	Duen-Chi Yang, Yu-Ming Hsieh and Keh-Jiann Chen
Dependency Annotation Scheme for Indian Languages
	Rafiya Begum, Samar Husain, Arun Dhwaj, Dipti Misra Sharma, Lakshmi Bai and Rajeev Sangal
Non-Factoid Japanese Question Answering through Passage Retrieval that Is Weighted Based on Types of Answers
	Masaki Murata, Sachiyo Tsukawaki, Toshiyuki Kanamaru, Qing Ma and Hitoshi Isahara
A Multi-Document Multi-Lingual Automatic Summarization System
	Mohamad Ali Honarpisheh, Gholamreza Ghassem-Sani and Ghassem Mirroshandel
Summarization by Analogy: An Example-based Approach for News Articles
	Megumi Makino and Kazuhide Yamamoto
Sentence Ordering based on Cluster Adjacency in Multi-Document Summarization
	Donghong Ji
Statistical Machine Translation based Passage Retrieval for Cross-Lingual Question Answering
	Tomoyosi Akiba, Kei Shimizu and Atsushi Fujii
12:30-14:00	Lunch
14:00-15:00	Invited Talk 1: The 21st Sejong Project: With a Focus On Building of the SELK (Sejong Electronic Lexicon of Korean) and the KNC (Korean National Corpus)
	Prof Hyopil Shin, Seoul National University
15:00-16:30	Poster session 2 (with coffee break): (30)

Poster papers (Poster session 2):
Unsupervised All-words Word Sense Disambiguation with Grammatical Dependencies
	Vivi Nastase
Syntactic and Semantic Frames in PrepNet
	Saint-Dizier Patrick
Automatic Classification of English Verbs Using Rich Syntactic Features
	Lin Sun, Anna Korhonen and Yuval Krymolowski
MRD-based Word Sense Disambiguation: Further Extending Lesk
	Timothy Baldwin, Su Nam Kim, Francis Bond, Sanae Fujita, David Martinez and Takaaki Tanaka
Fast Computing Grammar-driven Convolution Tree Kernel for Semantic Role Labeling
	Wanxiang Che, Min Zhang, Aiti Aw, Chew Lim Tan, Ting Liu and Sheng Li
SYNGRAPH: A Flexible Matching Method based on Synonymous Expression Extraction from an Ordinary Dictionary and a Web Corpus
	Tomohide Shibata, Michitaka Odani, Jun Harashima, Takashi Oonishi and Sadao Kurohashi
Annotation of Multiword Expressions in the Prague Dependency Treebank
	Eduard Bejček, Pavel Straňák and Pavel Schlesinger
Learning Named Entity Hyponyms for Question Answering
	Paul McNamee, Rion Snow, Patrick Schone and James Mayfield
Errgrams -- A Way to Improving ASR for Highly Inflected Dravidian Languages
	Kamadev Bhanuprasad
Noise as a Tool for Spoken Language Identification
	Sunita Maithani and J S Rawat
Identifying Real or Fake Articles: Towards better Language Modeling
	Sameer Badaskar, Sachin Agarwal and Shilpa Arora
Multi-label Text Categorization with Model Combination based on F1-score Maximization
	Akinori Fujino and Hideki Isozaki
An Experimental Comparison of the Voted Perceptron and Support Vector Machines in Japanese Analysis Tasks
	Manabu Sassano
Learning Decision Lists with Known Rules for Text Mining
	Venkatesan Chakravarthy, Sachindra Joshi, Ganesh Ramakrishnan, Shantanu Godbole and Sreeram Balakrishnan
A re-examination of dependency path kernels for relation extraction
	Mengqiu Wang
Mining Chinese-English Parallel Corpora from the Web
	Bo Li and Juan Liu
Fast Duplicated Documents Detection using Multi-level Prefix-filter
	Kenji Tateishi and Dai Kusui
Towards Data And Goal Oriented Analysis: Tool Inter-Operability And Combinatorial Comparison
	Yoshinobu Kano, Ngan Nguyen, Rune Satre, Kazuhiro Yoshida, Keiichiro Fukamachi, Yusuke Miyao, Yoshimasa Tsuruoka, Sophia Ananiadou and Jun'ichi Tsujii
A Co-occurrence Graph-based Approach for Personal Name Alias Extraction from Anchor Texts
	Danushka Bollegala, Yutaka Matsuo and Mitsuru Ishizuka
Towards Automated Semantic Analysis on Biomedical Research Articles
	Donghui Feng, Gully Burns, Jingbo Zhu and Eduard Hovy
Large Scale Diagnostic Code Classification for Medical Patient Records
	Lucian Lita, Shipeng Yu, Stefan Niculescu and Jinbo Bi
Hacking Wikipedia for Hyponymy Relation Acquisition
	Asuka Sumida and Kentaro Torisawa
A Discriminative Approach to Japanese Abbreviation Extraction
	Naoaki Okazaki, Mitsuru Ishizuka and Jun'ichi Tsujii
Linguistic Interpretation of Emotions for Affect Sensing from Text
	Mostafa Al Masum Shaikh, Helmut Prendinger and Mitsuru Ishizuka
How to Take Advantage of the Limitations with Markov Clustering?--The Foundations of Branching Markov Clustering (BMCL)
	Hiroyuki AKAMA, Maki MIYAKE and Jaeyoung JUNG
Combining Context Features by Canonical Belief Network for Chinese Part-Of-Speech Tagging
	Hongzhi Xu and Chunping Li
Language Independent Text Correction using Finite State Automata
	Ahmed Hassan, Sara Noeman and Hany Hassan
Semantic Role Labeling of Chinese Using Transductive SVM and Semantic Heuristics
	Yaodong Chen, Ting Wang, Huowang Chen and Xishan Xu
A Comparative Study of Mixture Models for Automatic Topic Segmentation of Multiparty Dialogues
	Maria Georgescul
Exploiting Unlabeled Text to Extract New Words of Different Semantic Transparency for Chinese Word Segmentation
	Richard Tzong-Han Tsai and Hsi-Chuan Hung
16:30-17:30	Oral session: (2 + 2 + 2)

A2-2: Machine Translation (2)
Heterogeneous Automatic MT Evaluation Through Non-Parametric Metric Combinations
	Jesus Gimenez and Lluis Marquez
Paraphrasing depending on Bilingual Context Toward Generalization of Translation Knowledge
	Young-Sook Hwang, Young-Kil Kim and Sangkyu Park

B2-2: Named Entity Recognition
A Framework Based on Graphical Models with Logic for Chinese Named Entity Recognition
	Xiaofeng YU, Wai LAM and Shing-Kit CHAN
A Hybrid Feature Set based Maximum Entropy Hindi Named Entity Recognition
	Sujan Kumar Saha, Sudeshna Sarkar and Pabitra Mitra

C2-2: Relation Extraction
Effective Methods of using Web based Information for Relation Extraction
	Stanley, Wai Keong Yong and Jian Su
Minimally Supervised Learning of Semantic Knowledge from Query Logs
	Mamoru Komachi and Hisami Suzuki
	Day 3: January 10th (Thu)
09:00-10:30	Oral session: (3 + 3 + 3)

A3-1: Statistical Models/Machine Learning for NLP (1)
Learning a Stopping Criterion for Active Learning for Word Sense Disambiguation and Text Classification
	Jingbo Zhu, Huizhen Wang and Eduard Hovy
Multi-View Co-Training of Transliteration Model
	Jin-Shea Kuo and Haizhou Li
Identifying Sections in Scientific Abstracts using Conditional Random Fields
	Kenji Hirohata, Naoaki Okazaki, Sophia Ananiadou and Mitsuru Ishizuka

B3-1: Ontologies and Linguistic Resources (1)
Formalising Multi-layer Corpora in OWL DL - Lexicon Modelling, Querying and Consistency Control
	Aljoscha Burchardt, Sebastian Pado, Dennis Spohr, Anette Frank and Ulrich Heid
Constructing taxonomy of numerative classifiers for Asian languages
	Kiyoaki Shirai, Takenobu Tokunaga, Chu-Ren Huang, Shu-Kai Hsieh, Tzu-Yi Kuo, Virach Sornlertlamvanich and Thatsanee Charoenporn
Translating Compounds by Learning Component Gloss Translation Models via Multiple Languages
	Nikesh Garera and David Yarowsky

C3-1: Question Answering
Answering Definition Questions via Temporally-Anchored Text Snippets
	Marius Pasca
Corpus-based Question Answering for why-Questions
	Ryuichiro Higashinaka and Hideki Isozaki
Cluster-Based Query Expansion for Statistical Question Answering
	Lucian Lita and Jaime Carbonell
10:30-11:00	Break
11:00-12:30	Oral session: (3 + 3 + 3)

A3-2: Statistical Models/Machine Learning for NLP (2)
A Semantic Feature for Relation Recognition Using a Web-based Corpus
	ChenMing Hung
Multilingual Text Entry using Automatic Language Detection
	Yo Ehara and Kumiko Tanaka-Ishii
Using Contextual Speller Techniques and Language Modeling for ESL Error Correction
	Michael Gamon, Jianfeng Gao, Chris Brockett, Alexandre Klementiev, William B. Dolan, Dmitriy Belenko and Lucy Vanderwende

B3-2: Ontologies and Linguistic Resources (2)
Bilingual Synonym Identification with Spelling Variations
	Takashi Tsunakawa and Jun'ichi Tsujii
Minimally Supervised Multilingual Taxonomy and Translation Lexicon Induction
	Nikesh Garera and David Yarowsky
Japanese-Spanish Thesaurus Construction using English as a Pivot
	Jessica Ramirez, Masayuki Asahara and Yuji Matsumoto

C3-2: Event/Sentence Relation
Automatic Identification of Rhetorical Roles using Conditional Random Fields for Legal Document Summarization
	Saravanan M., Ravindran B. and Raman S.
Projection-based Acquisition of a Temporal Labeller
	Kathrin Spreyer and Anette Frank
Acquiring Event Relation Knowledge by Learning Cooccurrence Patterns and Fertilizing Cooccurrence Samples with Verbal Nouns
	Shuya Abe, Kentaro Inui and Yuji Matsumoto
12:30-14:00	Lunch
14:00-15:00	Invited Talk 2: Language Processing for the evolving Web
	Dr S.H. Srinivasan, Yahoo!
15:00-15:30	Break
15:30-17:00	Oral session: (3 + 3 + 3)

A3-3: Statistical Machine Translation
Refinements in BTG-based Statistical Machine Translation
	Deyi Xiong, Min Zhang, Ai Ti Aw, Haitao Mi, Qun Liu and Shouxun Lin
Simple Syntactic and Morphological Processing Can Help English-Hindi Statistical MT
	Ananthakrishnan Ramanathan, Jayprasad Hegde, Ritesh Shah, Pushpak Bhattacharyya and Sasikumar M
Statistical Translation Models for Personalized Search
	Rohini U, Vamshi Ambati and Vasudeva Varma

B3-3: Ontologies and Linguistic Resources (3)
Repurposing Theoretical Linguistic Data for Tool Development and Search
	Fei Xia and William Lewis
Computing Paraphrasability of Syntactic Variants using Web Snippets
	Atsushi Fujita and Satoshi Sato
Augmenting Wikipedia with Named Entity Tags
	Wisam Dakka and Silviu Cucerzan

C3-3: Semantic Similarity
Context Feature Selection for Distributional Similarity
	Masato Hagiwara, Yasuhiro Ogawa and Katsuhiko Toyama
Gloss-Based Semantic Similarity Metrics for Predominant Sense Acquisition
	Ryu Iida, Diana McCarthy and Rob Koeling
Benchmarking Noun Compound Interpretation
	Su Nam Kim and Timothy Baldwin
17:00-17:30	Closing session