Innovative hybrid approaches to the processing of textual data
EACL 2012 Workshop
April 23, 2012, Avignon, France

Program

The poster size is ISO A0
(see instructions here

Important dates
Organization Committee
  • Natalia Grabar, CNRS UMR 8163 STL, Université Lille 1&3, France
  • Marie Dupuch, CNRS UMR 8163 STL, Université Lille 1&3, France
  • Amandine Périnet, LIM&BIO, Université Paris 13, France
  • Thierry Hamon, LIM&BIO, Université Paris 13, France
Submission instructions:
Authors are invited to submit full papers on original, unpublished work in the topic area of this workshop.
Submissions should be formatted using the EACL 2012 stylefiles for latex or MS Word, with blind review and not exceeding 8 pages plus an extra page for references.
The PDF files will be submitted electronically at https://www.softconf.com/eacl2012/hybrid2012/
The workshop Hybrid 2012 is sponsored by:
CNRS logo STL
								    logo LIMBIO logo
Lille3
								 logo           P13 logo

Program

Monday April 23, 2012
09:00    Introduction
09:10    Session 1
09:10    Experiments on Hybrid Corpus-Based Sentiment Lexicon Acquisition
Goran Glavaš,  Jan Šnajder,  Bojana Dalbelo Bašić
University of Zagreb, FER
09:40    A Study of Hybrid Similarity Measures for Semantic Relation Extraction
Alexander Panchenko and Olga Morozova
Center for Natural Language Processing, Catholic University of Louvain
10:10    Coffee break
10:30    Session 2
10:30    Hybrid Combination of Constituency and Dependency Trees into an Ensemble Dependency Parser
Nathan Green and Zdeněk Žabokrtský
Charles University in Prague
11:00    Describing Video Contents in Natural Language
Muhammad Usman Ghani Khan and Yoshihiko Gotoh
The University of Sheffield
11:30    An Unsupervised and Data-Driven Approach for Spell Checking in Vietnamese OCR-scanned Texts
Cong Duy Vu Hoang and Ai Ti Aw
Institute for Infocomm Research (I2R), A*STAR, Singapore
12:00    Short Presentation: Posters
12:30    Lunch break
14:00    Invited speaker
14:00    Multilingual Natural Language Processing
Rada Mihalcea
University of North Texas, USA
15:30    Coffee break and Poster Session
   Contrasting Objective and Subjective Portuguese Texts from Heterogeneous Sources
Michel Généreux1 and William Martinez2
1CLUL, 2ILTEC
   A Joint Named Entity Recognition and Entity Linking System
Rosa Stern1,  Benoît Sagot2,  Frédéric Béchet3
1AFP/INRIA/Paris 7, 2INRIA, 3LIF
   Collaborative Annotation of Dialogue Acts: Application of a New ISO Standard to the Switchboard Corpus
Alex C. Fang1,  Harry Bunt2,  Jing Cao3,  Xiaoyue Liu1
1City University of Hong Kong, 2Tilburg University, 3City University of Hong Kong, Zhongnan University of Economics and Law
   Coupling Knowledge-Based and Data-Driven Systems for Named Entity Recognition
Damien Nouvel,  Jean-Yves Antoine,  Nathalie Friburger,  Arnaud Soulet
Université François Rabelais Tours, Laboratoire d'Informatique
   A Random Forest System Combination Approach for Error Detection in Digital Dictionaries
Michael Bloodgood,  Peng Ye,  Paul Rodrigues,  David Zajic,  David Doermann
University of Maryland
   Methods Combination and ML-based Re-ranking of Multiple Hypothesis for Question-Answering Systems
Arnaud Grappy,  Brigitte Grau,  Sophie Rosset
LIMSI-CNRS
   A Generalised Hybrid Architecture for NLP
Alistair Willis,  Hui Yang,  Anne De Roeck
The Open University, UK
16:30    Session 3
16:30    Incorporating Linguistic Knowledge in Statistical Machine Translation: Translating Prepositions
Reshef Shilon1,  Hanna Fadida2,  Shuly Wintner3
1Tel Aviv University, 2Technion, 3University of Haifa
17:00    Combining Different Summarization Techniques for Legal Text
Filippo Galgani,  Paul Compton,  Achim Hoffmann
the University of New South Wales
17:30    Closing

Invited speaker

Accepted Papers

Accepted papers

Call

The hybrid approach term covers a large set of situations in which different approaches are combined in order to better process textual data and to attempt a better achievement of the dedicated task.
Among the hybridizations the possible combinations are unlimited. The most frequent combination, as stressed during The Balancing Act in 1994, addressed machine learning and rule-based systems. Beyond this, the hybridization can be augmented with distributionnal approaches, syntactic and morphological analyses, semantic distances and similarities, graph theory models, cooccurrences of linguistic units (e.g., word and their dependencies, word senses and pos-tag, NEs and semantic roles,...), knowledge-based approaches (terminologies and ontologies), etc.
As a matter of fact, the hybridization implies to define a strategy to efficiently combine several approaches: cooperation between approaches, filtering, voting or ranking of the multiple system outputs, etc.
Indeed, the combination of these different methods and approaches appears to provide more complete and performant results. The reason is that each method is sensitive and efficient with given data and within given contexts. Hence, their combination may improve both precision and recall. The coverage is indeed improved, while the exploitation of different methods may also lead to the improvement of the precision since their use within filtering, voting etc. modes becomes possible.
In this workshop, we favour the extended meaning of the hybridization of methods, applied to various application areas, such as (but do not feel constrained by these):
  • automatic creation of linguistic resources
  • POS tagging
  • building and structuring of terminologies
  • information retrieval and filtering
  • information extraction
  • linguistic annotation
  • semantic labeling
  • sign language recognition and transcription
  • oral data transcription
  • filtering and validation of lexical resources
  • text summarization
  • question/answering system
  • natural language generation
  • etc.
We invite authors to submit novel methods and novel conceptions of the hybridization performed in various areas related to the textual data processing.
Program Committee
  • Anders Ardö, EIT, LTH, Lund University, Sweden
  • Delphine Bernhard, LiLPa, Université de Strasbourg, France
  • Wray Duntine, NICTA, Australia
  • Philipp Cimiano, CITEC, University of Bielefeld, Germany
  • Vincent Claveau, IRISA-CNRS, Rennes, France
  • Kevin Cohen, University of Colorado Health Sciences Center, USA
  • Marie-Claude l'Homme, OLST, Université de Montreal, Canada
  • Béatrice Daille, Université de Nantes, LINA, France
  • Stefan Th. Gries, University of California, Santa Barbara, USA
  • Anna Kazantseva, University of Ottawa, Canada
  • Mikaela Keller, CNRS LIFL UMR8022, Mostrare INRIA, Université Lille 1&3, France
  • Alistair Kennedy, University of Ottawa, Canada
  • Ben Leong, University of North Texas, USA
  • Pierre Nugues, CS, LTH, Lund University, Sweden
  • Bruno Pouliquen, WIPO, Geneva, Switzerland
  • Sampo Pyysalo, National Centre for Text Mining, University of Manchester, United Kingdom
  • Mathieu Roche, LIRMM, Université de Montpellier 2, France
  • Patrick Ruch, Haute école de gestion de Genève, Switzerland
  • Paul Thompson, National Centre for Text Mining, University of Manchester, United Kingdom
  • Juan-Manuel Torres Moreno, LIA, Université d'Avignon et des Pays de Vaucluse, France
  • Özlem Uzuner, University at Albany, State University of New York, USA