Roles and responsibilities
I am a principal research officer (PRO) in the Multilingual Text Processing team at NRC, Digital Technologies. I am also currently heading the team, since 2017.
I occasionally manage R&D projects or client projects with institutional (other gvt organizations or DARPA, e.g.) or commercial partners.
Current research and/or projects
- Detecting change in online conversations using linguistic processing and changepoint detection.
- Statistical models of text categorization.
- Deriving nerw functionalities from the Additive Factor Model.
Research and/or project statements
I work on designing and building statistical models applied to Natural Language Processing, and Educational Data Mining.
In NLP, I am interested in uncovering non-obvious properties of text or writers, such as language variants or native language of ESL writers. I also investigate change and event detection from streams of textual documents.
In EDM, I am trying to use statistical models to uncover the mechanisms of human learning.
Ph.D in Artificial Intelligence and Pattern Recognition, Université Paris 6 (Pierre et Marie Curie), 1997.
DEA (M.Sc.) in Computer Science, École Nationale Supérieure de Techniques Avancées (Ensta), 1992.
M.Eng., École Nationale Supérieure de Techniques Avancées (Ensta), 1992.
Association for Computational Linguistics (ACL)
"Test of Time Award", awarded at the European Conference on Information Retrieval (ECIR) in 2016 in Padova, for the paper « A probabilistic interpretation of precision, recall and F-score, with implication for evaluation » co-authored with Eric Gaussier and published at ECIR 2005. This award recognizes work published ten years earlier that is still regularly cited in several fields outside IR.
Inventions and patents
A more complete list is available on Scholar.
Cyril Goutte, Serge Léger, Shervin Malmasi and Marcos Zampieri (2016) Discriminating Similar Languages: Evaluations and Explorations, Proceedings of the 10th Language Resources and Evaluation Conference (LREC-2016), pp. 1800-1807.
Cyril Goutte, Marine Carpuat and George Foster (2012) The Impact of Sentence Alignment Errors on Phrase-Based Machine Translation Performance, Tenth Biennial Conference of the Association for Machine Translation in the Americas (AMTA-2012).
Massih-Reza Amini, Nicolas Usunier and Cyril Goutte (2009) Learning from Multiple Partially Observed Views -- an Application to Multilingual Text Categorization, Advances in Neural Information Processing Systems (NIPS'09).
Michel Simard, Cyril Goutte and Pierre Isabelle (2007) Statistical Phrase-based Post-editing, Human Language Technologies: The Annual Conference of the North American Chapter of the Association for Computational Linguistics (NAACL-HLT 2007).
Cyril Goutte and Eric Gaussier (2005) A Probabilistic Interpretation of Precision, Recall and F-score, with Implication for Evaluation, in D.E. Losada and J.M. Fernandez-Luna (eds), Advances in Information Retrieval - 27th European Conference on IR Research (ECIR'05), Lecture Notes in Computer Science 3408, Springer, pp. 345-359.
Previous work experience
I was fortunate to have the opportunity to work for the following organisations:
- 2001-2006, Xerox Research Centre Europe (XRCE): Researcher and Senior Scientist, Machine Learning for Multilingual Access.
- 2000-2001, National Research Institute for the Digital Sciences (Inria).
- 1999-2000, Nokia Mobile Phones.
- 1995-1999, Technical University of Denmark (DTU).
International experience and/or work
I worked in the Copenhagen (Denmark) area from 1995 to 2000, first at the Technical University of Denmark (DTU), then at Nokia Mobile Phones R&D.
I worked in the Grenoble (France) region from 2000 to 2006, first at Inria, then at Xerox Research Centre Europe (XRCE).