Publications, Talks and Presentations
26.12.2013

Antti Arppe
20.12.1967


Publications

Theses and dissertations

  • Arppe, Antti 2008. Univariate, bivariate and multivariate methods in corpus-based lexicography - a study of synonymy. [PhD Dissertation] Publications of the Department of General Linguistics, University of Helsinki, No. 44. URN: http://urn.fi/URN:ISBN:978-952-10-5175-3, 300pp. (Full PDF version including Appendices: 614 pages)
  • Arppe, Antti 1995. The Strategic Opportunities of a Small High-Technology Company on Emerging Markets. Unpublished Master’s thesis. Institute of Strategy and International Marketing, Department of Industrial Management, Helsinki University of Technology.

Edited Books

  • A Man of Measure. Festschrift in Honour of Fred Karlsson on his 60th Birthday (2006). Suominen, Mickael; Arppe, Antti; Airola, Anu; Heinämäki, Orvokki; Miestamo, Matti; Määttä, Urho; Niemi, Jussi; Pitkänen, Kari K.; Sinnemäki, Kaius (Editors). Special Supplement to SKY Journal of Linguistics, Volume 19/2006. Linguistic Association of Finland, Turku.
  • Inquiries into Words, Constraints, and Contexts. Festschrift for Kimmo Koskenniemi on his 60th Birthday (2005). Arppe, Antti; Carlson, Lauri; Lindén, Krister; Piitulainen, Jussi; Suominen, Mickael; Vainio, Martti; Westerlund, Hanna; Yli-Jyrä, Anssi (Editors). CSLI Studies in Computational Linguistics ONLINE. Copestake, Ann (Series editor). ISSN 1557-5772. 330pp.

Book articles

  • Arppe, Antti (2009). Linguistic choices vs. probabilities - how much and what can linguistic theory explain? In: Featherston, Sam & Winkler, Susanne (eds.) The Fruits of Empirical Linguistics. Volume 1: Process. Berlin: de Gruyter, pp. 1-24. (Proceedings of the International Conference on Linguistic Evidence, Tübingen, Germany, 31.1.-2.2.2008).
  • Arppe, Antti 2006. Frequency Considerations in Morphology, Revisited - Finnish Verbs Differ, Too. In: A Man of Measure. Festschrift in Honour of Fred Karlsson on his 60th Birthday (2006). Suominen, Mickael; Arppe, Antti; Airola, Anu; Heinämäki, Orvokki; Miestamo, Matti; Määttä, Urho; Niemi, Jussi; Pitkänen, Kari K.; Sinnemäki, Kaius (Editors). Special Supplement to SKY Journal of Linguistics, Volume 19/2006, pp. 175-189. Linguistic Association of Finland, Turku.
  • Arppe, Antti 2005. The Very Long Way from Basic Linguistic Research to Commercially Successful Language Technology: the Case of Two-Level Morphology. In: Inquiries into Words, Constraints, and Contexts. Festschrift for Kimmo Koskenniemi on his 60th Birthday (2005). Arppe, Antti; Carlson, Lauri; Lindén, Krister; Piitulainen, Jussi; Suominen, Mickael; Vainio, Martti; Westerlund, Hanna; Yli-Jyrä, Anssi (Editors). CSLI Studies in Computational Linguistics ONLINE. Copestake, Ann (Series editor), pp. 2-17.

Refereed papers in international academic journals

  • Han, Weifeng, Antti Arppe & John Newman (accepted April 2013 for publication) Topic marking in a Shanghainese corpus: From observation to prediction. Corpus Linguistics and Linguistic Theory. [AA responsible for all the statistical analysis in this study as well as drafting the text on the quantitative results, and revising the entire text together with the two co-authors WH and JN]
  • Divjak, Dagmar & Antti Arppe (2013) Extracting prototypes from exemplars. What evidence do corpora contain for the representation of linguistic categories? Cognitive Linguistics, 24(2), 221-274 [55pp] [AA responsible for compiling and annotating the larger (Finnish) of the two corpus-based data-sets, and for all statistical analyses of both data-sets; joint responsibility for study design and interpretation, drafting and writing approx. one-third of the text]
  • Arppe, Antti, Gaëtanelle Gilquin, Dylan Glynn, Martin Hilpert & Arne Zeschel (2010). Cognitive Corpus Linguistics: Five points of debate on current theory and methodology. Corpora 5:1, pp. 1-27. [AA responsible for writing the target text for one of the five points and the response text for another, as well as participating in the drafting of the introduction and conclusion]
  • Antti Arppe & Juhani Järvikivi (2007). Every method counts - Combining corpus-based and experimental evidence in the study of synonymy. Corpus Linguistics and Linguistic Theory, 3:2, pp. 131-159. [AA responsible for corpus-analysis and practical implementation of the experiments designed by co-author JJ; AA responsible for writing all the text, including introduction and discussion, other than the analysis section of the raw experimental results]
  • Magnusson, Camilla; Arppe, Antti; Eklund, Tomas; Back, Barbro; Vanharanta, Hannu; Visa, Ari (2005). The language of quarterly reports as an indicator of change in the company's financial status. Information & Management, 42:4, pp. 561-574. [A primary refereed publication of a multidisciplinary project involving linguistics, economics and computational modeling (GILTA: 2000-2002), where I was responsible for overall study design together with HV, as well as responsible for the overall interpretation of the joint results, combining the analysis of economic performance figures and linguistic texts, from the industrial management perspective]

Other papers (reviews, commentaries, etc.) in international academic journals or books

  • Arppe, Antti & Mikko Lounela (2012). Aineiston edustavuudesta ja luotettavuudesta [On the representativeness and reliability of text materials]. In: Heikkinen, Vesa et al. (Editors). Genreanalyysi – tekstilajitutkimuksen käsikirja [Genre analysis – handbook of text genre research]. Gaudeamus, Helsinki, 302-307 [5pp] [AA responsible for the original English text, which has been translated into Finnish by ML]
  • Arppe, Antti (2009). Monta tapaa ajatella – tilastollisten menetelmien hyödyntäminen aineistolähtöisessä kielentutkimuksessa. [Many ways of thinking – applying statistical methods in corpus-based lexicography]. Virittäjä, 113:2. [8pp]
  • Antti Arppe & Juhani Järvikivi 2007. Take empiricism seriously! - In support of methodological diversity in linguistics [Commentary on Geoffrey Sampson 2007. Grammar without Grammaticality.] Corpus Linguistics and Linguistic Theory, Vol. 3, No. 1, pp. 99-109. [AA responsible for planning the argumentation and structure of the article, drafting approx. half of the text, as well as finalizing the entire text]

Refereed papers in national academic journals

  • Arppe, Antti 2002. Ei yhtä ainoaa polkua - Suomalaisia kokemuksia matkalla kieliteknologisesta tutkimuksesta liiketoimintaan. Puhe ja kieli 22:1, 37-44 (English translation)

Refereed papers in international conference proceedings

  • Snoek, Conor, Dorothy Thunder, Kaidi Lõo, Antti Arppe, Jordan Lachler, Sjur Moshagen, and Trond Trosterud 2014. Modeling the Nominal Morphology of Plains Cree. ComputEL: Workshop on the use of computational methods in the study of endangered languages, 52nd Annual Meeting of the Association for Computational Linguistics, Baltimore, Maryland, 26 June 2014.
  • Paukkeri, Mari-Sanna, Jaakko Väyrynen & Antti Arppe (2012). Exploring Extensive Linguistic Feature Sets in Near-Synonym Lexical Choice. Proceedings, Part II. Conference on Intelligent Text Processing and Computational Linguistics (CICLing), 11-17 March 2012, New Delhi, India, 1-12. [12pp]. [AA responsible for compiling and linguistically annotating the Finnish dataset used as well as providing linguistic interpretations for the computational and overall results]
  • Arppe, Antti 2002. The usage patterns and selectional preferences of synonyms in a morphologically rich language. In: Morin, Annie & Sébillot, Pascale (eds.) 2002. JADT-2002. 6th International Conference on Textual Data Statistical Analysis, March 13-15, 2002, Vol. 1, pp. 21-32. INRIA, Rennes, France.
  • Arppe, Antti 2000. Developing a Grammar Checker for Swedish. Nordgård, Torbjørn (ed.) Proceedings from the 12th Nordiske datalingvistikkdager, Trondheim, December 9-10, 1999. Department of Linguistics, Norwegian University of Science and Technology (NTNU), Trondheim, Norway.
  • Arppe, Antti. 1995. Term Extraction from Unrestricted Text. Short Paper presented at the 10th Nordic Conference of Computational Linguistics (NoDaLiDa), May 30-31, 1995. Department of General Linguistics, University of Helsinki.

Other publications (technical reports etc.)

  • Arppe, Antti, Sari Bruun, Kimmo Koskenniemi, Krister Lindén, Mikael Linden, Ville Oksanen & Hanna Westerlund (eds.) (2011). A report including Model Licensing Templates and Authorization and Authentication Scheme (Deliverable D7S-2.1). Common Language Resources and Technology Infrastructure (CLARIN: EC FP7 project no. 212230). [AA responsible in 2008-2009 for initiating and directing the multinational work on devising guidelines presented in this report, including writing introductory passages concerning the general aspects of the scheme]
  • Arppe, Antti, Sari Bruun, Krister Lindén & Hanna Westerlund (eds.) (2011). Set of Federation Agreements for CLARIN Centres (Deliverable D7S-4.1). Common Language Resources and Technology Infrastructure (CLARIN: EC FP7 project no. 212230). [AA responsible in 2008-2009 for initiating and directing the multinational work on devising the agreement set presented in this report, including directing the negotiations leading to, and thus contributing to the creation of, the agreement texts]

Refereed abstracts in international conference proceedings

  • Conor Snoek, Dorothy Thunder, Kaidi Lõo, Antti Arppe, Jordan Lachler, Juhani Järvikivi, Timothy Mills, Sjur N. Moshagen & Trond Trosterud 2014. Literacy and language learning tools for Plains Cree. Prairies Workshop on Languages and Linguistics, Brandon, Manitoba, 1 March 2014.
  • Arppe, Antti 2008. Linguistic choices vs. probabilities - how much and what can linguistic theory explain? Pre-proceedings of the International Conference on Linguistic Evidence. Tübingen, Germany, 31.1.-2.2.2008.
  • Arppe, Antti 2006. Complex phenomena deserve complex explanations (Slides). Quantitative Investigations in Theoretical Linguistics (QITL2/2006) Conference, Osnabrück, Germany, 1-2.6.2006. Also available on-line at: http://www.cogsci.uni-osnabrueck.de/~qitl/
  • Arppe, Antti 2005. On the limits of generalizing from quantitative, corpus-based evidence in a morphologically rich language. Pre-proceedings of the International Conference on Linguistic Evidence. Tübingen, Germany, 2-4.2.2006. Also available on-line at: http://www.sfb441.uni-tuebingen.de/LingEvid2006/abstracts/arppe.pdf
  • Arppe, Antti 2004. Every method makes a difference - describing the use of a Finnish synonym pair. Pre-proceedings of International Conference on Linguistic Evidence, January 29 - 31, 2004, Tübingen, Germany.
  • Arppe, Antti & Järvikivi, Juhani 2002. Verbal Synonymy in Practice: Combining Corpus-Based and Psycholinguistic Evidence (Slides). Workshop on Quantitative Investigations in Linguistics (QITL1/QITL-2002), Osnabrück, Germany, 3-5.10.2002. Also available on-line at: http://www.cogsci.uni-osnabrueck.de/~qitl/

Non-refereed papers in international conference proceedings

  • Arppe, Antti 2007. Multivariate methods in corpus-based lexicography: A study of synonymy in Finnish. The Fourth Biennial Corpus Linguistics 2007 Conference, July 28-30, 2007, Birmingham, UK.
  • Arppe, Antti 2005. Morphological features as “context” in distinguishing semantically similar words. Available online at: Proceedings from the Corpus Linguistics Conference Series, Vol. 1, no. 1, ISSN 1747-9398. Third Biennial Corpus Linguistics 2005 Conference, July 14-17, 2005, Birmingham, UK.
  • Arppe, Antti 2001. Focal points in frequency profiles - how some word forms in a paradigm are more significant than others in Finnish. Proceedings of the 6th Conference on Computational Lexicography and Corpus Research, June 28-30, 2001, University of Birmingham, Birmingham, UK.
  • Arppe, Antti 2001. Corpus-Based Observations on the Use and Occurrence of Inflected Forms in Finnish - the Case of Synonyms. Niemi, Jussi & Heikkinen, Janne (eds.) Nordic and Baltic Morphology. Papers from a NorFA Course, Tartu, June 2000. Studies in Languages 36, University of Joensuu, Faculty of Humanities.
  • Arppe, Antti 2001. Lärdomar från utveckling av inflekterande synonymordböcker. Gellerstam, Martin et al. (eds.) Nordiska studier i Lexikografi 5. Rapport från 'Konferens om lexikografi i Norden', Gothenburg, May 26-29, 1999. Skrifter utgivna av Nordiska föreningen för lexikografi (6) i samarbete med Nordiska språkrådet och Meijerbergs institut, Gothenburg, Sweden.
  • Arppe, Antti; Voipio, Mari; Würtz, Malene. 2000. Creating Inflecting Electronic Dictionaries. Lindberg, Carl-Erik & Lund, Steffen Nordahl (eds.) 17th Scandinavian Conference of Linguistics, Nyborg August 20-22, 1998. Odense Working Papers in Language and Communication No 19, April 2000, Vol 1. University of Southern Denmark, Odense, Denmark.
  • Ahmad, Khurshid; Ogonowski, Antoine; Dauphin, Eva; Sta, Jean-David; Arppe, Antti 1996. Engineering Terminology - A Case for a Linguistically Informed Database. Proceedings of the 4th Congress on Terminology and Knowledge Engineering (TKE), Vienna, 1996.
  • Arppe, Antti. 1994. Translators' Aids: User Profiles (main text chapter + appendix chapter). In: McNaught, John et al. (Editors). LRE EAGLES report. Commission of the European Union, Luxembourg.

Nonrefereed papers in national conference proceedings

  • Arppe, Antti 1996. Information Explosion and the Use of Linguistic Tools in Finland. Terttu Harakka and Merja Koskela (eds.) Kieli ja tietokone. Association Finlandais de Linguistique Appliquée, Yearbook 54. Jyväskylä, Finland.

Software and datasets

  • Antti Arppe (2013). polytomous: Polytomous logistic regression for fixed and mixed effects. R package version 0.1.6. URL: http://CRAN.R-project.org/package=polytomous
  • Antti Arppe, Petar Milin, R. Harald Baayen and with contributions from Peter Hendrix (2012). ndl: Naive Discriminative Learning. R package version 0.1.6. URL: http://CRAN.R-project.org/package=ndl
  • amph (2008). A micro-corpus of 3404 occurrences of the four most common Finnish THINK lexemes, ajatella, miettiä, pohtia, and harkita, in Finnish newspaper and Internet newsgroup discussion texts, containing extracts and linguistic analysis of the relevant context in the original corpus data, scripts for processing this data, R functions for its statistical analysis, as well as a comprehensive set of ensuing results as R data tables. Compiled and analyzed by Antti Arppe. Available on-line at URL: http://www.csc.fi/english/research/software/amph/

Web publications:

  • Arppe, Antti 2002. Forward with Feet on the Ground - Speech Technology the Finnish Way. (Formerly: Crossing the Chasm - Speech Technology the Finnish Way). Available originally at: URL: http://www.hltcentral.org/page-1054.0.shtml
  • Arppe, Antti 2002. Kärki on kapea mutta kärjen tuntumassa ollaan - Puheteknologiaa suomalaisittain. URL: http://www.csc.fi/euromap/artikkelit/puheteknologia.phtml.fi
  • Arppe, Antti 2002. No Single Path - Finnish Lessons in the Commercialization of Language Technology Research. (Formerly: Finnish Lessons: No Single Path to Market). Available originally at: URL: http://www.hltcentral.org/page-969.shtml
  • Arppe, Antti 2002. Ei yhtä ainoaa polkua - Suomalaisia kokemuksia matkalla kieliteknologisesta tutkimuksesta liiketoimintaan. URL: http://www.csc.fi/euromap/artikkelit/arppe.phtml.fi
  • Arppe, Antti; Birn, Jussi; Westerlund, Fredrik. 1998. Lingsoft's Swedish Grammar Checker. URL: http://www.lingsoft.fi/doc/swegc/

Nonrefereed abstracts in international conference or workshop proceedings

  • Arppe, Antti 2004. Kuinka nähdä metsä puilta - piirteiden kombinatoriikasta leksikografisessa tutkimuksessa. FINEST Conference of Linguistics, 6-7.5.2004, Tallinn, Estonia.
  • Arppe, Antti & Kenttä, Reetta 2004. What you ask is what you get in an experiment - or do you? An auto-autopsy of a multimethodological linguistic study. Workshop On the necessity of Experimental Methods in Semantics, 20th Scandinavian Conference of Linguistics, 7-9.1.2004, University of Helsinki, Helsinki, Finland.
  • Arppe, Antti 2002. Variation in an agglutinative language - the case of inflected forms of a synonym pair in Finnish. The 19th Scandinavian Conference of Linguistics, University of Tromsø, 10-12.1.2002. Tromsø, Norway.
  • Arppe, Antti 2001. Word form selection in a morphologically rich language - the case of inflected forms of a synonym pair in Finnish. On top-down functionalism and bottom-up corpus research: can they meet? Workshop 31.8.2001. 34th SLE Meeting, Leuven, Belgium.
  • Arppe, Antti 2000. Grammatical descriptions vs. corpora: The use of the dual category in four northern Uralic languages (Ume Saamic, Khanty, Nenets and Selkup). SKY Symposium on Parts of Speech in and across Languages. 17-19.8.2000, Helsinki. URL: http://www.ling.helsinki.fi/sky/posabstr.html

Nonrefereed abstracts in national conferences or workshop proceedings

  • Arppe, Antti 2005. Morfologisten piirteiden rooli semanttisesti samankaltaisia sanoja erottavana kontekstikategoriana. XXXII Kielitieteen päivät, May 19-20, 2005, Tampere, Finland.
  • Arppe, Antti 2003. Yksilön ääni empiirisessä korpusaineistossa. Kielitiede - ihmistiede, tiede? -työpaja, XXX Kielitieteen päivät, May 15-16, 2003, Joensuu, Finland.
  • Arppe, Antti 2003. Empiirisen aineiston representatiivisuudesta kielentutkimuksessa. Yksilön ja ryhmän välissä - miten kognitiivisia järjestelmiä yleistetään - FiCLAn työpaja, XXX Kielitieteen päivät, May 15-16, 2003, Joensuu, Finland.
  • Arppe, Antti 2001. Sanojen (merkityksen) ja taivutusprofiilien yhteydestä - kiinnostavia yksittäistapauksia vai laajempaa säännönmukaisuutta? XXVII Kielitieteen päivät, Jyväskylä, May 17-18, 2001.
  • Arppe, Antti 2000. Voidaanko huonosti yleisimmin ensimmäisessä persoonassa vai imperfektissä? Merkityksen ja taivutuksen vuorovaikutuksesta. Kieli keskellä kognitiota - Suomen kognitiivisen kielitieteen yhdistyksen Syysseminaari October 13-14, 2000, Viking Amorella. URL: http://www.helsinki.fi/jarj/ficla/s00/arppe.html

Translations

  • English to Finnish: Naumanen, Mika 2002. Nuorten teknologiayritysten menestystekijät (Sitra reports series, ISSN 1457-571X; 28). Edita Publishing Oy, Helsinki. ISBN 951-37-3819-1

Invited Talks and Other Presentations:

  • March 13, 2010. Towards polytomous mixed effects regression analysis and modeling. Center for Comparative Psycholinguistics (CCP), Department of Linguistics, University of Alberta, Canada.
  • February 26, 2010. Polytomous logistic regression analysis and modeling of linguistic alternations. Center for Comparative Psycholinguistics (CCP), Department of Linguistics, University of Alberta, Canada.
  • February 10, 2010. Exemplars, prototypes, or both - what evidence do corpora contain for the representation of linguistic categories? Linguistic Evidence 2010, University of Tübingen, Germany.
  • April 21, 2005. Produktutveckling av språkgranskningsprogram. Seminarium om språkkontrollprogram [Seminar on Proofing tools], April 21-22, 2005. Nordens språkråd, Hotell Kalkstrand, Pargas, Finland.
  • October 18, 2003. Varför är språkkontrollprogram sådana som de är? Hur kunde de bli bättre eller nyttigare? Seminar of Nordic language councils on language technology, October 17-19, 2003. Nordisk språkråd, Schæffersgården, Copenhagen, Denmark.
  • March 27, 2003. What has worked and what hasn't in commercializing language technology - a Finnish experience. Information Day: "Language Technology: Technological Innovations and Business Solutions" (Wrap-up seminar of the Greek national Euromap project). Amphitheatre "Zerva" National Documentation Centre, Vas. Constantinou 48 av., Athens, Greece. URL: http://www.ilsp.gr/hope_files/en/imerida27_03_03.htm
  • February 6, 2003. Bruket av finska verbsynonymer i praktiken enligt korpuslingvistiska och psykolingvistiska metoder. Doctoral seminar, Department of Nordic languages, University of Helsinki.
  • January 27, 2000. Humanistisesta perustutkimuksesta kaupallisesti menestyksekkääseen liiketoimintaan. Ohjelmistojen ja add-in-komponenttien lisensointi kansainvälisille ohjelmistotaloille. Yritysesimerkki: Lingsoft Oy. Ohjelmistojen tuotteistaminen -kurssi, Department of Computer Science, University of Helsinki.
  • May 7, 1999. Den långa vägen från lingvistisk teori till kommersiell verksamhet (The long road ...), Language technology day, University of Gothenburg, Department of General Linguistics.
  • April 29, 1999. Den långa vägen från lingvistisk teori till kommersiell verksamhet (The long road ...), University of Lund, Department of General Linguistics.
  • March 24, 1999. Den långa vägen från lingvistisk teori till kommersiell verksamhet (The long road from linguistic theory to commercial activity), University of Uppsala, Department of General Linguistics.
  • March 22, 1999. Final panel, Temadag om datorstödd språkgranskning (Theme day on computer-aided language checking), Royal Institute of Technology, Stockholm, NADA (Numerisk analys och datalogi).

Other publications and published opinions

  • Niukkuus ja epävarmuus kannustavat? Helsingin Sanomat, 7.4.2005.
  • Onko makkaratehdas paras esikuva yliopistoille? Kanava, 4-5/2005, 2.6.2005.
  • Sopiiko markkinatalouden malli julkiselle sektorille?, Helsingin Sanomat, 16.10.2006. [With Tiina Arppe, Jussi Pakkasvirta and Martti Vainio].
  • Tieteellinen omavaraisuus turvattava, Suomen Kuvalehti, 3.11.2006. [With Tiina Arppe, Jussi Pakkasvirta and Martti Vainio] (Unabridged version).

Miscellaneous

    Scripts/functions/data for the R Statistical Computing Environment


    Last updated December 26, 2013 by Antti Arppe