Publikationen
ZORA-Abfrage
ZORA Publication List
Download Options
Publications
-
Digital Dickens: An automated content analysis of Charles Dickens’ novels. In: Buschfeld, Sarah; Ronan, Patricia; Neumaier, Theresa; Wellinghoff, Andreas; Westermayer, Lisa. Crossing Boundaries through Corpora: Innovative corpus approaches within and beyond linguistics. Amsterdam: John Benjamins Publishing, 62-98.
-
Automatically detecting directives with SPICE Ireland. In: Schweinberger, Martin; Ronan, Patricia. Socio-Pragmatic Variation in Ireland: Using Pragmatic Variation to Construct Social Identities. Berlin: De Gruyter, 205-234.
-
Text Analytics for Corpus Linguistics and Digital Humanities: Simple R Scripts and Tools. London: Bloomsbury Academic.
-
The Visualisation and Evaluation of Semantic and Conceptual Maps. In: Laitinen, Mikko; Tyrkkö, Jukka. Linguistics across Disciplinary Borders: The March of Data. London: Bloomsbury Publishing, 67-94.
-
Investigating child language acquisition from a joint perspective: A comparison of traditional and new L1 speakers of English. In: Schmalz, Mirjam; Vida-Mannl, Manuela; Buschfeld, Sarah. Acquisition and Variation in World Englishes: Bridging Paradigms and Rethinking Approaches. Berlin: De Gruyter, 133-157.
-
“To boldly go where no man has gone before”: how iconic is the Star Trek split infinitive?. Linguistics Vanguard, 9(s3):247-255.
-
Colloquialisation, compression and democratisation in British parliamentary debates. In: Korhonen, Minna; Kotze, Haidee; Tyrkkö, Jukka. Exploring Language and Society with Big Data: Parliamentary discourse across time and space. Amsterdam: John Benjamins Publishing, 336-372.
-
Differences in syntactic annotation affect retrieval. International Journal of Corpus Linguistics, 28(3):378-406.
-
Detecting and Analysing Learner Difficulties Using a Learner Corpus Without Error Tagging. In: Harrington, Kieran; Ronan, Patricia. Demystifying Corpus Linguistics for English Language Teaching. Cham: Palgrave Macmillan, 229-257.
-
Replicable semi-supervised approaches to state-of-the-art stance detection of tweets. Information Processing & Management, 60(2):103199.
-
Assessing How Attitudes to Migration in Social Media Complement Public Attitudes Found in Opinion Surveys. SPELL: Swiss Papers in English Language and Literature, 41:119-153.
-
Systematically Detecting Patterns of Social, Historical and Linguistic Change: The Framing of Poverty in Times of Poverty. Transactions of the Philological Society, 120(3):447-473.
-
Medical topics and style from 1500 to 2018. In: Hiltunen, Turo; Taavitsainen, Irma. Corpus pragmatic studies on the history of medical discourse. Amsterdam: Benjamins, 49-78.
-
Recent changes in spoken British English according to spoken BNC2014. In: Flach, Susanne; Hilpert, Martin. Broadening the spectrum of corpus linguistics: New approaches to variability and change. Amsterdam: John Benjamins Publishing, 173-195.
-
Measuring Attitudes to Migration in the Media automatically with Complementary Data Sources and Methods. In: Ronan, Patricia; Ziegler, Evelyn. Approaches to Migration and Language Identity. Oxford, Bern, Berlin, Bruxelles, New York, Wien: Peter Lang, 207-252.
-
Comparing data-driven to corpus-based approaches for diachronic variation: document-classification and overuse metrics. In: Schlüter, Julia; Schützler, Ole. Data and Methods in Corpus Linguistics: Comparative Approaches. Cambridge: Cambridge University Press, 291-322.
-
Syntactic changes in verbal clauses and noun phrases from 1500 onwards. In: Los, Bettelou; Cowie, Claire; Honeybone, Patrick. English Historical Linguistics: Change in Structure and Meaning. Amsterdam: John Benjamins Publishing, 163-200.
-
With a little help from familiar interlocutors: real-world language use in young and older adults. Aging & Mental Health, 25(12):2310-2319.
-
Pluralized non-count nouns across Englishes: a corpus-linguistic approach to dialect typology. Corpus Linguistics and Linguistic Theory, 16(3):515-546.
-
Linear and Non-Linear Age Trajectories of Language Use: A Laboratory Observation Study of Couples' Conflict Conversations. Journals of Gerontology, Series B: Psychological Sciences and Social Sciences, 75(9):e206-e214.
-
Changes in society and language: charting poverty. In: Rautinaho, Paula; Nurmi, Arja; Klemola, Juhani. Corpora and the changing society: studies in the evolution of English. Amsterdam: John Benjamins Publishing, 29-56.
-
Using Multilingual Resources to Evaluate CEFRLex for Learner Applications. In: 12th Conference on Language Resources and Evaluation (LREC 2020), Marseille, 11 May 2020 - 16 May 2020. European Language Resources Association, 346-355.
-
Spelling normalisation of Late Modern English: comparison and combination of VARD and character-based statistical machine translation. In: Kytö, Merja; Smitterberg, Eric. Late Modern English: novel encounters. Amsterdam: John Benjamins Publishing, 243-268.
-
A Man who Was Just an Incredible Man, an Incredible Man: Age Factors and Coherence in Donald Trump’s Spontaneous Speech. In: Schneider, Ulrike; Eitelmann, Matthias. Linguistic Inquiries into Donald Trump’s Language : From ‘Fake News’ to ‘Tremendous Success’. London: Bloomsbury, 62-84.
-
Statistics for Linguists: A patient, slow-paced introduction to statistics and to the programming language R. Zurich: Digitale Lehre und Forschung UZH.
-
Enhancing the linguistic discovery potential of historical corpora: a twin-track approach using ARCHER. In: CL 2019 International Corpus Linguistics Conference, Cardiff, Wales, UK, 22 Juli 2019 - 26 Juli 2019, Gossip Theme.
-
Topics of eighteenth-century medical writing with triangulation of methods: LMEMT and the underlying reality. In: Taavitsainen, Irma; Hiltunen, Turo. Late Modern English medical texts: writing medicine in the eighteenth century (Including the LMEMT Corpus). Amsterdam: John Benjamins Publishing, 31-74.
-
Statistical MWE-aware parsing. In: Parmentier, Yannick; Waszczuk, Jakub. Representation and parsing of multiword expressions: current trends. Berlin: Language Science Press, 147-182.
-
Scholastic argumentation in Early English medical writing and its afterlife: new corpus evidence. In: Suhr, Carla; Nevalianen, Terttu; Taavitsainen, Irma. From data to evidence in English language research. Leiden: Brill, 191-221.
-
NLP Corpus Observatory – Looking for Constellations in Parallel Corpora to Improve Learners’ Collocational Skills. In: 7th Workshop on NLP for Computer Assisted Language Learning at SLTC 2018 (NLP4CALL 2018), Stockholm, 7 November 2018 - 7 November 2018, 69-78.
-
Detecting innovations in a parsed corpus of learner English. In: Deshors, Sandra C.; Götz, Sandra; Laporte, Samanantha. Rethinking linguistic creativity in non-native Englishes. Amsterdam: John Benjamins Publishing, 47-74.
-
Differences between Swiss High German and German High German via data-driven methods. In: 3rd Swiss Text Analytics Conference (SwissText 2018), Winterthur, Switzerland, 12 June 2018 - 13 June 2018. CEUR-WS, 17-25.
-
Differences between Swiss High German and German German via data-driven methods. In: SwissText 2018: 3rd Swiss Text Analytics Conference, Winterthur, 12 Juni 2018 - 13 Juni 2018.
-
From Lexical Bundles to Surprisal and Language Models: measuring the idiom principle on native and learner language. In: Kopaczyk, Joanna; Tyrkkö, Jukka. Applications of Pattern-driven Methods in Corpus Linguistics. Amsterdam: Benjamins, 15-56.
-
Tools and Methods for Processing and Visualizing Large Corpora. Studies in Variation, Contacts and Change in English, 19:online.
-
Measuring Encoding Efficiency in Swedish and English Language Learner Speech Production. In: Interspeech 2017, Stockholm, 19 August 2017 - 24 August 2017. ISCA, 1779-1783.
-
Saying Whatever It Takes: Creating and Analyzing Corpora from US Presidential Debate Transcripts. In: Corpus Linguistics Conference 2017, Birmingham, 25 Juli 2017 - 28 Juli 2017, 537-544.
-
Comparing Rule-based and SMT-based Spelling Normalisation for English Historical Texts. In: Proceedings of the NoDaLiDa 2017 Workshop on Processing Historical Language, Gothenburg, 22 Mai 2017 - 22 Mai 2017, 40-46.
-
Statistical sequence and parsing models for descriptive linguistics and psycholinguistics. In: Timofeeva, Olga; Chevalier, Sarah; Gardner, Anne-Christine; Honkapohja, Alpo. New Approaches in English Linguistics : Building Bridges. Amsterdam: John Benjamins Publishing, 281-320.
-
Introduction - The New Energy Crisis : Climate, Economics and Geopolitics. In: Timofeeva, Olga; Gardner, Anne-Christine; Honkapohja, Alpo; Chevalier, Sarah. New Approaches in English Linguistics : Building Bridges. Amsterdam: Springer, 1-12.
-
Part-Of-Speech in Historical Corpora: Tagger Evaluation and Ensemble Systems on ARCHER. In: KONVENS 2016, Bochum, 19 September 2016 - 21 September 2016, RUB.
-
Detecting innovations in a parsed corpus of learner english. International Journal of Learner Corpus Research, 2(2):177-204.
-
Introduction - New Approaches to English Linguistics : Building bridges. In: Timofeeva, Olga; Gardner, Anne-Christine; Honkapoja, Alpo; Chevalier, Sarah. New Approaches to English Linguistics : Building bridges. Amsterdam: John Benjamins Publishing, 1-12.
-
Determining light verb constructions in contemporary British and Irish English. International Journal of Corpus Linguistics, 20(3):326-354.
-
Review of Automatic Treatment of Learner Corpus Data, Ana Diaz Negrillo, Nicolas Ballier and Paul Thompson, eds. (2013). International Journal of Learner Corpus Research, (1):172-177.
-
Parsing early and late modern English corpora. Literary and Linguistic Computing, 30(3):423-439.
-
Of-genitive versus s-genitive: A corpus-based analysis of possessive constructions in 20thcentury English. In: Bennett, Paul; Durrell, Martin; Scheible, Silke; Whitt, Richard J. New Methods in Historical Corpora. Tübingen: Narr Verlag, 163-180.
-
Investigating Irish English With ICE-Ireland. Cahiers de l'institut de linguistique et des sciences du langage, 38(2013):137-162.
-
Discovering new verb-preposition combinations in New Englishes. Studies in Variation, Contacts and Change in English, 13:online.
-
Dependency bank. In: LREC 2012 Conference Workshop "Challenges in the Management of Large Corpora", Istanbul, Turkey, 22 May 2012 - 22 May 2012, 23-28.
-
Using semantic resources to improve a syntactic dependency parser. In: LREC 2012 Conference Workshop "Semantic Relations II", Istanbul, Turkey, 22 May 2012 - 22 May 2012, 67-76.
-
Adapting a parser to historical English. Helsinki: University of Helsinki.
-
BNC Dependency Bank 1.0. In: Oksefjell, Signe; Ebeling, Jarle; Hasselgard, Hilde. Aspects of corpus linguistics: compilation, annotation, analysis. Helsinki: Research Unit for Variation, Contacts, and Change in English, online.
-
Semantic corpus trawling: Expressions of “courtesy” and “politeness” in the Helsinki Corpus. In: Suhr, Carla; Taavitsainen, Irma. Developing Corpus Methodology for Historical Pragmatics. Helsinki: Research Unit for Variation, Contacts and Change in English, 1.
-
Relative complexity in scientific discourse. English Language and Linguistics, 16(2):209-240.
-
"Off with their heads". Profiling TAM in ICE corpora. In: Hundt, Marianne; Gut, Ulrike. Mapping Unity and Diversity World-Wide. Corpus-Based Studies of New Englishes. Amsterdam: John Benjamins, 1-34.
-
Retrieving relatives from historical data. Literary and Linguistic Computing, 27(1):3-16.
-
Using automatically parsed corpora to discover lexico-grammatical features of English varieties. In: 30th International Conference on Lexis and Grammar, Nicosia, Cyprus, 5 October 2011 - 8 October 2011, 251-258.
-
Detection of interaction articles and experimental methods in biomedical literature. BMC Bioinformatics, 12(Suppl 8):S13.
-
Text-Mining-Methoden im Semantic Web. Wirtschaftsinformatik und Management, 3:28-35.
-
A large-scale investigation of verb-attached prepositional phrases. Helsinki: University of Helsinki.
-
A data-driven approach to alternations based on protein-protein interactions. In: III Congreso Internacional de Lingüística de Corpus, Valencia, Spain, 7 April 2011 - 9 April 2011, 597-607.
-
OntoGene (Team 65): preliminary analysis of participation in BioCreative III. In: BioCreative III workshop, Bethesda, Maryland, 13 September 2010 - 15 September 2010.
-
OntoGene in BioCreative II.5. IEEE/ACM Transactions on Computational Biology and Bioinformatics, 7(3):472-480.
-
Text Mining Methoden im Semantic Web. HMD Praxis der Wirtschaftsinformatik, (271):35-46.
-
Multi-verbal expressions of ‘giving’ in Old English and Old Irish. In: Corpus Linguistics Conference, Liverpool, UK, 20 July 2009 - 23 July 2009, 116.
-
Using a parser as a heuristic tool for the description of New Englishes. In: The Fifth Corpus Linguistics Conference, Liverpool, UK, 20 July 2009 - 23 July 2009, online.
-
UZurich in the BioNLP 2009 Shared Task. In: BioNLP 2009 Companion Volume: Shared Task on Event Extraction, NAACL/HLT, Boulder, Colorado, 4 June 2009 - 5 June 2009, 28-36.
-
Detecting protein-protein interactions in biomedical texts using a parser and linguistic resources. In: Gelbukh, Alexander. Computational Linguistics and Intelligent Text Processing. Berlin: Springer, 406-417.
-
A New Hybrid Dependency Parser for German. In: Chiarcos, Christian; de Castilho, Richard Eckart; Stede, Manfred. Von der Form zur Bedeutung: Texte automatisch verarbeiten / From Form to Meaning: Processing Texts Automatically. Proceedings of the Biennial GSCL Conference 2009. Tübingen: Narr, 115-124.
-
Parser-based analysis of syntax-lexis interactions. In: Jucker, Andreas H; Schreier, Daniel; Hundt, Marianne. Corpora: Pragmatics and Discourse. Amsterdam, The Netherlands: Rodopi, 477-502.
-
Detecting Protein-Protein Interactions in Biomedical Literature Using a Parser. In: Clematide, Simon; Klenner, Manfred; Volk, Martin. Searching Answers. Münster: MV Verlag, 109-118.
-
Fishing for compliments: precision and recall in corpus-linguistic compliment research. In: Jucker, Andreas H; Taavitsainen, Irma. Speech acts in the history of English. Amsterdam: John Benjamins, 273-294.
-
A Broad-Coverage, Representationally Minimalist LFG Parser: Chunks and F-Structures Are Enough. In: LFG05, Bergen, Norway, 18 July 2005 - 20 July 2005.