The goal of the project is to create an open-access dictionary of Croatian idioms based on data from a large electronic corpus. The resulting dictionary will serve as a gateway for a large number of users and researchers to current idiomatic usage of Croatian.
The dictionary is based on the Croatian web corpus hrWaC (1.2 billion words). Using a large electronic corpus to compile a dictionary is in line with one of the key principles of modern-day lexicography: we can obtain reliable linguistic data by observing language in use.
Parizoska, Jelena. “Idiom Modifications: What Grammar Reveals about Conceptual Structure”
34th International Conference of the Croatian Applied Linguistics Society Linguistic and extralinguistic in interaction, Faculty of Humanities and Social Sciences, University of Split (online), 26 September 2020 (invited)
Parizoska, Jelena; Filipović Petrović, Ivana. “Variation in idioms dictionaries: Lexicographer’s work in the post-editing era”
International Conference Electronic lexicography in the 21st century (eLex 2021): Post-editing lexicography. Brno, Czechia (online), 5 July 2021
Filipović Petrović, Ivana; Parizoska, Jelena. Corpora in Studies of Idioms and Idioms Dictionaries.
Participants: university lecturers. Croatian Academy of Sciences, Zagreb, 19 February 2020
Filipović Petrović, Ivana. Workshop The Future of Academic Lexicography: Linguistic Knowledge Codification in the Era of Big Data and AI.
Lorentz Center, Leiden, The Netherlands, 4–8 November 2019