Dictionary of Croatian Idioms

A born-digital, corpus-driven, open-access dictionary of Croatian idioms


Project goal

The goal of the project is to create an open-access dictionary of Croatian idioms based on data from a large electronic corpus. The resulting dictionary will serve as a gateway for a large number of users and researchers to current idiomatic usage of Croatian.


The dictionary is based on the Croatian web corpus hrWaC (1.2 billion words). Using a large electronic corpus to compile a dictionary is in line with one of the key principles of modern-day lexicography: we can obtain reliable linguistic data by observing language in use.

27 September

Parizoska, Jelena. “Idiom Modifications: What Grammar Reveals about Conceptual Structure”

34th International Conference of the Croatian Applied Linguistics Society Linguistic and extralinguistic in interaction, Faculty of Humanities and Social Sciences, University of Split (online), 26 September 2020 (invited)

07 July

Parizoska, Jelena; Filipović Petrović, Ivana. “Variation in idioms dictionaries: Lexicographer’s work in the post-editing era”

International Conference Electronic lexicography in the 21st century (eLex 2021): Post-editing lexicography. Brno, Czechia (online), 5 July 2021

19 February

Filipović Petrović, Ivana; Parizoska, Jelena. Corpora in Studies of Idioms and Idioms Dictionaries.

Participants: university lecturers. Croatian Academy of Sciences, Zagreb, 19 February 2020

04 November

Filipović Petrović, Ivana. Workshop The Future of Academic Lexicography: Linguistic Knowledge Codification in the Era of Big Data and AI.

Lorentz Center, Leiden, The Netherlands, 4–8 November 2019