Jan Šnajder

Jan Šnajder, PhD
Associate Professor

Text Analysis and Knowledge Engineering Lab
Faculty of Electrical Engineering and Computing
University of Zagreb, Croatia

Phone: +385 1 6129 871
Email: jan (dot) snajder (at) fer (dot) hr

LinkedIn   Scholar

I am an Associate Professor at the Faculty of Electrical Engineering and Computing (FER) at the University of Zagreb and a member of the Text Analysis and Knowledge Engineering Lab (TakeLab). My research interests are in natural language processing (NLP), machine learning, and language technologies. My current focus is on lexical semantics, information extraction, and opinion mining. I am a fan of functional programming, Haskell in particular.

Short Bio

I received my MSc and PhD degrees in Computer Science from the University of Zagreb, Faculty of Electrical Engineering and Computing (UNIZG FER), Zagreb, Croatia in 2006 and 2010, respectively. From 2002 I worked as a research assistant, and since 2016 I have been working as an Associate Professor at UNIZG FER. In 2012 and 2013 I was a visiting researcher at the Department of Computational Linguistics at Heidelberg University. In 2015 I was a visiting researcher at the NICT in Kyoto, and in 2014 and 2015 a visiting researcher at the IMS, Stuttgart University. In 2016 I was a visiting researcher at the Department of Computing and Information Systems, University of Melbourne.

Curriculum vitae

Teaching

Publications

2016

2015

2014

2013

2012

2011

2010

2009

2008

2007 -

Software & Data

Students

Current PhD Students

  1. Domagoj Alagić - word sense modelling
  2. Filip Boltužić - argument recognition
  3. Vedran Galetić - prototype semantics
  4. Mladen Karan - semantic search
  5. Damir Korenčić (with Dr. Strahil Ristov) - content analysis
  6. Tamara Sladoljev-Agejev (with Dr. Svjetlana Kolić-Vehovec) - discourse analysis
  7. Milan Pavlović - threat detection
  8. Leon Rotim - educational text analysis
  9. Frane Šarić - multi-document summarization
  10. Martin Tutek - event modelling

Current MA Students

  1. Ana Brassard
  2. Filip Čulinović
  3. Luka Dulčić
  4. Bartol Freškura
  5. Bruno Gavranović
  6. Martin Gluhak
  7. Paula Gombar
  8. Marin Kačan
  9. Tin Kuculo
  10. Tomislav Marinković
  11. Stipan Mikulić
  12. Mihael Nikić
  13. Ivan Paljak
  14. Matej Paradžik
  15. Tena Perak
  16. Ante Pocedulić
  17. Ivan Sekulić
  18. Luka Skukan

Current BA Students

  1. Marin Kukovačec
  2. Toni Kukurin
  3. David Lozić
  4. Juraj Malenica
  5. Ivan Mršić
  6. Lukrecija Puljić
  7. Filip Šaina
  8. Antonio Šajatović
  9. Doria Šarić
  10. Ivan Tokić

Completed PhD Theses

  1. Goran Glavaš. Text Information and Retrieval Based on Event Graphs. 2014.

Completed MA Theses

  1. Maja Buljan. Multiword Identification Based on the Combination of Linguistic Features. 2016.
  2. Vjeran Crnjak. Learning to Search for Solving Natural Language Processing Tasks. 2016.
  3. Zoran Medić. Compositional Distributional Semantics Based on the Lexical Function Model. 2016.
  4. Dino Radaković. A Joint Model for Named Entity Relation Extraction. 2016.
  5. Sven Vidak. Deep Learning for Language Modeling of the Croatian Language. 2016.
  6. Toni Antunović. Automated Extraction of Bilingual Lexicons Based on Semantic Vector Spaces. 2015.
  7. Krešimir Baksa. Shallow Semantic Parsing of Croatian Texts. 2015.
  8. Dino Dolović. Sentiment Analysis in Tweets in Croatian Language. 2015.
  9. Goran Gašić. Deep Learning of Word Embeddings for Tagging Models for Croatian Texts. 2015.
  10. Lana Lisjak. Recognizing Textual Entailment in Croatian Texts. 2015.
  11. Hermina Petric Maretić. Project Proposals Analysis using Statistical Natural Language Processing. 2015.
  12. Mihael Šafarić. Feature Selection and Document Representation Methods for Text Classification. 2015.
  13. Petra Almić. A Model for Determining Semantic Compositionality of Croatian Multi-Word Expressions. 2014.
  14. Marko Bekavac. Word Sense Induction and Discrimination Model for Croatian Words. 2014.
  15. Petra Bevandić. Optimizing Dependency Parsing Parameters for Croatian Language. 2014.
  16. Siniša Biđin. Using Deep Learning for Sentiment Analysis of Croatian Expressions. 2014.
  17. Luka Krajcar. Sentiment Analysis of Tweets in Croatian Language. 2014.
  18. Lovro Rožić (with Mladen Vuković). Functional Programming. 2014.
  19. Martin Tutek. Multi-label Document Classification using EuroVoc Thesaurus. 2014.
  20. Leo Zuanović. Recurrent Neural Network Based Model of Croatian Language. 2014.
  21. Filip Petkovski. Application of Partial Membership Models to Keyphrase Extraction from Croatian Documents. 2013.
  22. Tin Franović. Classification of Email Importance Based on Speech Acts. 2013.
  23. Matija Hanževački. Coreference Resolution in Croatian Texts. 2013.
  24. Josip Bakić. Automatic Content Extraction from Web Pages. 2012.
  25. Sonja Grđan. Application of Machine Learning Methods for EEG-Based Brain-Computer Interface. 2012.
  26. Ante Kegalj. Sentiment Analysis Based on Prior Word Polarity. 2012.
  27. Ivan Krišto. Using Machine Learning Methods to Improve Document Retrieval. 2012.
  28. Tomislav Lombarović. Named Entity Recognition and Classification for Text in Croatian Language. 2012.
  29. Mladen Marović. Event and Temporal Relation Extraction in Croatian Language Texts. 2012.
  30. Hrvoje Peradin. Constraint Grammar-based Parsing of Croatian Texts. 2012.
  31. Veljko Srdarević. Text Report Generation Based on Structured Data. 2012.
  32. Fran Dragomanović (with Prof. Bojana Dalbelo Bašić). Acronym Extraction in Croatian Language. 2011.
  33. Zoran Hranj. Unsupervised Coreference Resolution. 2011.
  34. Vedrana Janković. Computational Models of Distributional Lexical Semantics in Croatian Language. 2011.
  35. Ivan Kmetović. Matching Co-referent Named Entities Using Machine Learning. 2011.
  36. Slavko Kručaj. Applying Machine Learning Methods to User Review Summarization. 2011.
  37. Ivan Kusalić. Application of Topic Models to Analysis of Croatian Documents. 2011.
  38. Ognjen Lajšić. Grammar and Style Checker for Croatian Language. 2011.
  39. Vladimir Manzin. Computer Agents for Poker. 2011.
  40. Vjekoslav Osmann. Tagging Parts of Speech in Croatian Texts. 2011.
  41. Paško Pajdek. Deep Generative Models for Semantic Document Clustering. 2011.
  42. Josip Saratlija. Unsupervised Parser for Croatian Language. 2011.
  43. Nikola Šantić. Automatic Paraphrasing of Croatian Expressions and Sentences. 2011.
  44. Matea Biočić (with Prof. Bojana Dalbelo Bašić). Word Sense Discrimination Using Expectation Maximization Algorithm. 2010.
  45. Zlatan Hot (with Prof. Bojana Dalbelo Bašić). A Stemming Algorithm Based on String Clustering. 2010.
  46. Marin Japec (with Prof. Bojana Dalbelo Bašić). System for Organizing and Sharing Knowledge Based on Topic Maps. 2010.
  47. Matija Lacković (with Prof. Bojana Dalbelo Bašić). Program Environment for Execution of Tournaments for Game Playing Algorithms. 2010.
  48. Nikola Novak (with Prof. Bojana Dalbelo Bašić). Implementation of a Game Simulator and Checkers Game-playing Algorithms. 2010.
  49. Ivan Šolta (with Prof. Bojana Dalbelo Bašić). Determining Semantic Orientation of Subjective Words and Phrases. 2010.
  50. Davor Delač (with Prof. Bojana Dalbelo Bašić). Collocation Extraction from Corpus. 2009.
  51. Lovro Žmak (with Prof. Bojana Dalbelo Bašić). FAQ Retrieval System for Croatian Language. 2009.
  52. Srđan Vuković (with Prof. Bojana Dalbelo Bašić). A Heuristic Algorithm for Matching of Address Data. 2008.

Completed BA Theses

  1. Bartol Freškura. Application of Deep Learning for Stance Detection in User Comments. 2016.
  2. Bruno Gavranović. Application of Deep Learning for Sentiment Analysis. 2016.
  3. Filip Hrenić. Detection of Inappropriate Messages in Online Chats. 2016.
  4. Marin Kačan. Detecting Lexical Transfer Errors of Second Language Learners. 2016.
  5. Mihael Nikić. Application of Machine Learning for Topic-Based Sentiment Analysis. 2016.
  6. Stipan Mikulić. Use of Distributional Semantic Models in the Word Association Game. 2016.
  7. Filip Čulinović. Acquisition of Verb Classes from Corpus using Unsupervised Machine Learning. 2015.
  8. Paula Gombar. Contextual Sentiment Analysis of Croatian Expressions. 2015.
  9. Ivan Paljak. Stance Classification and Analysis in Online User Comments. 2015.
  10. Ivan Sekulić. Extraction of Semantic Verb Relations from Croatian Corpora. 2015.
  11. Jura Šlosel. Entity-Based Coherence Model for Croatian Texts. 2015.
  12. Vjeran Crnjak. Part-of-Speech Tagging for Croatian using Conditional Random Fields. 2014.
  13. Stjepan Glavina. Machine Learning of Document Classification Rules. 2014.
  14. Zoran Medić. Quotation Extraction from News Stories in Croatian Language. 2014.
  15. Matej Paradžik. Semi-Supervised Acquisition of Sentiment Polarity Lexicon. 2014.
  16. Dino Radaković. Applying Semantic Kernel Functions in Text Classification. 2014.
  17. Luka Skukan. Temporal Expression Tagging for Croatian Texts. 2014.
  18. Sandra Trkulja. Feature Construction and Selection for Document Classification in Croatian Language. 2014.
  19. Sven Vidak. Offensive Text Detection using Machine Learning Methods. 2014.
  20. Ivana Balažević. Document Clustering Using Self-organizing Neural Networks. 2012.
  21. Marko Bekavac. Application of Genetic Programming in Keyphrase Extraction. 2012.
  22. Petra Bevandić. Automatic Natural Language Identification. 2012.
  23. Goran Gašić. Automatic Tagging of Croatian Newswire Articles. 2012.
  24. Luka Krajcar. Error Correction in Texts Produced by Speech Recognition of Croatian. 2012.
  25. Zolik Nemet. Extraction of Acronyms from Corpus of Texts in Croatian Language. 2012.
  26. Roko Pancirov. Automatic Extraction of Bilingual Dictionaries Based on Wikipedia. 2012.
  27. Martin Tutek. Using Wikipedia for Automatic Word Sense Disambiguation. 2012.
  28. Leo Zuanović. Machine Learning of Croatian Lemmatization Rules. 2012.
  29. Siniša Biđin. A Controlled Natural Language Parser. 2011.
  30. Matija Hanževački. Temporal Expression Tagging in Croatian Texts. 2011.
  31. Ante Kegalj (with Prof. Bojana Dalbelo Bašić). Automated Sentence Boundary Detection. 2010.
  32. Tomislav Lombarović (with Prof. Bojana Dalbelo Bašić). Question Type Classification for Information Retrieval Systems. 2010.
  33. Mladen Marović (with Prof. Bojana Dalbelo Bašić). OCR Error Correction. 2010.
  34. Mladen Mikša (with Prof. Bojana Dalbelo Bašić). Correction of Merged Words Errors in Texts Obtained by Optical Character Recognition. 2010.
  35. Veljko Srdarević (with Prof. Bojana Dalbelo Bašić). Building a Stemming Algorithm Using Genetic Programming. 2010.
  36. Zoran Hranj (with Prof. Bojana Dalbelo Bašić). Structure-Based Web Page Comparison Algorithm. 2009.
  37. Ivan Karačić (with Prof. Bojana Dalbelo Bašić). Word Sense Discrimination. 2009.
  38. Ivan Kmetović (with Prof. Bojana Dalbelo Bašić). Keyword Extraction from Text Using Decision Trees. 2009.
  39. Ivan Krišto (with Prof. Bojana Dalbelo Bašić). Web Page Cleaning Techniques for Text Mining. 2009.
  40. Ognjen Lajšić (with Prof. Bojana Dalbelo Bašić). OCR Error Correction. 2009.
  41. Josip Saratlija (with Prof. Bojana Dalbelo Bašić). Keyword Extraction Based on Document Clustering. 2009.
  42. Nikola Šantić (with Prof. Bojana Dalbelo Bašić). Automatic Diacritics Restoration in Croatian Texts. 2009.
  43. Igor Šoš (with Prof. Bojana Dalbelo Bašić). Client Side of Distributed Linguistic Resource Annotator. 2009.
  44. Marin Japec (with Prof. Bojana Dalbelo Bašić). Dialogue System in Croatian Language. 2008.
  45. Željko Rumenjak (with Prof. Bojana Dalbelo Bašić). Distributed Linguistic Resource Annotator. 2008.
  46. Ivan Šolta (with Prof. Bojana Dalbelo Bašić). Query Correction Based on Levenshtein Distance. 2008.
