commit 98bc43593c15f1502b75787595a816a70109fe44
parent ab15d0621f8f4036b9be198be68e46c665922439
Author: Stefan Koch <programming@stefan-koch.name>
Date: Sat, 1 Dec 2018 15:27:51 +0100

add first tools from ffzg

Diffstat:
M README.md | 36 ++++++++++++++++++++++++++++++++++--
1 file changed, 34 insertions(+), 2 deletions(-)
diff --git a/README.md b/README.md
@@ -1,2 +1,34 @@
-# awesome-croatian-nlp
-A curated list of free resources for Croatian Natural Language Processing
+# Awesome NLP Resources for Croatian
+
+> A curated list of *free* resources dedicated to Croatian
+> Natural Language Processing
+>
+> Maintainers - [Stefan Koch](https://github.com/aufziehvogel)
+
+## Tools and/or Models
+
+### Named Entity Recognition
+
+- [Named entity recognition (ffzg)](http://nlp.ffzg.hr/resources/models/ner/)
+
+### Stemmers
+
+- [Rule-based stemmer for Croatian (ffzg)](http://nlp.ffzg.hr/resources/tools/stemmer-for-croatian/)
+- [Rule-based stemmer for Croatian (nltk-compliant)](https://eliteinformatiker.de/2015/05/15/rewriting-university-of-zagrebs-croatian-stemmer-to-a-nltk-compliant-class)
+
+### Taggers
+
+- [Tagging model for hunpos tagger](http://nlp.ffzg.hr/resources/models/tagging/)
+
+
+## Datasets
+
+### Corpora
+
+- [SETimes: Parallel English and South-East European Corpus](http://nlp.ffzg.hr/resources/corpora/setimes/)
+- [hrWaC: Croatian Web Corpus](http://nlp.ffzg.hr/resources/corpora/hrwac/)
+- [SETimes.HR+ Croatian dependency treebank](https://github.com/ffnlp/sethr)
+
+## Organisations
+
+- [NLP Group at Filozofki fakultet at University of Zagreb](http://nlp.ffzg.hr/)