spam

Ľ. Štúr Institute of Linguistics

Slovak Academy of Sciences

po slovensky
About us
Our mission Employees Structure Internal documents (sk) Gender Equality Plan GDPR
Research
Projects (sk) Ph.D. study (sk) Conferences
Resources
Dictionary portal Language counseling (sk) Slovak National Corpus Publications Journals Databases and tools
SLS
Contact (sk)
🔍
Search:

Databases, Portals and Tools

Portals and Databases

  • Dictionary portal
  • Terminology portal
  • Lexika slovenských terénnych názvov
  • Etymologická databáza slovenskej lexiky
  • “Retrográdny slovník súčasnej slovenčiny” – Web Portal
  • Slovak WordNet
  • Frequencies and ARF (Araneum Slovacum VII Maximum) dataset

Corpora

  • Slovak National Corpus
  • ARANEA corpora
  • Slovak Legislative Corpus
  • Corpus of Court Decisions
  • Error Corpus of Slovak “CHIBY”
  • Corpus of the journal “Slovenská reč”
  • Corpus of Rusyn Wikipedia
  • HPLT web corpus
  • Synthetic corpus of Slovak generated by a LLM
  • Synthetic parallel Slovak-Czech-English corpus generated by a LLM
  • Slovak web corpus ARANEUM + HPLT + FineWeb2

Tools

  • mistral-sk-7b, generative Slovak LLM
  • Lemmatization, Morphological Analysis and Disambiguation
  • Lemmatization, Morphological Analysis and Disambiguation (Slovak written without diacritics)
  • Word embeddings
  • Paraphrase Slovak (and Czech)
  • Rekonstruction of Diacritics
  • Vitvorťe si Štúrovskuo meno
  • Named Entity Recognition, Demo
  • Machine Translation of Slovak into the L. Štúr version
  • Timeline of word occurrences in the corpus
  • Visualization of Collocations
  • Transliteration of Slovak or Czech into Glagolitics
Ľ. Štúr Institute of Linguistics, Slovak Academy of Sciences, Panská 26, 811 01 Bratislava, Slovakia, phone: +421 2 5443 1761, f X