Skip to content
Change the repository type filter

All

    Repositories list

    • texton

      Public
      Text Tonsorium - a toolbox that automatically arranges NLP tools in workflows and enacts them with user's inputs
      PHP
      0420Updated Nov 15, 2024Nov 15, 2024
    • Binary executable files used by services in the Text Tonsorium.
      0000Updated Nov 14, 2024Nov 14, 2024
    • Web-based workflow management system that computes candidate tool workflows given input file(s) and the user's requirements regarding the output. Afterwards, runs a workflow selected by the user from the list of candidates. Implemented in Bracmat (~75%) and Java (~25%).
      Java
      GNU General Public License v3.0
      2200Updated Nov 8, 2024Nov 8, 2024
    • Docker setups for all Korp installations maintained by NorS.
      JavaScript
      MIT License
      0131Updated Oct 21, 2024Oct 21, 2024
    • Navigate and manipulate Hiccup documents.
      Clojure
      0110Updated Oct 15, 2024Oct 15, 2024
    • Convert XML into Hiccup in Clojure and ClojureScript.
      Clojure
      MIT License
      12100Updated Sep 11, 2024Sep 11, 2024
    • The Switchboard Tool Registry
      Python
      GNU General Public License v3.0
      14000Updated Sep 9, 2024Sep 9, 2024
    • The life of Louis Hjelmslev.
      Clojure
      14250Updated Aug 30, 2024Aug 30, 2024
    • DanNet

      Public
      The Danish WordNet as an RDF graph.
      Clojure
      MIT License
      020300Updated Aug 9, 2024Aug 9, 2024
    • cstlemma

      Public
      Lemmatiser for Danish, Dutch, English, German, Polish, Romanian, Russian and tens of other languages, that uses affix rules (affix: prefix, infix, suffix, circumfix). Rules are obtained by supervised learning from a full form - lemma list.
      C++
      GNU General Public License v2.0
      73520Updated Jul 23, 2024Jul 23, 2024
    • trigram

      Public
      Extract trigrams from lemmas in tab separated full form/lemma list. Count the number of occurences of each trigram. Spaces are added before and after each lemma.
      GNU General Public License v3.0
      0000Updated Jul 22, 2024Jul 22, 2024
    • Linguistic resources for several of the tools included in the Text Tonsorium
      Roff
      0100Updated Jul 19, 2024Jul 19, 2024
    • Other
      0000Updated Jun 7, 2024Jun 7, 2024
    • Using supervised learning, create a set of affix rules for use by the CSTlemma lemmatiser.
      C++
      GNU General Public License v2.0
      0400Updated Mar 5, 2024Mar 5, 2024
    • Clojure
      1030Updated Jan 31, 2024Jan 31, 2024
    • dk5

      Public
      Fetch the DK5 dataset and store it as EDN.
      Clojure
      0000Updated Jan 4, 2024Jan 4, 2024
    • taggerXML

      Public
      Modernized version of Eric Brill's Part Of Speech tagger.
      C++
      GNU General Public License v2.0
      61710Updated Dec 15, 2023Dec 15, 2023
    • ParlaMint

      Public
      ParlaMint: Comparable Parliamentary Corpora
      GLSL
      53000Updated Sep 11, 2023Sep 11, 2023
    • rum

      Public
      Simple, decomplected, isomorphic HTML UI library for Clojure and ClojureScript
      HTML
      Eclipse Public License 1.0
      126000Updated Aug 14, 2023Aug 14, 2023
    • jerk

      Public
      Analyses the movement of two points in x-y plane, in casu nose tips data from OpenPoseDemo.exe, and computes velocity, acceleration and jerk of the points.
      C++
      0000Updated Aug 3, 2023Aug 3, 2023
    • aristotle

      Public
      RDF, SPARQL and OWL for Clojure
      Clojure
      Apache License 2.0
      17100Updated Aug 2, 2023Aug 2, 2023
    • Web service that wraps around Bernd Bohnet's graph based parser
      Java
      GNU General Public License v3.0
      0000Updated Mar 31, 2023Mar 31, 2023
    • qname

      Public archive
      A QName record and conversions between QNames, Keywords, and IRI strings.
      Clojure
      0000Updated Mar 31, 2023Mar 31, 2023
    • Webservice that wraps around the mate POS tagger
      Java
      GNU General Public License v3.0
      0000Updated Mar 30, 2023Mar 30, 2023
    • regionh

      Public
      Scrape JSON data from a list of Region Hovestaden URLs (with permission).
      Clojure
      0000Updated Mar 27, 2023Mar 27, 2023
    • Webservice that wraps around the OpenNLP POS tagger
      Java
      GNU General Public License v3.0
      0000Updated Mar 24, 2023Mar 24, 2023
    • The implementation of fcs-korp-endpoint running on Alf.
      Java
      GNU General Public License v3.0
      5000Updated Mar 20, 2023Mar 20, 2023
    • Turn a Pedestal web service into a SAML Service Provider.
      Clojure
      MIT License
      0740Updated Mar 9, 2023Mar 9, 2023
    • Functions for upper/lower casing, for testing whether a character is a letter and for conversion between Unicode encodings UTF-8 and UTF-16
      C
      GNU General Public License v2.0
      1200Updated Jan 30, 2023Jan 30, 2023
    • semdax

      Public
      Sense-annotated corpora from the Semantic Processing Across Domains project
      Other
      1200Updated Dec 8, 2022Dec 8, 2022