Update for GermaParl2 – Improving Corpus Quality and a Look Ahead

    Update for GermaParl2 – Improving Corpus Quality and a Look Ahead We always envisioned GermaParl as an evolving resource. Since some issues only become apparent during productive work, the continuous provision of releases aimed at improving the corpus was always part of our roadmap. Accordingly, over the last months, we...


    GermaParl2 Constitution Day Release 2023

    GermaParl2 Constitution Day Release 2023 We are pleased to announce the public release of GermaParl v2.0.0 corpus today, on Germany’s Constitution Day (May 23, 2023). With GermaParl2, all parliamentary debates of the German Bundestag from 1949 to 2021 become available in a comprehensively annotated format. The public release follows a...


    GermaParl v2.0.0-beta.3 Release Note

    A new GermaParl v2 beta version to improve usability On May 23 last year (Germany’s Constitution Day), we released the first beta version of GermaParl v2, a major rework of the GermaParl Corpus of Plenary Protocols of the German Bundestag. The most obvious development is that v2 comprises all protocols...


    The case for biglda: Topic modelling benchmarks

    The case for biglda: Topic modelling benchmarks As I picked up developing the biglda R package again (a GitHub-only package, see dev branch here), I started to wonder: Is this is really worth the effort? Pleasure and pain are mixed, trying to interface to Java via rJava for using Mallet...


    Rcppcwb V0.4.4 Released

    A new RcppCWB version “Jaberwocky” (v0.4.4) just made it to CRAN. Initially, this release was meant to be a minor maintenance release to address a warning on paths in an example of the cwbtools package. The exercise went beyond that, a broader set of issues and bug reports have been...