PolMine Project
Data and Code for Corpus Analysis
GermaParl2 Constitution Day Release 2023
We are pleased to announce the public release of the GermaParl v2.0.0 corpus today, on Germany’s Constitution Day (May 23, 2023). With GermaParl2, all parliamentary debates of the German Bundestag from 1949 to 2021 become available in a comprehensively annotated format. The public release follows a...
GermaParl v2.0.0-beta.3 Release Note
A new GermaParl v2 beta version to improve usability. On May 23 last year (Germany’s Constitution Day), we released the first beta version of GermaParl v2, a major rework of the GermaParl Corpus of Plenary Protocols of the German Bundestag. The most obvious development is that v2 comprises all protocols...
The case for biglda: Topic modelling benchmarks
As I picked up developing the biglda R package again (a GitHub-only package, see the dev branch), I started to wonder: Is this really worth the effort? Pleasure and pain are mixed, trying to interface to Java via rJava for using Mallet...
RcppCWB v0.4.4 Released
A new RcppCWB version “Jaberwocky” (v0.4.4) just made it to CRAN. Initially, this release was meant to be a minor maintenance release to address a warning on paths in an example of the cwbtools package. The exercise went beyond that: a broader set of issues and bug reports have been...
cwbtools v0.3.3 'Hemicycle' brings Europarl closer.
As an immediate follow-up to cwbtools v0.3.2 “Il Postino”, a new cwbtools version (v0.3.3, “Hemicycle”) just made it to CRAN. This has become necessary to fix errors with the Solaris and Fedora test environments of CRAN that occurred because v0.3.2 expanded test coverage. Changes...