The case for biglda: Topic modelling benchmarks

    The case for biglda: Topic modelling benchmarks As I picked up developing the biglda R package again (a GitHub-only package, see dev branch here), I started to wonder: Is this is really worth the effort? Pleasure and pain are mixed, trying to interface to Java via rJava for using Mallet...


    Rcppcwb V0.4.4 Released

    A new RcppCWB version “Jaberwocky” (v0.4.4) just made it to CRAN. Initially, this release was meant to be a minor maintenance release to address a warning on paths in an example of the cwbtools package. The exercise went beyond that, a broader set of issues and bug reports have been...


    cwbtools v0.3.3 'Hemicycle' brings Europarl closer.

    cwbtools v0.3.3 ‘Hemicycle’ brings Europarl closer. As an immediate follow-up to cwbtools v0.3.2 “Il Postino”, a new cwbtools version (v0.3.3, “Hemicycle”) just made it to CRAN. This has become necessary to fix errors with the Solaris and Fedora test environments of CRAN that occurred because v0.3.2 expanded test coverage. Changes...


    cwbtools v0.3.2 'Il Postino' solidifies corpus download.

    RcppCWB v0.3.2 ‘Il Postino’ solidifies corpus download. A new, previously unknown bug users reported when trying to use the GermaParl package to download and install the GermaParl corpus (function germaparl_install_corpus()) generated some urgency to release of a new version of the cwbtools R package: We suddenly saw errors (false positives!)...


    RcppCWB v0.3.2 'Dune Ride' ensures Apple Silicon compatibility.

    RcppCWB v0.3.2 ‘Dune Ride’ ensures Apple Silicon compatibility. Apple’s new M1 chip has deservedly raised significant attention. The new Mac minis and 13 inch Macbook Pros running on “Apple Silicon” are fast, energy-saving and affordable at the same time. No doubt an increasing number of polmineR users will work on...