I created this NLC Ground Truth Wonderland
which basically is full of a bunch of CSV files (Ground Truth files) that can be used to quickly train
plus an R Script for it (though you can CURL it too)
I love NLC. THe "IBM Watson™ Natural Language Classifier uses machine learning algorithms to return the top matching predefined classes for short text input. You create and train a classifier to connect predefined classes to example texts so that the service can apply those classes to new inputs."
If you want a quick a dirty classifier in less than 30m. from your own GT - recommend giving it a go.
Alice seeks ground truth. (Souce: YouTube)
About this blog
This is an informal blog that explores tools, code and tricks that group members have developed to engage IBM Watson cognitive computing services - from the R Programming Language. Packages include RCURL to access Watson APIs - for services that include Natural Language Classifier and Speech to Text. THIS IS MY PERSONAL BLOG - it does not represent the views of my employer. Code is presented as 'use at your own risk' (it has lots of bugs)
Created: September 13, 2015English