A recent URL received via FB via TW from Jose Carlos Cortizo has driven my attention into Social Network Datasets. It begins with the Social Tagging list by Markus Strohmaier at his blog Intentialicious. But the comments include several other lists of datasets, most prominently:
What in fact leads me to make the question: do opinion mining datasets apply? OK, it is not Social Tagging (at least, not thematic tags). So I share the two only Opinion Mining/Sentiment Analysis datasets in Spanish I am aware of:
- Movie review corpus in Spanish by Fermín L. Cruz Mata
- The SFU Spanish Review Corpus, by Maite Taboada & Julian Brooke
And after a quick search on Opinion Mining datasets, I have found the blog by Bruno Ohana with a very interesting post which presents a short tutorial about Opinion Mining with Rapid Miner. While in fact it is not really Opinion Mining (as it does not use any sentiment features, it just approaches the task as any classification task: bag of words, etc.), I see it very interesting because it is a great tutorial on using this suite to make text classification!