I found it really interesting how you were able to collect all this data using the magic you call “HiveSQL” — lol.
On the language side, I’m fluent in both Spanish and English (I live in Argentina), so I’m actually thinking of running similar analytics on the Spanish-speaking community to see if the same patterns show up there. Could be cool to compare and see if the issue of low-value comments translates across languages just as clearly. What do you think?
RE: Text analytics reveal thirty two percent of comments on hive are not unique and at least ten percent add no value to discussion