A survey of multilingual human-tagged short message datasets for sentiment analysis tasks