Skip to content

Source code and dataset about Hashtags, Emotions, and Comments: A Large-Scale Dataset to Understand Fine-Grained Social Emotions to Online Topics

Notifications You must be signed in to change notification settings

polyusmart/HEC-Dataset

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

7 Commits
 
 

Repository files navigation

HEC-Dataset

This is Dataset of our paper Hashtags, Emotions, and Comments: A Large-Scale Dataset to Understand Fine-Grained Social Emotions to Online Topics [pdf]

You can download it from OneDrive

Files

There are two files of our datasets:

  1. hashtag&emotion.txt It contains hashtags and their emotion votes.
  2. hashtag&comment.txt It contains user comments involved in the discussion initialized by a hashtag.

Data structure

The data structure is described in the following.

  1. hashtag&emotion.txt Each line consists of six fields: , , <Total # of Voters>, <Rank 1 Emotion>, <Rank 2 Emotion>, <Rank 3 Emotion>. Fields are devided by Tab. The top three emotions shown with the emoji (in []) and # of voters, seperated with a colon.
  2. hashtag&comment.txt Each line consists of three files: , , . Fields are divided by Tab.

Important Notes:

  1. Both hashtag&emotion.txt and hashtag&comment are in Chinese and encoded with UTF-8.
  2. The data of our dataset is sorted in alphabetical order.
  3. The dataset is released under a Creative Commons Attribution 3.0 Unported License (http://creativecommons.org/licenses/by/3.0/).

About

Source code and dataset about Hashtags, Emotions, and Comments: A Large-Scale Dataset to Understand Fine-Grained Social Emotions to Online Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published