hatefile contains only the sentences that are considered ashate speech.nohatefile contains only the sentences that are not considered ashate speech.- Both
hateandnohatesentences can be found in thehate_speechfile.
Note that in each file, one line is equivalent to one sentence.
The initiale dataset was downloaded from the Aitor García Pablos GitHube website.
"These files contain text extracted from Stormfront, a white supremacist forum. A random set of forums posts have been sampled from several subforums and split into sentences. Those sentences have been manually labelled as containing hate speech or not, according to certain annotation guidelines". Aitor García Pablos