Authorship Authentication of Short Messages from Social Networks Machines

Nesibe Merve Demir; Mehmet Can

doi:10.21533/scjournal.v7i1.148

Authorship Authentication of Short Messages from Social Networks Machines

Nesibe Merve Demir, Mehmet Can

Abstract

Dataset consists of 17000 tweets collected from Twitter, as 500 tweets for each of 34 authors that meet certain criteria. Raw data is collected by using the software Nvivo. The collected raw data is preprocessed to extract frequencies of 200 features. In the data analysis 128 of features are eliminated since they are rare in tweets. As a progressive presentation, five – fifteen – twenty – twenty five – thirty and thirty four of these authors are selected each time. Since recurrent artificial neural networks are more stable and in general ANNs are more successful distinguishing two classes, for N authors, N×N neural networks are trained for pair wise classification. These experts then organized in N competing teams (CANNT) to aggregate decisions of these NXN experts. Then this procedure is repeated seven times and committees with seven members voted for final decision. By a commonest type voting, the accuracy is boosted around ten percent. Number of authors is seen not so effective on the accuracy of the authentication, and around 80% accuracy is achieved for any number of authors.

Keywords

Authorship Authentication; short massages; committee machines; recurrent neural network

Full Text:

PDF

DOI: http://dx.doi.org/10.21533/scjournal.v7i1.148

Refbacks

There are currently no refbacks.

Digital Object Identifier DOI: 10.21533/scjournal

This work is licensed under a Creative Commons Attribution 4.0 International License

Username
Password
Remember me

Southeast Europe Journal of Soft Computing

Authorship Authentication of Short Messages from Social Networks Machines

Abstract

Keywords

Full Text:

Refbacks