Bert Model for Social Media Bot Detection

dc.contributor.authorHeidari, Maryam
dc.contributor.authorJones, James H Jr.
dc.date.accessioned2022-03-11T01:23:06Z
dc.date.available2022-03-11T01:23:06Z
dc.date.issued2022-03
dc.description.abstractMillions of online posts about different topics and products are shared on popular social media platforms. One use of this content is to provide crowd-sourced information about a specific topic, event, or product. However, this use raises an important question: what percentage of the information available through these services is trustworthy? In particular, might some of this information be generated by a machine, i.e., a ``bot" instead of a human? Bots can be, and often are, purposely designed to generate enough volume to skew an apparent trend or position on a topic, yet the consumer of such content cannot easily distinguish a bot post from a human post. This paper introduces a new model that uses Bidirectional Encoder Representations from Transformers (Google Bert) for sentiment classification of tweets to identify topic-independent features for the social media bot detection model. Using a Natural Language Processing approach to derive topic-independent features for the new bot detection model distinguishes this work from previous bot detection models. We achieve 94\% accuracy classifying the contents of data set Cresci \cite{cresci-etal-2017-paradigm}as generated by a bot or a human, where the most accurate prior work achieved an accuracy of 92\%.
dc.identifier.urihttps://hdl.handle.net/1920/12756
dc.rightsAttribution-NonCommercial-ShareAlike 3.0 United States
dc.rights.urihttps://creativecommons.org/licenses/by-nc-sa/3.0/us/
dc.subjectBot detection
dc.subjectNatural Language Processing
dc.subjectNeural Network
dc.subjectSocial Media
dc.titleBert Model for Social Media Bot Detection
dc.typeWorking Paper

Files

Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
bertf.pdf
Size:
321.69 KB
Format:
Adobe Portable Document Format
Description:
The paper
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
2.52 KB
Format:
Item-specific license agreed upon to submission
Description: