research-article

Using Supervised Classification to Detect Political Tweets with Political Content

Authors:
Brunna de Sousa Pereira Amorim

Federal University of Campina Grande, Campina Grande, Paraíba, Brazil

Federal University of Campina Grande, Campina Grande, Paraíba, Brazil
View Profile

,
André Luiz Firmino Alves

Federal Institute of Education, Science and Technology of Ceará, Iguatu, Ceará, Brazil

Federal Institute of Education, Science and Technology of Ceará, Iguatu, Ceará, Brazil
View Profile

,
Maxwell Guimarães de Oliveira

Federal University of Cariri, Juazeiro do Noirte, Ceará, Brazil

Federal University of Cariri, Juazeiro do Noirte, Ceará, Brazil
View Profile

,
Cláudio de Souza Baptista

Federal University of Campina Grande, Campina Grande, Paraíba, Brazil

Federal University of Campina Grande, Campina Grande, Paraíba, Brazil
View Profile

WebMedia '18: Proceedings of the 24th Brazilian Symposium on Multimedia and the WebOctober 2018Pages 245–252https://doi.org/10.1145/3243082.3243113

Published:16 October 2018Publication History

WebMedia '18: Proceedings of the 24th Brazilian Symposium on Multimedia and the Web

Pages 245–252

ABSTRACT

Social media platforms have been increasingly used by modern society. In most platforms, users usually share content on various subjects and, in particular, politics is a favorite one. There are many interests in detecting and analyzing such a political content. However, there is a challenge in the process of detecting specific subjects from social media data mainly due to its informality. In this paper, we propose and compare two techniques, based on supervised classification, for the detection of tweets with political content. The results obtained by our approach have demonstrated satisfactory performance, which motivates further research to be undertaken.

References

Abebe Abeshu and Naveen Chilamkurti. 2018. Deep learning: the frontier for distributed attack detection in Fog-to-Things computing. IEEE Communications Magazine 56, 2 (2018), 169--175. Google ScholarDigital Library
Charu C. Aggarwal and ChengXiang Zhai. 2012. A Survey of Text Clustering Algorithms. Springer US, Boston, MA, 77--128.Google Scholar
Liliya Akhtyamova, John Cardiff, and Andrey Ignatov. 2017. Twitter Author Profiling Using Word Embeddings and Logistic Regression. Proceedings of Conference and Labs of the Evaluation Forum - CLEF 2017.Google Scholar
Ika Alfina, Dinda Sigmawaty, Fitriasari Nurhidayati, and Achmad Nizar Hidayanto. 2017. Utilizing Hashtags for Sentiment Analysis of Tweets in The Political Domain. In Proceedings of the 9th International Conference on Machine Learning and Computing. ACM, 43--47. Google ScholarDigital Library
Andre Luiz Firmino Alves, Claudio De Souza Baptista, Anderson Almeida Firmino, Maxwell Guimaraes De Oliveira, and Anselmo Cardoso De Paiva. 2014. A Comparison of SVM Versus Naive-Bayes Techniques for Sentiment Analysis in Tweets. In Proceedings of the 20th Brazilian Symposium on Multimedia and the Web - WebMedia 14.Google ScholarDigital Library
Rafael T. Anchieta and Raimundo S. Moura. 2017. Exploring Unsupervised Learning Towards Extractive Summarization of User Reviews. In Proceedings of the 23rd Brazillian Symposium on Multimedia and the Web - WebMedia 17. Google ScholarDigital Library
Matheus Araujo, Julio Reis, Adriano Pereira, and Fabricio Benevenuto. 2016. An evaluation of machine translation for multilingual sentence-level sentiment analysis. In Proceedings of the 31st Annual ACM Symposium on Applied Computing - SAC 16. Google ScholarDigital Library
Farzindar Atefeh and Wael Khreich. 2015. A survey of techniques for event detection in twitter. Computational Intelligence 31, 1 (2015), 132--164. Google ScholarDigital Library
Ricardo Baeza-Yates, Berthier Ribeiro-Neto, et al. 1999. Modern information retrieval. Vol. 463. ACM press New York. Google ScholarDigital Library
Piotr Bojanowski, Edouard Grave, Armand Joulin, and Tomas Mikolov. 2016. Enriching word vectors with subword information. arXiv preprint arXiv:1607.04606 (2016).Google Scholar
Alberto Carreras Mesa, Mari Carmen Aguayo-Torres, Francisco J. Martin-Vega, Gerardo Gómez, Francisco Blanquez-Casado, Isabel M. Delgado-Luque, and Jose Entrambasaguas. 2018. Link abstraction models for multicarrier systems: A logistic regression approach. International Journal of Communication Systems 31, 1 (2018).Google Scholar
Moon-tong Chan, Dalei Yu, and Kelvin K. W. Yau. 2015. Multilevel cumulative logistic regression model with random effects: Application to British social attitudes panel survey data. Computational Statistics & Data Analysis 88 (2015), 173--186. Google ScholarDigital Library
Eric Fernandes de Mello Araújo and Dave Ebbelaar. 2018. Detecting Dutch political tweets: A classifier based on voting system using supervised learning. In 10th International Conference on Agents and Artificial Intelligence, ICAART 2018. SciTePress.Google ScholarCross Ref
Maite Giménez, Tomás Baviera, Germán Llorca, José Gámir, Dafne Calvo, Paolo Rosso, and Francisco Rangel. 2017. Overview of the 1st classification of spanish election tweets task at ibereval 2017. In Notebook Papers of 2nd SEPLN Workshop on Evaluation of Human Language Technologies for Iberian Languages (IBEREVAL), Murcia, Spain, September, Vol. 19.Google Scholar
Frank E. Harrell. 2001. Regression modeling strategies, with applications to linear models, survival analysis and logistic regression. In Springer Series in Statistics. Springer. Google ScholarDigital Library
Abdalraouf Hassan and Ausif Mahmood. 2018. Convolutional Recurrent Deep Learning Model for Sentence Classification. IEEE Access 6 (2018), 13949--13957.Google ScholarCross Ref
Marti Hearst. 2003. What is text mining. SIMS, UC Berkeley (2003).Google Scholar
Armand Joulin, Edouard Grave, Piotr Bojanowski, and Tomas Mikolov. 2016. Bag of tricks for efficient text classification. arXiv preprint arXiv:1607.01759 (2016).Google Scholar
Dan Jurafsky and James H. Martin. 2009. Speech and language processing: An introduction to natural language processing, computational linguistics, and speech recognition. In Prentice Hall series in artificial intelligence. Prentice Hall, Pearson Education International, 1--1024. Google ScholarDigital Library
Ankush Khandelwal, Sahil Swami, Syed Sarfaraz Akhtar, and Manish Shrivastava. 2017. Classification Of Spanish Election Tweets (COSET) 2017: Classifying Tweets Using Character and Word Level Features. In IberEval@ SEPLN. 49--54.Google Scholar
Yuancheng Li, Rong Ma, and Runhai Jiao. 2015. A hybrid malicious code detection method based on Deep Learning. International Journal of Software Engineering and its Applications 9, 5 (2015), 205--216.Google Scholar
George Loukas, Tuan Vuong, Ryan Heartfield, Georgia Sakellari, Yongpil Yoon, and Diane Gan. 2018. Cloud-based cyber-physical intrusion detection for vehicles using Deep Learning. IEEE Access 6 (2018), 3491--3508.Google ScholarCross Ref
Kevin P. Murphy. 2012. Machine Learning: A Probabilistic Perspective. Adaptive Computation and Machine Learning. In Adaptive Computation and Machine Learning series. MIT press. Google ScholarDigital Library
Arman Khadjeh Nassirtoussi, Saeed Aghabozorgi, Teh Ying Wah, and David Chek Ling Ngo. 2015. Text mining of news-headlines for FOREX market prediction: A Multi-layer Dimension Reduction Algorithm with semantics and sentiment. Expert Systems with Applications 42, 1 (2015), 306--324. Google ScholarDigital Library
Nnamdi I. Nwulu. 2017. Evaluation of machine learning classification algorithms & missing data imputation techniques. In International Artificial Intelligence and Data Processing Symposium (IDAP). IEEE, 1--5.Google ScholarCross Ref
Erik Tjong Kim Sang, Herbert Kruitbosch, Marcel Broersma, and Marc Esteve Del Valle. 2017. Determining the function of political tweets. In IEEE 13th International Conference on e-Science. IEEE, 438--439.Google ScholarCross Ref
Sandro Skansi. 2018. Introduction to Deep Learning: From Logical Calculus to Artificial Intelligence. Springer. Google ScholarDigital Library
Karen Sparck Jones. 1972. A statistical interpretation of term specificity and its application in retrieval. Journal of documentation 28, 1 (1972), 11--21.Google ScholarCross Ref
Hastie Trevor, Robert Tibshirani, and Jerome H. Friedman. 2009. The elements of statistical learning: Data Mining, Inference, and Prediction. New York, NY: Springer.Google Scholar
David Watts, K. M. George, T. K. Ashwin Kumar, and Zenia Arora. 2016. Tweet sentiment as proxy for political campaign momentum. In IEEE International Conference on Big Data. IEEE, 2475--2484.Google ScholarCross Ref
Xiang Zhu, Yuanping Nie, Songchang Jin, Aiping Li, and Yan Jia. 2015. Spammer detection on online social networks based on logistic regression. In International Conference on Web-Age Information Management. Springer, 29--40.Google ScholarCross Ref

Index Terms

Using Supervised Classification to Detect Political Tweets with Political Content
1. Computing methodologies
  1. Machine learning
    1. Machine learning approaches
2. Information systems
  1. Information retrieval
    1. Specialized information retrieval
      1. Environment-specific retrieval
        Web and social media search

Recommendations

Predicting political affiliation of posts on Facebook
IMCOM '17: Proceedings of the 11th International Conference on Ubiquitous Information Management and Communication

Recently, social media such as Facebook has been more popular. Receiving information from Facebook and generating or spreading information on Facebook every day has become a general lifestyle. This new information-exchanging platform contains a lot of ...
Read More
Quantifying Political Leaning from Tweets, Retweets, and Retweeters
The widespread use of online social networks (OSNs) to disseminate information and exchange opinions, by the general public, news media, and political actors alike, has enabled new avenues of research in computational political science. In this paper, we ...
Read More
A comparison between semi-supervised and supervised text mining techniques on detecting irony in greek political tweets

The present work describes a classification schema for irony detection in Greek political tweets. Our hypothesis states that humorous political tweets could predict actual election results. The irony detection concept is based on subjective perceptions, ...
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Published in
WebMedia '18: Proceedings of the 24th Brazilian Symposium on Multimedia and the Web
October 2018
437 pages
ISBN:9781450358675
DOI:10.1145/3243082
General Chairs:
Manoel Carvalho Marques Neto
IFBA
,
Renato Lima Novais
IFBA
,
Carlos Ferraz
UFPE
,
Windson Viana
UFC
Copyright © 2018 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]
Sponsors
In-Cooperation
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 16 October 2018
Permissions
Request permissions about this article.
Request Permissions

Check for updates
Author Tags
Deep learning
Logistic regression
Machine learning
Social networks analysis
Text mining
Qualifiers
- research-article
- Research
- Refereed limited
Conference

Acceptance Rates
WebMedia '18 Paper Acceptance Rate37of111submissions,33%Overall Acceptance Rate270of873submissions,31%
More
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 2
  Total Citations
  View Citations
- 234
  Total Downloads
- Downloads (Last 12 months)27
- Downloads (Last 6 weeks)1
Other Metrics
View Author Metrics
Cited By
View all

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Using Supervised Classification to Detect Political Tweets with Political Content

WebMedia '18: Proceedings of the 24th Brazilian Symposium on Multimedia and the Web

ABSTRACT

References

Cited By

Index Terms

Recommendations

Predicting political affiliation of posts on Facebook

Quantifying Political Leaning from Tweets, Retweets, and Retweeters

A comparison between semi-supervised and supervised text mining techniques on detecting irony in greek political tweets

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Permissions

Check for updates

Author Tags

Qualifiers

Conference

Acceptance Rates

Funding Sources

Other Metrics

Article Metrics

Other Metrics

Cited By

PDF Format

eReader

Digital Edition

Caption

Using Supervised Classification to Detect Political Tweets with Political Content

WebMedia '18: Proceedings of the 24th Brazilian Symposium on Multimedia and the Web

ABSTRACT

References

Cited By

Index Terms

Recommendations

Predicting political affiliation of posts on Facebook

Quantifying Political Leaning from Tweets, Retweets, and Retweeters

A comparison between semi-supervised and supervised text mining techniques on detecting irony in greek political tweets

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Permissions

Check for updates

Author Tags

Qualifiers

Conference

Acceptance Rates

Funding Sources

Article Metrics

Other Metrics

PDF Format

eReader

Digital Edition

Share this Publication link

Share on Social Media