Treffer: Spatio-temporal Crime Analysis and Forecasting on Twitter Data Using Machine Learning Algorithms.

Title:
Spatio-temporal Crime Analysis and Forecasting on Twitter Data Using Machine Learning Algorithms.
Authors:
Vivek M; Computer Science and Engineering, CHRIST (Deemed to be University), Bangalore, India., Prathap BR; Computer Science and Engineering, CHRIST (Deemed to be University), Bangalore, India.
Source:
SN computer science [SN Comput Sci] 2023; Vol. 4 (4), pp. 383. Date of Electronic Publication: 2023 May 06.
Publication Type:
Journal Article
Language:
English
Journal Info:
Publisher: Springer Nature Country of Publication: Singapore NLM ID: 101772308 Publication Model: Print-Electronic Cited Medium: Internet ISSN: 2661-8907 (Electronic) Linking ISSN: 26618907 NLM ISO Abbreviation: SN Comput Sci Subsets: PubMed not MEDLINE
Imprint Name(s):
Original Publication: [Singapore] : Springer Nature, [2020]-
Contributed Indexing:
Keywords: Crime analysis; Crime prevention; Forecasting; Machine learning algorithms; Twitter data
Entry Date(s):
Date Created: 20230516 Latest Revision: 20230920
Update Code:
20250114
PubMed Central ID:
PMC10163854
DOI:
10.1007/s42979-023-01816-y
PMID:
37193217
Database:
MEDLINE

Weitere Informationen

The concept of social media began to gain popularity in the late 1990s and has played a significant role in connecting people across the globe. The constant addition of features to old social media platforms and the creation of new ones have helped amass and retain an extensive user base. Users could now share their views and provide detailed accounts of events from worldwide to reach like-minded people. This led to the popularization of blogging and brought into focus the posts of the commoner. These posts began to be verified and included in mainstream news articles bringing about a revolution in journalism. This research aims to use a social media platform, Twitter, to classify, visualize, and forecast Indian crime tweet data and provide a spatio-temporal view of crime in the country using statistical and machine learning models. The Tweepy Python module's search function and '#crime' query have been used to scrape relevant tweets under geographical constraints, followed by substring-keyword classification using 318 unique crime keywords. The Bokeh and gmaps Python modules create analytical and geospatial visualizations, respectively. Time series forecasting of crime tweet count is performed by comparing the accuracy of Long Short-Term Memory (LSTM), Auto-Regressive Integrated Moving Average (ARIMA), and Seasonal Auto-Regressivee Integrated Moving Average (SARIMA) models to determine the best model.
(© The Author(s), under exclusive licence to Springer Nature Singapore Pte Ltd 2023, Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.)

Conflict of interestI know of no conflicts of interest associated with this publication, and there has been no significant financial support for this work that could have influenced its outcome. As the corresponding author, I confirm that the manuscript has been read and approved for submission by the named author.