Result: Identification of Trending Topics Using Periodically Collected Twitter Data

Title:
Identification of Trending Topics Using Periodically Collected Twitter Data
Source:
International Journal of Engineering and Technology; Vol. 7 No. 3.12 (2018): Special Issue 12; 205-208 ; 2227-524X ; 10.14419/ijet.v7i3.12
Publisher Information:
Science Publishing Corporation
Publication Year:
2018
Collection:
Science Publishing Corporation: E-Journals
Document Type:
Academic journal; article in journal/newspaper
File Description:
application/pdf
Language:
English
DOI:
10.14419/ijet.v7i3.12.16025
Rights:
Copyright (c) 2018 International Journal of Engineering & Technology
Accession Number:
edsbas.804F710
Database:
BASE

Further Information

Social media is an interactive personal tool for articulating an individual's awareness. This project focuses on one such microblogging platform, Twitter. Trends can be defined simply as the topics mentioned most frequently throughout the stream of user activity. Mining Twitter data to identify trending topics provides an overview of the topics and issues that are currently popular within the online community. The most effective and suitable methodology should therefore be used to identify short-term, high-intensity discussion topics. Trigrams or higher-order n-grams are used to determine the trending topic. The Twitter Streaming API is used, with API keys, to collect data from Twitter accounts, and the formatted tweets are stored in a NoSQL database. Subsequent steps include data cleansing followed by stemming. The processed data is then subjected to trend prediction algorithms such as DBSCAN, frequent pattern mining, trees (fuzzy, inductive, decision), and soft frequent pattern mining, together with empirical statistics such as the frequency metric, TF-IDF, normalized term frequency, and entropy, based on the key parameters, to identify the most trending event within a period of time. The trending topics can thus be detected with a reasonably close approximation to the expected outcome. This can be used to detect and predict events for early warning systems or prediction tools, and in artificially intelligent services such as web search or recognition systems.
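As a rough illustration of the cleansing and trigram-frequency steps mentioned in the abstract, the following Python sketch counts, for a batch of tweets, the share of tweets that contain each trigram. It is not the authors' implementation: the function names (`clean`, `trigrams`, `top_trigrams`), the cleaning rules, and the sample tweets are all assumptions made for this example, and stemming, the Streaming API collection step, and the NoSQL storage are left out.

```python
import re
from collections import Counter

# Hypothetical cleaning rules (assumed for this sketch, not from the paper):
# drop URLs and @mentions, then drop everything except letters, digits, spaces.
URL_OR_MENTION = re.compile(r"https?://\S+|@\w+")
NON_WORD = re.compile(r"[^a-z0-9\s]")


def clean(tweet):
    """Lowercase a tweet, strip URLs, mentions and punctuation, and tokenize."""
    text = URL_OR_MENTION.sub(" ", tweet.lower())
    text = NON_WORD.sub(" ", text)
    return text.split()


def trigrams(tokens):
    """Return all consecutive three-token windows of a token list."""
    return list(zip(tokens, tokens[1:], tokens[2:]))


def top_trigrams(tweets, k=3):
    """Return the k trigrams found in the largest share of tweets.

    Each trigram is counted at most once per tweet, so the score is the
    fraction of tweets in the batch that contain it (a simple normalized
    frequency, assumed here as the ranking metric).
    """
    counts = Counter()
    for tweet in tweets:
        counts.update(set(trigrams(clean(tweet))))
    total = len(tweets)
    return [(" ".join(gram), n / total) for gram, n in counts.most_common(k)]


if __name__ == "__main__":
    # Made-up sample batch, standing in for one collection period.
    sample = [
        "Heavy rain floods city centre http://t.co/x",
        "@bob heavy rain floods the bridge!",
        "Heavy rain floods again",
        "Nice coffee this morning",
    ]
    print(top_trigrams(sample, k=1))  # [('heavy rain floods', 0.75)]
```

Counting each trigram once per tweet keeps a single repetitive tweet from dominating the ranking. Comparing these shares across successive collection periods would be one simple way to spot short-term spikes, although the abstract does not say which comparison the authors use.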