Twitter data analysis using Hadoop ecosystems and Apache Zeppelin

People's day-to-day lives depend not only on what they think but also on what others think. Advertisements and campaigns by favourite celebrities and mesmerizing personalities influence the way people think and see the world. People get the news and...

Bibliographic Details
Main Authors: Wilson, Stanly (Author), R, Sivakumar (Author)
Format: EJournal Article
Published: Institute of Advanced Engineering and Science, 2019-12-01.
LEADER 02783 am a22003013u 4500
001 ijeecs18522_13185
042 |a dc 
100 1 0 |a Wilson, Stanly  |e author 
100 1 0 |e contributor 
700 1 0 |a R, Sivakumar  |e author 
245 0 0 |a Twitter data analysis using Hadoop ecosystems and Apache Zeppelin 
260 |b Institute of Advanced Engineering and Science,   |c 2019-12-01. 
500 |a https://ijeecs.iaescore.com/index.php/IJEECS/article/view/18522 
520 |a People's day-to-day lives depend not only on what they think but also on what others think. Advertisements and campaigns by favourite celebrities and mesmerizing personalities influence the way people think and see the world. People now get news and information faster than ever before. Textual data on the internet is growing very fast. People express themselves in various ways on the web every minute, and they use various platforms to share their views and opinions. A huge amount of data is generated at every moment in this process. Twitter is one of the most important and well-known social media platforms of the present time, and millions of tweets are posted on it every day. These tweets are a source of very important information that can be used by businesses and small industries, in creating government policies, and in various studies. This paper focuses on the location from which tweets are posted and the language in which they are written. These details can be effectively analysed by using Hadoop. Hadoop is a tool that is used to analyze distributed big data, streaming data, timestamp data and text data. With the help of Apache Flume, the tweets can be collected from Twitter and then written to HDFS (Hadoop Distributed File System). These raw data are then analyzed using Apache Pig, and the resulting information can be used for social and commercial purposes. The results are visualized using Apache Zeppelin. 
540 |a Copyright (c) 2019 Institute of Advanced Engineering and Science 
540 |a http://creativecommons.org/licenses/by-nc/4.0 
546 |a eng 
690 |a Twitter Data Analysis 
690 |a Flume, JSON, Hadoop, HDFS, Pig, Twitter, Tweets, Zeppelin 
655 7 |a info:eu-repo/semantics/article  |2 local 
655 7 |a info:eu-repo/semantics/publishedVersion  |2 local 
655 7 |2 local 
786 0 |n Indonesian Journal of Electrical Engineering and Computer Science; Vol 16, No 3: December 2019; 1490-1498 
786 0 |n 2502-4760 
786 0 |n 2502-4752 
786 0 |n 10.11591/ijeecs.v16.i3 
787 0 |n https://ijeecs.iaescore.com/index.php/IJEECS/article/view/18522/13185 
856 4 1 |u https://ijeecs.iaescore.com/index.php/IJEECS/article/view/18522/13185  |z Get fulltext
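
The abstract describes grouping collected tweets by language and by posting location. The record does not include the authors' Pig scripts, so the following is only a minimal Python sketch of that grouping step, not the paper's implementation. It assumes the collected tweets are stored as one JSON object per line and carry the `lang` and `user.location` fields of the standard Twitter tweet object; the function name `count_by_language_and_location`, the `"und"` and `"unknown"` fallbacks, and the sample records are all hypothetical.

```python
import json
from collections import Counter


def count_by_language_and_location(lines):
    """Tally tweets per language code and per user-supplied location.

    `lines` is any iterable of strings, each expected to hold one tweet
    as a JSON object (an assumed storage layout, not taken from the paper).
    """
    languages = Counter()
    locations = Counter()
    for line in lines:
        line = line.strip()
        if not line:
            continue
        try:
            tweet = json.loads(line)
        except json.JSONDecodeError:
            continue  # skip truncated or malformed records
        if not isinstance(tweet, dict):
            continue  # skip JSON values that are not tweet objects
        # Fall back to a placeholder when the language field is absent.
        languages[tweet.get("lang") or "und"] += 1
        # The location is free text typed by the user and is often null.
        user = tweet.get("user") or {}
        location = (user.get("location") or "").strip()
        locations[location or "unknown"] += 1
    return languages, locations


# Hypothetical sample records, including one malformed line.
sample = [
    '{"text": "hello", "lang": "en", "user": {"location": "Bengaluru"}}',
    '{"text": "hola", "lang": "es", "user": {"location": null}}',
    '{"text": "hi again", "lang": "en", "user": {"location": "Bengaluru"}}',
    'not json',
]
languages, locations = count_by_language_and_location(sample)
print(languages.most_common())
print(locations.most_common())
```

In a Hadoop setting the same tallies would typically come from a group-and-count over the files Flume writes to HDFS; the in-memory counters here only illustrate the logic on a small sample.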