Real-time uber data analysis of popular uber locations in kubernetes environment

Gunawardena, TM; Jayasena, KPN

UoM IR
→
Research Publications
→
Conference Proceedings
→
UoM Conferences
→
Information Technology Research Unit (ITRU & ICITR)
→
ICITR - 2020
→
View Item

dc.contributor.author	Gunawardena, TM
dc.contributor.author	Jayasena, KPN
dc.contributor.editor	Karunananda, AS
dc.contributor.editor	Talagala, PD
dc.date.accessioned	2022-11-10T09:58:19Z
dc.date.available	2022-11-10T09:58:19Z
dc.date.issued	2020-12
dc.identifier.citation	T. M. Gunawardena and K. P. N. Jayasena, "Real-Time Uber Data Analysis of Popular Uber Locations in Kubernetes Environment," 2020 5th International Conference on Information Technology Research (ICITR), 2020, pp. 1-6, doi: 10.1109/ICITR51448.2020.9310851.	en_US
dc.identifier.uri	http://dl.lib.uom.lk/handle/123/19481
dc.description.abstract	Data is crucial in today's business and technology environment. There is a growing demand for Big Data applications to extract and evaluate information, which will provide the necessary knowledge that will help us make important rational decisions. These ideas emerged at the beginning of the 21st century, and every technological giant is now exploiting Big Data technologies. Big Data refers to huge and broad data collections that can be organized or unstructured. Big Data analytics is the method of analyzing massive data sets to highlight trends and patterns. Uber is using real-time Big Data to perfect its processes, from calculating Uber's pricing to finding the optimal positioning of taxis to maximize profits. Real-time data analysis is very challenging for the implementation because we need to process data in real-time, if we use Big Data, it is more complex than before. Implementation of real-time data analysis by Uber to identify their popular pickups would be advantageous in various ways. It will require high-performance platform to run their application. So far no research has been done on real-time analysis for identifying popular Uber locations within Big Data in a distributed environment, particularly on the Kubernetes environment. To address these issues, we have created a machine learning model with a Spark framework to identify the popular Uber locations and use this model to analyze real-time streaming Uber data and deploy this system on Google Dataproc with the different number of worker nodes with enabling Kubernetes and without Kubernetes environment. With the proposed Kubernetes environment and by increasing the worker nodes of Dataproc clusters, the performance can be significantly improved. The future development will consist of visualizing the real-time popular Uber locations on Google map.	en_US
dc.language.iso	en	en_US
dc.publisher	Faculty of Information Technology, University of Moratuwa.	en_US
dc.relation.uri	https://ieeexplore.ieee.org/document/9310851/	en_US
dc.subject	Uber	en_US
dc.subject	Kubernetes	en_US
dc.subject	Machine learning	en_US
dc.subject	Spark	en_US
dc.subject	real-time	en_US
dc.subject	Big Data	en_US
dc.subject	Distributed environment	en_US
dc.title	Real-time uber data analysis of popular uber locations in kubernetes environment	en_US
dc.type	Conference-Full-text	en_US
dc.identifier.faculty	IT	en_US
dc.identifier.department	Information Technology Research Unit, Faculty of Information Technology, University of Moratuwa.	en_US
dc.identifier.year	2020	en_US
dc.identifier.conference	5th International Conference in Information Technology Research 2020	en_US
dc.identifier.place	Moratuwa, Sri Lanka	en_US
dc.identifier.proceeding	Proceedings of the 5th International Conference in Information Technology Research 2020	en_US
dc.identifier.doi	doi: 10.1109/ICITR51448.2020.9310851	en_US