Published December 30, 2019 | Version v1
Journal article Open

Data Optimization using Apache Flink

  • 1. Assistant Professor, CSE Department, VTU PG Centre, Mysuru, Karnataka, India
  • 2. Professor, CSE Department, VTU PG Centre, Mysuru, Karnataka, India

Description

Map Reduce, Flink, and Spark, also become more popular in the processing of big data lately. Flink will be an open platform Big Data processing system for Apache-powered batch storage and streaming of data. Flink's query optimizer is constructed for historical information processing (batch) based on parallel storage systems approaches. Flink query query optimizer interprets the questions into jobs of different tasks that are regularly sent. Therefore, taking advantage of task similarities should prevent redundant computation. In this article, the multi-demand optimization model for Flink, Flink was planned and designed on Flink Software Stack's top priority. It's thought-about as an associate in Apache Flink's nursing add-on to maximize multi-demand information sharing. The Flink system takes advantage of option operators ' information sharing resources to reduce overlap and duplication of multi-query in-network information movement. Research findings show that the leveraging of shared option operations in vast information on multiple requests would offer promising time to perform queries. Therefore, in the stream phase, Without doubt the Flink approach can be used to boost application performance over time periods.

Files

B3081129219.pdf

Files (799.1 kB)

Name Size Download all
md5:226a8d699f37dac464da51cb68beae31
799.1 kB Preview Download

Additional details

Subjects

ISSN
2249-8958
Retrieval Number
B3081129219/2019©BEIESP