Paper
31 December 2020 Big data integration: an evolutionary perspective
Author Affiliations +
Proceedings Volume 11718, Advanced Topics in Optoelectronics, Microelectronics and Nanotechnologies X; 117180L (2020) https://doi.org/10.1117/12.2570740
Event: Advanced Topics in Optoelectronics, Microelectronics and Nanotechnologies 2020, 2020, Online Only
Abstract
This study aims at providing an efficient method to solve the statistical file merging issue: merging two or more files containing distinct datasets, from different sources, where the items of the datasets do not overlap (the percentage of units that are common to all datasets is insignificant). The problem is modeled as a network transportation problem and is solved using an adaptive Genetic Algorithm based on fuzzy logic controller, which dynamically calibrates algorithm parameters. This evolutionary technique is convenient for large-scale optimization problems with a significant number of variables and logical constraints. It also provides researchers in the fields of statistics and optimization a valuable instrument for achieving a correct balance between the quality of solutions and the processing time. This is a critical demand in most of the practical optimization problems. Numerical experiments on selected test instances validate this generalized merging approach and proves that the results of this file merging simulation can be extrapolated for better integration of various datasets generated by numerous government, administrative and statistical agencies. Apart from better central financial reporting for example, this will help to expand cooperation between private sector and government institutions.
© (2020) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Simona Dinu "Big data integration: an evolutionary perspective", Proc. SPIE 11718, Advanced Topics in Optoelectronics, Microelectronics and Nanotechnologies X, 117180L (31 December 2020); https://doi.org/10.1117/12.2570740
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Fuzzy logic

Data integration

Genetic algorithms

Data fusion

Distance measurement

Data modeling

Statistical analysis

RELATED CONTENT


Back to Top