Result: FastMosaic in Action: A New Mosaic Operator for Array DBMSs

Title:
FastMosaic in Action: A New Mosaic Operator for Array DBMSs
Source:
Proceedings of the VLDB Endowment ; volume 16, issue 12, page 3938-3941 ; ISSN 2150-8097
Publisher Information:
Association for Computing Machinery (ACM)
Publication Year:
2023
Document Type:
Academic journal; article in journal/newspaper
Language:
English
DOI:
10.14778/3611540.3611590
Accession Number:
edsbas.85BB93C3
Database:
BASE

Further Information

Array DBMSs operate on N-d arrays. During the data ingestion phase, the widely used mosaic operator ingests a massive collection of overlapping arrays into a single large array, called a mosaic. The operator can utilize sophisticated statistical and machine learning techniques, e.g., Canonical Correlation Analysis (CCA), to produce a high-quality seamless mosaic in which the contrasts between the values of cells taken from overlapping input arrays are minimized. However, performance becomes a major bottleneck when such advanced techniques are applied to ever-growing array volumes. We introduce a new, scalable way to perform CCA that is orders of magnitude faster than Python's popular scikit-learn library for the purpose of array mosaicking. Furthermore, we developed a hybrid web-desktop application to showcase our novel FastMosaic operator, which is based on this new CCA. A rich GUI enables users to comprehensively investigate input and output arrays and interactively guides them through an end-to-end mosaic construction on real-world geospatial arrays using FastMosaic, facilitating convenient exploration of the FastMosaic pipeline and its internals.
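
To make the role of CCA in mosaicking more concrete, the following is a minimal sketch of classical textbook CCA (centering, QR decomposition, then SVD) applied to the cells that two overlapping multi-band arrays share. It is not the FastMosaic algorithm and not scikit-learn's implementation; the function name `canonical_correlations`, the toy arrays `left` and `right`, their shapes, and the `gain` matrix are all hypothetical and chosen only for illustration.

```python
import numpy as np

def canonical_correlations(x, y):
    """Classical CCA via QR + SVD. x, y: (n_cells, n_bands) matrices
    holding the same cells as seen by two overlapping arrays."""
    xc = x - x.mean(axis=0)          # center each band
    yc = y - y.mean(axis=0)
    qx, rx = np.linalg.qr(xc)        # orthonormal bases of the band spaces
    qy, ry = np.linalg.qr(yc)
    u, s, vt = np.linalg.svd(qx.T @ qy, full_matrices=False)
    a = np.linalg.solve(rx, u)       # canonical weights for x
    b = np.linalg.solve(ry, vt.T)    # canonical weights for y
    return np.clip(s, 0.0, 1.0), a, b

# Hypothetical toy data: one 3-band scene observed by two arrays whose
# footprints overlap in mosaic columns 3..5.
rng = np.random.default_rng(0)
scene = rng.normal(size=(20, 9, 3))
gain = np.array([[1.2, 0.1, 0.0],
                 [0.0, 0.9, 0.2],
                 [0.1, 0.0, 1.1]])
left = scene[:, 0:6, :]                  # covers mosaic columns 0..5
right = scene[:, 3:9, :] @ gain + 0.5    # columns 3..8, different radiometry

# Flatten the shared cells into (n_cells, n_bands) matrices.
overlap_left = left[:, 3:6, :].reshape(-1, 3)
overlap_right = right[:, 0:3, :].reshape(-1, 3)

corr, a, b = canonical_correlations(overlap_left, overlap_right)
print(corr)  # all close to 1: the two views differ only by a linear map
```

In this toy setup the two arrays differ only by an invertible linear transform plus an offset, so every canonical correlation is close to 1. On real overlapping arrays the correlations would be lower, and how a mosaic operator uses the canonical variates to reduce contrasts between inputs is specific to the method and not shown here.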