Evaluation of Data Lake and Apache Spark Technologies for Urban Infrastructure Planning and Management

G.  Bektemisova; S.  Kalnazar

doi:10.51301/ce.2024.i1.05

Evaluation of Data Lake and Apache Spark Technologies for Urban Infrastructure Planning and Management

Authors

G. Bektemisova International Information Technology University, Kazakhstan
S. Kalnazar International Information Technology University, Kazakhstan

DOI:

https://doi.org/10.51301/ce.2024.i1.05

Keywords:

urban infrastructure, Data Lake, Apache Spark, big data analytics, scalability, automation, data quality, smart city planning

Abstract

This study evaluates the use of Data Lake technology and Apache Spark in the context of urban infrastructure management. By analyzing their capabilities for handling structured, semi-structured, and unstructured datasets, the research highlights their potential to optimize data processing workflows. The system was deployed on Yandex Cloud, leveraging distributed computing and horizontal scalability to achieve efficient data storage, real-time analytics, and fault tolerance. Automation pipelines and quality assurance mechanisms were implemented to streamline data ingestion, transformation, and validation processes. The findings demonstrate significant improvements in data processing efficiency, scalability, and resource optimization, offering a robust framework for enhancing smart city infrastructure planning and evaluation.

Downloads

Published

2024-03-31

How to Cite

Bektemisova, G. ., & Kalnazar, S. . (2024). Evaluation of Data Lake and Apache Spark Technologies for Urban Infrastructure Planning and Management. Computing &Amp; Engineering, 2(1), 25–31. https://doi.org/10.51301/ce.2024.i1.05

Download Citation

Issue

Vol. 2 No. 1 (2024): Computing & Engineering

Section

Automation, Robotics, and Intelligent Systems

License

This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.

<div class="pkpfooter-son">
<a rel="license" href="http://creativecommons.org/licenses/by-nc/4.0/"><img alt="Creative Commons License" style="border-width:0" src="https://i.creativecommons.org/l/by-nc/4.0/80x15.png"></a><br>This work is licensed under a <a rel="license" href="http://creativecommons.org/licenses/by-nc/4.0/">Creative Commons Attribution-NonCommercial 4.0 International License</a>.
</div>

Computing & Engineering

Evaluation of Data Lake and Apache Spark Technologies for Urban Infrastructure Planning and Management

Authors

DOI:

Keywords:

Abstract

Downloads

Published

How to Cite

Issue

Section

License

Current Issue

Language

Information

Make a Submission

Supported by