FPT Data Platform

🗃️ JupyterHub

JupyterHub is an open-source platform that provides a multi-user Jupyter Notebook environment, giving data scientists, data engineers, and software developers access to computational resources for data analysis, data processing, and machine learning model development. Integrated into the Cloud Data Platform, JupyterHub serves as a core component for managing, scaling, and optimizing resources across cloud services, supporting large-scale data storage and processing workflows.

🗃️ Flink

Apache Flink is an open-source distributed data processing framework designed primarily for real-time stream processing. It also supports batch processing, but it is best known for handling continuous data streams with low latency. Flink scales flexibly, supports stateful processing, and ensures data consistency, which makes it well suited to big data analytics, machine learning, IoT, financial systems, and system monitoring applications.