Research Article

International Network Performance and Security Testing Based on Distributed Abyss Storage Cluster and Draft of Data Lake Framework

Table 1

Key differences between data warehouse and Data Lake.

ItemsData warehouseData Lake

DataStructured, processedStructured/semistructured/unstructured, raw
ProcessingSchema-on-writeSchema-on-read
storageExpensive for large data volumesDesigned for low-cost storage
AgilityLess agile, fixed configurationHighly agile, configure and reconfigure as needed
SecurityMatureMaturing
usersBusiness professionalsData scientists, etc.