A surprisingly high number of people believe that Cloud Storage is synonymous with disk, primarily performance disk, or Tier-1 disk, and deduplication. 云存储是磁盘、高性能磁盘或Tier-1磁盘和重复数据删除的代名词。抱有这种想法的人很多,并还达到了一个惊人的数量。
Deduplication, RAID and thin-provisioning are all examples of great features on disk that allow cost savings and/ or protection. 重复数据删除、RAID和精简配置,所有这些节约成本或保护磁盘的强大功能都有可能。
When the magnitude of data get to be more than TB, a data deduplication system could reduce total storage cost efficiently. 当数据的数量级在TB以上时,数据去冗余系统将能够大幅度减少数据备份的成本。
The goal of this integration method is to obtain deep web pages as much as possible and use the extraction and deduplication techniques to get structural, high-quality data that are the data basis of analysis and mining. 本文致力于面向分析的deepweb数据集成研究,目标在于最大限度地获取deepweb页面,运用抽取与消重技术得到结构化良好、高质量的数据,为进一步的分析挖掘提供数据支持。
The explosive growth of data as well as large-scale centralized storage makes waste of space caused by the duplication of data; the problem is getting worse, which prompted the emergence and development of data deduplication technology. 数据量的爆炸式增长以及海量数据的大规模集中使得数据重复所导致的空间浪费问题越来越严重,这促使了重复数据消除技术的出现和发展。
The system can solve the single point failure of the server; has the file deduplication to reduce the number of release or download files; reduce the load on a single server when users publish and download large number of files and other functions. 该系统能够解决服务器单点故障;具有文件去重功能,减少了发布或下载文件的数量;降低用户在发布和下载大量文件时对单个服务器的负载等功能。