          Description of BOS HDFS Tool

          The BOS HDFS tool is a Hadoop-based tool from Baidu AI Cloud for reading, writing, and using data stored in BOS in big data scenarios.

          Data analysis in big data scenarios has become a core business concern for enterprises. Hadoop is well suited to distributed data processing: it is reliable, efficient, scalable, and supports concurrent processing, and it has become one of the most widely used open-source big data frameworks. Hadoop implements a distributed file system, the Hadoop Distributed File System (HDFS). HDFS is highly fault tolerant, provides high-throughput access to application data, and suits business scenarios with very large data sets. It has become the most important part of the Hadoop ecosystem, providing reliable storage for massive amounts of data.

          As data volumes grow, native Hadoop faces new problems. Building, operating, and maintaining HDFS yourself is expensive, and storing massive amounts of data in a local HDFS cluster is a major challenge for enterprises. As a result, more and more enterprises choose to store their data in the cloud, that is, in an object storage service. However, because of the limitations of the interfaces that object storage exposes to upper-layer data applications, accessing, reading, and writing data between object storage and self-built HDFS has long been a bottleneck in big data scenarios. BOS HDFS is designed to solve this problem.

          The BOS HDFS tool is fully compatible with the Hadoop 2.7+ and 3.1+ series. It lets you store HDFS data at scale in BOS while upper-layer data operations continue to access, read, and write that data through the standard HDFS interface. This addresses the high operation and maintenance costs and the limited scalability of self-built HDFS. You can take full advantage of BOS, including its low price, high performance, high reliability, and high throughput, to meet enterprise needs for reading, writing, and using data in big data scenarios.

          Advantages of BOS HDFS Tool

          • Framework compatibility: Fully compatible with the Hadoop 2.7+ and 3.1+ series.
          • Transparent access: Data in BOS is accessed through the standard HDFS interface, so callers do not need to be aware of the underlying storage.
          • Cost-effective data storage: Brings the advantages of the BOS object storage service, including low price, high performance, high reliability, and high throughput.