Hadoop is a distributed system infrastructure developed by the Apache Foundation, enabling users to perform high-speed computing and storage of big data with the help of clusters without knowing the distributed underlying details.
It is a programming tool provided by Hadoop, which allows users to use any executable or script file as Mapper and Reducer.
It is an open source, non-relational distributed database similar to Google Big table. You can use Hbase to randomly query, update data, and keep latency low.
It is an open source data warehouse based on Hadoop. It enables you to avoid using low-level computer languages (such as java) to write MapReduce complex programs. For more information on Hive, please see http://hive.apache.org/.
It is a SQL query method provided by Hive (data warehouse analysis system based on Hadoop), which is used to analyze the data stored in the Hadoop distributed file system and map the structured data into a database table to provide a complete SQL query Features. In addition, you can convert SQL statements into MapReduce tasks and use your own SQL statements to query the content to be analyzed. Hive SQL is useful for users who are not familiar with Hadoop to query, analyze and summarize data.
HTTP Referer is part of the header. When a browser sends a request to a web server, it usually brings a Referer to tell the server which page I came from. The server can get some information for processing.
Hypertext Transfer Protocol
Hypertext Transfer Protocol (HTTP) is a data transfer protocol that specifies the rules for communication between browsers and web servers in detail. It is a data transfer protocol for transmitting web documents over the Internet., which is the seventh layer protocol in ISO seven layer model
Functions are user-written pieces of code that are triggered by events and perform specific functions.
A group of Baidu AI Cloud servers responsible for receiving frontend concurrent access forwarded to the backend server for processing according to the rules defined by the BLB instance.
It refers to some parameters used in function calculations to specify the operating environment of a function.
It is a temporary data exchange area with fast storage rate, which can reduce system load and increase data transmission rate.
Cache hit probability
The ratio of the data size called from the cache to the total data size called when the user's program runs.
It refers to an interaction between the user and the system, including the entire process from initiation to exit.
It is a mechanism that can identify the continuity of the user-server interaction process. It can distribute access requests related to the same user to the same server while loading balancing within a limited session lifecycle.