Mô hình xử lý dữ liệu lớn trên điện toán đám mây theo mô hình ánh xạ - rút gọn
ThS. Trần Thị Thúy
Abstract
Nowaday, we are living in the information age with an exponential explosion and growth information. The leading information technology companies such as Google, Yahoo, Facebook, Twitter,.... they are facing with a huge information. This requests to have the new strategy to analyze and process data. Cloud computing is developed and MapReduce-Hadoop has become the powerful computing model to be solved this problem. MapReduce provides a framework programming applications to process text data, it can solve a large amount of data fastly based on computing parallel on computer clusters. This article presents fundamental about a processing data on cloud computing, architecture and components of Hadoop, HDFS (Hadoop Distributed File System), Map Reduce and its application.