Expert techniques for architecting end-to-end big data solutions to get valuable insights /
نام نخستين پديدآور
V Naresh Kumar, Prashant Shindgikar.
وضعیت نشر و پخش و غیره
محل نشرو پخش و غیره
Birmingham :
نام ناشر، پخش کننده و غيره
Packt Publishing,
تاریخ نشرو بخش و غیره
2018.
مشخصات ظاهری
نام خاص و کميت اثر
1 online resource (394 pages)
يادداشت کلی
متن يادداشت
Table of ContentsHadoop Design Consideration Hadoop Life Cycle ManagementData Modeling in HadoopDesigning Streaming Data PipelinesBuilding Enterprise Search Platform Data Movement TechniquesEnterprise Data Architecture PrinciplesArchitecting Large Scale Data Processing Solutions using Spark Developing Application using Cloud InfrastructureDesigning Data Visualization Solutions Production Hadoop Administration and Cluster Deployment.
یادداشتهای مربوط به مندرجات
متن يادداشت
Cover; Title Page; Copyright and Credits; Packt Upsell; Contributors; Table of Contents; Preface; Chapter 1: Enterprise Data Architecture Principles; Data architecture principles; Volume; Velocity; Variety; Veracity; The importance of metadata; Data governance; Fundamentals of data governance; Data security; Application security; Input data; Big data security; RDBMS security; BI security; Physical security; Data encryption; Secure key management; Data as a Service; Evolution data architecture with Hadoop; Hierarchical database architecture; Network database architecture.
متن يادداشت
Add serviceService placement; Service client placement; Database creation on master; Ranger database configuration; Configuration changes; Configuration review; Deployment progress; Application restart; Apache Ranger user guide; Login to UI; Access manager; Service details; Policy definition and auditing for HDFS; Summary; Chapter 3: Hadoop Design Consideration; Understanding data structure principles; Installing Hadoop cluster; Configuring Hadoop on NameNode; Format NameNode; Start all services; Exploring HDFS architecture; Defining NameNode; Secondary NameNode; NameNode safe mode; DataNode.
متن يادداشت
Best practices Hadoop deploymentHadoop file formats; Text/CSV file; JSON; Sequence file; Avro; Parquet; ORC; Which file format is better?; Summary; Chapter 4: Data Movement Techniques; Batch processing versus real-time processing; Batch processing; Real-time processing; Apache Sqoop; Sqoop Import; Import into HDFS; Import a MySQL table into an HBase table; Sqoop export; Flume; Apache Flume architecture; Data flow using Flume; Flume complex data flow architecture; Flume setup; Log aggregation use case; Apache NiFi; Main concepts of Apache NiFi; Apache NiFi architecture; Key features.
متن يادداشت
Data replicationRack awareness; HDFS WebUI; Introducing YARN; YARN architecture; Resource manager; Node manager; Configuration of YARN; Configuring HDFS high availability; During Hadoop 1.x; During Hadoop 2.x and onwards; HDFS HA cluster using NFS; Important architecture points; Configuration of HA NameNodes with shared storage; HDFS HA cluster using the quorum journal manager; Important architecture points; Configuration of HA NameNodes with QJM; Automatic failover; Important architecture points; Configuring automatic failover; Hadoop cluster composition; Typical Hadoop cluster.
متن يادداشت
Relational database architectureEmployees; Devices; Department; Department and employee mapping table; Hadoop data architecture; Data layer; Data management layer; Job execution layer; Summary; Chapter 2: Hadoop Life Cycle Management; Data wrangling; Data acquisition; Data structure analysis; Information extraction; Unwanted data removal; Data transformation; Data standardization; Data masking; Substitution; Static ; Dynamic; Encryption; Hashing; Hiding; Erasing; Truncation; Variance; Shuffling; Data security; What is Apache Ranger?; Apache Ranger installation using Ambari; Ambari admin UI.
بدون عنوان
0
بدون عنوان
8
بدون عنوان
8
بدون عنوان
8
بدون عنوان
8
یادداشتهای مربوط به خلاصه یا چکیده
متن يادداشت
This book presents unique techniques to conquer different Big Data processing and analytics challenges using Hadoop. Practical examples are provided to boost your understanding of Big Data concepts and their implementation. By the end of the book, you will have all the knowledge and skills you need to become a true Big Data expert.
یادداشتهای مربوط به سفارشات
منبع سفارش / آدرس اشتراک
Packt Publishing
منبع سفارش / آدرس اشتراک
OverDrive, Inc.
شماره انبار
9781787128811
شماره انبار
D508DE65-BBBA-46CD-928A-49C8DFBFE6AC
ویراست دیگر از اثر در قالب دیگر رسانه
عنوان
Modern Big Data Processing with Hadoop : Expert techniques for architecting end-to-end big data solutions to get valuable insights.
عنوان به منزله موضوع
موضوع مستند نشده
Apache Hadoop.
موضوع مستند نشده
Apache Hadoop.
موضوع (اسم عام یاعبارت اسمی عام)
موضوع مستند نشده
Electronic data processing-- Distributed processing.
موضوع مستند نشده
COMPUTERS-- Computer Literacy.
موضوع مستند نشده
COMPUTERS-- Computer Science.
موضوع مستند نشده
Computers-- Data Modeling & Design.
موضوع مستند نشده
Computers-- Data Processing.
موضوع مستند نشده
Computers-- Database Management-- Data Mining.
موضوع مستند نشده
COMPUTERS-- Hardware-- General.
موضوع مستند نشده
COMPUTERS-- Information Technology.
موضوع مستند نشده
COMPUTERS-- Machine Theory.
موضوع مستند نشده
COMPUTERS-- Reference.
موضوع مستند نشده
Data capture & analysis.
موضوع مستند نشده
Data mining.
موضوع مستند نشده
Database design & theory.
موضوع مستند نشده
Electronic data processing-- Distributed processing.
موضوع مستند نشده
Information architecture.
مقوله موضوعی
موضوع مستند نشده
COM-- 013000
موضوع مستند نشده
COM-- 014000
موضوع مستند نشده
COM-- 018000
موضوع مستند نشده
COM-- 032000
موضوع مستند نشده
COM-- 037000
موضوع مستند نشده
COM-- 052000
موضوع مستند نشده
COM-- 067000
رده بندی ديویی
شماره
004
.
36
ويراست
23
رده بندی کنگره
شماره رده
QA76
.
9
.
D5
نشانه اثر
.
K863
2018eb
نام شخص به منزله سر شناسه - (مسئولیت معنوی درجه اول )