A Definitive Guide to Hadoop-Related Frameworks and Tools
E-Book, Englisch, 421 Seiten, eBook
ISBN: 978-1-4842-2199-0
Verlag: APRESS
Format: PDF
Kopierschutz: 1 - PDF Watermark
Set up the environment in Linux for Hadoop projects using Cloudera Hadoop Distribution CDH 5
Run a MapReduce job
Store data with Apache Hive, and Apache HBase
Index data in HDFS with Apache Solr
Develop a Kafka messaging system
Stream Logs to HDFS with Apache Flume
Transfer data from MySQL database to Hive, HDFS, and HBase with Sqoop
Create a Hive table over Apache Solr
Develop a Mahout User Recommender SystemWho This Book Is For:
Apache Hadoop developers. Pre-requisite knowledge of Linux and some knowledge of Hadoop is required.
Zielgruppe
Professional/practitioner
Autoren/Hrsg.
Weitere Infos & Material
Part I. Fundamentals.- Introduction.- 1. HDFS and MapReduce.- Part II Storing & Querying.- 2. Apache Hive.- 3. Apache HBase.- Part III Bulk Transferring & Streaming.- 4. Apache Sqoop.- 5. Apache Flume.- Part IV Serializing.- 6. Apache Avro.- 7. Apache Parquet.- Part V Messaging & Indexing.- 8. Apache Kafka.- 9. Apache Solr.- 10.Apache Mahout.