Scala Data Analysis Cookbook by Arun Manivannan

By Arun Manivannan

Navigate the area of information research, visualization, and laptop studying with over a hundred hands-on Scala recipes

About This Book

  • Implement Scala on your info research utilizing beneficial properties from Spark, Breeze, and Zeppelin
  • Scale up your info anlytics infrastructure with functional recipes for Scala computer learning
  • Recipes for each degree of the information research procedure, from analyzing and accumulating information to disbursed analytics

Who This publication Is For

This publication exhibits info scientists and analysts how one can leverage their current wisdom of Scala for caliber and scalable information analysis.

What you are going to Learn

  • Familiarize and manage the Breeze and Spark libraries and use info structures
  • Import information from a number of attainable assets and create dataframes from CSV
  • Clean, validate and remodel info utilizing Scala to pre-process numerical and string data
  • Integrate essential computing device studying algorithms utilizing Scala stack
  • Bundle and scale up Spark jobs through deploying them right into a number of cluster managers
  • Run streaming and graph analytics in Spark to imagine facts, permitting exploratory analysis

In Detail

This publication will introduce you to the preferred Scala instruments, libraries, and frameworks via sensible recipes round loading, manipulating, and getting ready your information. it is going to additionally assist you discover and make experience of your facts utilizing beautiful and insightfulvisualizations, and computer studying toolkits.

Starting with introductory recipes on using the Breeze and Spark libraries, familiarize yourself withhow to import information from a bunch of attainable resources and the way to pre-process numerical, string, and date facts. subsequent, you will get an figuring out of innovations that can assist you visualize information utilizing the Apache Zeppelin and Bokeh bindings in Scala, allowing exploratory information research. iscover the best way to application integral computer studying algorithms utilizing Spark ML library. paintings via steps to scale your computer studying types and installation them right into a standalone cluster, EC2, YARN, and Mesos. eventually dip into the strong recommendations provided by way of Spark Streaming, and desktop studying for streaming information, in addition to using Spark GraphX.

Style and approach

This ebook encompasses a wealthy set of recipes that covers the complete spectrum of attention-grabbing information research initiatives and may assist you revolutionize your information research abilities utilizing Scala and Spark.

Show description

Read or Download Scala Data Analysis Cookbook PDF

Similar object oriented design books

Logic Program Synthesis from Incomplete Information (The Springer International Series in Engineering and Computer Science)

Application synthesis is an answer to the software program situation. If we had a software that develops right courses from requisites, then software validation and upkeep may disappear from the software program life-cycle, and you can actually specialize in the extra inventive initiatives of specification elaboration, validation, and upkeep, simply because replay of application improvement will be less expensive.

Design Patterns in Java™ (2nd Edition) (Software Patterns Series)

Layout styles in Java™ supplies the hands-on perform and deep perception you must totally leverage the numerous energy of layout styles in any Java software program undertaking. the correct supplement to the vintage layout styles, this learn-by-doing workbook applies the newest Java gains and top practices to the entire unique 23 styles pointed out in that groundbreaking textual content.

Oracle Certified Associate, Java SE 7 Programmer Study Guide

Each one target is addressed utilizing a chain of programming examples. whilst the subject affects reminiscence, stack and heap illustrations are used to supply the reader with a extra extensive figuring out of the subject. on the finish of every bankruptcy, a chain of pattern questions are supplied to augment your wisdom.

Jump Start CoffeeScript: Get Up to Speed With CoffeeScript in a Weekend

A pragmatic and concise advent to CoffeeScript, a programming language that compiles into JavaScript and that makes operating with JavaScript more straightforward. The e-book lays out the fundamentals of the language, its syntax, and the attention-grabbing positive aspects that set it except JavaScript. it may fulfill an individual with an intermediate point of figuring out of JavaScript who wishes a conceptual and functional advent to CoffeeScript.

Additional resources for Scala Data Analysis Cookbook

Example text

Download PDF sample

Rated 4.46 of 5 – based on 21 votes