Performing Advanced Analytics on Relational Data with Spark SQL

In this event, we'll examine Spark SQL, a new Alpha component that is part of the Apache Spark 1.0 release. Spark SQL lets developers natively query data stored in both existing RDDs and external sources such as Apache Hive. A key feature of Spark SQL is the ability to blur the lines between re...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
1. Verfasser: Armbrust, Michael (VerfasserIn)
Format: Online
Sprache:eng
Veröffentlicht: Erscheinungsort nicht ermittelbar O'Reilly Media, Inc. 2014
Sebastopol, CA O'Reilly Media Inc.
Ausgabe:1st edition
Schlagworte:
Online Zugang:https://learning.oreilly.com/library/view/-/9781491908297
https://learning.oreilly.com/library/view/-/9781491908297/?ar
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:In this event, we'll examine Spark SQL, a new Alpha component that is part of the Apache Spark 1.0 release. Spark SQL lets developers natively query data stored in both existing RDDs and external sources such as Apache Hive. A key feature of Spark SQL is the ability to blur the lines between relational tables and RDDs, making it easy for developers to intermix SQL commands that query external data with complex analytics. In addition to Spark SQL, we'll explore the Catalyst optimizer framework, which allows Spark SQL to automatically rewrite query plans to execute more efficiently.
Beschreibung:1 Online-Ressource (1 video file, approximately 41 min.)
ISBN:978149190828