Peakin

 

  1. SCALA BASICS
    • Introduction
    • Immutability
    • Variables declarations
    • String Interpolation
    • Methods
    • If Else Statements
    • Control structures
    • Tuples
    • Collections
    • Monadics
    • Traits

 

  1. Spark Framework
    • Why Spark ?
    • Spark vs MapReduce
    • Spark Architecture
    • Spark Submit

 

  1. RDD and its operations
    • RDD
    • Fault Tolerant
    • Immutability
    • Lazy Evaluation
    • Cache
    • Transformations
    • Actions

 

  1. Persistence Levels
  2. Serialization

 

  1. Shared Variables
    • Accumulators
    • Broadcast Joins

 

  1. Joins
    • Shuffled based Joins
    • Broadcast Joins

 

  1. File Formats
    • Parquet
    • ORC
    • JSON
    • XML
    • Text
    • AVRO

 

  1. Validations
    • Validations using either monadic collections

 

  1. Spark SQL
    • Introduction
    • Data Source API
    • Data Frames
    • Query processing using SQL
    • Query processing using DF API
    • Joins in Spark SQL
    • UDFs in Spark SQL
    • Memory Management
    • Optimization Techniques