Explore Learnmore Technologies
Syllabus
- Linux Introduction
- File Management
- Directories
- File Permission
- Basic Utilities
- Pipes & Filters
- Processes
- Communication
- The vi Editor
- Shell Scripting
- What is Shell?
- Using Variables
- Special Variables
- Using Arrays
- Basic Operators
- Decision Making
- Shell Loops
- Loop Control
- Shell Substitutions
- Quoting Mechanisms
- IO Redirections
- Shell Functions
- Home
- Overview
- RDBMS Concepts
- Databases
- Syntax
- Data Types
- Operators
- Expressions
- SQL Database
- SQL Table
- SQL Queries
- SQL Views
- Advance Query
- Big Data Overview
- Big Data Solutions
- Introduction
- Environment Setup
- HDFS Overview
- HDFS Operations
- Command Reference
- MapReduce
- Streaming
- Multi-Node Cluster
- MapReduce
- Introduction
- Algorithm
- Installation
- API
- Hadoop Implementation
- Partitioner
- Combiners
- Hadoop Administration
- Big Data Overview
- Big Data Solutions
- Introduction
- Environment Setup
- HDFS Overview
- HDFS Operations
- Command Reference
- MapReduce
- Streaming
- Multi-Node Cluster
- MapReduce
- Introduction
- Algorithm
- Installation
- API
- Hadoop Implementation
- Partitioner
- Combiners
- Hadoop Administration
- Introduction
- Feature & Characteristics
- Advantages
- Python Vs Scala Vs Java
- Variables
- Datatype
- Operators
- Type Conversion
- Conditional Statement
- Loops
- Comment
- function & higher order function
- OOPS Feature(object,class,Inheritance,Polymorphism,Encapsulation)
- String
- Arrays
- List
- File handling
- Exeption Handling
- Collection
- Spark RDD
- Parallelize
- Read text file
- Read CSV
- Create RDD
- Actions
- Pair Functions
- Repartition and Coalesce
- Shuffle Partitions
- Cache vs Persist
- Persistance Storage Levels
- Broadcast Variables
- Accumulator Variables
- Convert RDD to DataFrame
- Introduction of DataFrame
- createDataFrame()
- where() & filter()
- withColumn()
- withColumnRenamed()
- drop()
- distinct()
- groupBy()
- join()
- map() vs mapPartitions()
- foreach() vs foreachPartition()
- pivot()
- union()
- collect()
- cache() & persist()
- udf()
- Spark SQL StructType & StructField
- Apache Kafka – Basic Operations
- Simple Producer Example
- Consumer Group Example
- Integration With Storm
- Integration With Spark
- Real Time Application(Twitter)
- Apache Kafka – Tools
- Apache Kafka – Applications
- AWS- AWS Services(EMR,S3 ,SNS,Lambada function,Glue Job,GIT,Database,EC2 Instance)
- Microsoft Azure – Azure Services(Blob,Delta table,Databricks,ADF,GIT,Database,Virtual Machine)