The Work of Tony

Big Data Analytics

Apache Spark case study on EMR and Amazon S3

Miscellaneous Data Sets

Project Summary

Use Apache Spark, Scala, and other enterprise data analytics tools to research and interpret commonly available data sets from Google.

My Role

Independent Study and Research, Cloud Certification Series

Goals

Outcomes

Tools & Methods

Getting Started

CI/CD Pipeline, Git, Documentation, and Key Differences in Industry

Case Studies

Cassandra Case Study

Hadoop Case Study

MapReduce Case Study

Apache Hive Case Study 

Building a Dashboard with Node.js, Kafka, Spark, and Highcharts Case Study

Big Data on Cloud Case Study