ICP_01 : Introduction to Big Data Analytics and Applications - acikgozmehmet/BigDataAnalyticsAndApplications GitHub Wiki

Welcome to the BigDataAnalyticsAndApplications wiki!

Objectives:

We will focus on installation and getting familiar with Big Data Analytics and Applications programming concepts.

Installations

  • Google Colab

    Brief Introduction About Google Colab

  • Same as Jupyter Notebook

    has important libraries already installed

    some free GPU space

    Preexisting visualizations

    Many other features

How to work with Google Colab

  1. Sign into your Gmail account
  2. Go to https://colab.research.google.com/notebooks/intro.ipynb#recent=true
  3. Cick on New Notebook and start coding
  4. You can Save the Colab Notebook on the Github by clicking File> Save a copy in Github
  • Github

    • Software development platform

    • Brings together the world's largest community of developers to discover, share, and build better software

    • Projects on GitHub can be public or private

In Class Programming

1. Download the Data from the following link into your google drive. https://umkc.box.com/s/y3niwfccpumnzbkyunwyznngb1xtd7a6

2. Mount the google drive to Google Colab notebook and read the data using python Pandas library.

3. Perform 3 data analysis tasks on the data.

4. Create the Github repository.

5. Save the Colab notebook in your Github repository.

References: