1.4.3.Visualizing the data - sj50179/Google-Data-Analytics-Professional-Certificate GitHub Wiki

Q. What are some reasons why a data analyst might use data visualizations? Select all that apply.

  • To create interesting graphs

  • To explain complex data quickly

  • To reinforce data analysis

Correct. Data analysts use data visualizations to explain complex data quickly, reinforce data analysis, and create interesting graphs and charts.

  • To edit data in color

Planning a data visualization


Earlier, you learned that data visualization is the graphical representation of information. As a data analyst, you will want to create visualizations that make your data easy to understand and interesting to look at. Because of the importance of data visualization, most data analytics tools (such as spreadsheets and databases) have a built-in visualization component while others (such as Tableau) specialize in visualization as their primary value-add. In this reading, you will explore the steps involved in the data visualization process and a few of the most common data visualization tools available.

Steps to plan a data visualization

Let’s go through an example of a real-life situation where a data analyst might need to create a data visualization to share with stakeholders. Imagine you’re a data analyst for a clothing distributor. The company helps small clothing stores manage their inventory, and sales are booming. One day, you learn that your company is getting ready to make a major update to its website. To guide decisions for the website update, you’re asked to analyze data from the existing website and sales records. Let’s go through the steps you might follow.

Step 1: Explore the data for patterns

First, you ask your manager or the data owner for access to the current sales records and website analytics reports. This includes information about how customers behave on the company’s existing website, basic information about who visited, who bought from the company, and how much they bought.

While reviewing the data you notice a pattern among those who visit the company’s website most frequently: geography and larger amounts spent on purchases. With further analysis, this information might explain why sales are so strong right now in the northeast—and help your company find ways to make them even stronger through the new website.

Step 2: Plan your visuals

Next it is time to refine the data and present the results of your analysis. Right now, you have a lot of data spread across several different tables, which isn’t an ideal way to share your results with management and the marketing team. You will want to create a data visualization that explains your findings quickly and effectively to your target audience. Since you know your audience is sales oriented, you already know that the data visualization you use should:

  • Show sales numbers over time
  • Connect sales to location
  • Show the relationship between sales and website use
  • Show which customers fuel growth

Step 3: Create your visuals

Now that you have decided what kind of information and insights you want to display, it is time to start creating the actual visualizations. Keep in mind that creating the right visualization for a presentation or to share with stakeholders is a process. It involves trying different visualization formats and making adjustments until you get what you are looking for. In this case, a mix of different visuals will best communicate your findings and turn your analysis into the most compelling story for stakeholders. So, you can use the built-in chart capabilities in your spreadsheets to organize the data and create your visuals.

1) line charts can track sales over time 2) maps can connect sales to locations 3) donut charts can show customer segments 4) bar charts can compare total visitors that make a purchase

Build your data visualization toolkit

There are many different tools you can use for data visualization.

  • You can use the visualizations tools in your spreadsheet to create simple visualizations such as line and bar charts.
  • You can use more advanced tools such as Tableau that allow you to integrate data into dashboard-style visualizations.
  • If you’re working with the programming language R you can use the visualization tools in RStudio.

Your choice of visualization will be driven by a variety of drivers including the size of your data, the process you used for analyzing your data (spreadsheet, or databases/queries, or programming languages). For now, just consider the basics.

Spreadsheets (Microsoft Excel or Google Sheets)

In our example, the built-in charts and graphs in spreadsheets made the process of creating visuals quick and easy. Spreadsheets are great for creating simple visualizations like bar graphs and pie charts, and even provide some advanced visualizations like maps, and waterfall and funnel diagrams (shown in the following figures).

But sometimes you need a more powerful tool to truly bring your data to life. Tableau and RStudio are two examples of widely used platforms that can help you plan, create, and present effective and compelling data visualizations.

Visualization software (Tableau)

Tableau is a popular data visualization tool that lets you pull data from nearly any system and turn it into compelling visuals or actionable insights. The platform offers built-in visual best practices, which makes analyzing and sharing data fast, easy, and (most importantly) useful. Tableau works well with a wide variety of data and includes an interactive dashboard that lets you and your stakeholders click to explore the data interactively.

You can start exploring Tableau from the How-to Video resources. Tableau Public is free, easy to use, and full of helpful information. The Resources page is a one-stop-shop for how-to videos, examples, and datasets for you to practice with. To explore what other data analysts are sharing on Tableau, visit the Viz of the Day page where you will find beautiful visuals ranging from the Hunt for (Habitable) Planets to Who’s Talking in Popular Films.

Programming language (R with RStudio)

A lot of data analysts work with a programming language called R. Most people who work with R end up also using RStudio, an integrated developer environment (IDE), for their data visualization needs. As with Tableau, you can create dashboard-style data visualizations using RStudio.

Check out their website to learn more about RStudio.

You could easily spend days exploring all the resources provided at RStudio.com, but the RStudio Cheatsheets and the RStudio Visualize Data Primer are great places to start. When you have more time, check out the webinars and videos which offer advice and helpful perspectives for both beginners and advanced users.

Key takeaway

The best data analysts use lots of different tools and methods to visualize and share their data. As you continue learning more about data visualization throughout this course, be sure to stay curious, research different options, and continuously test new programs and platforms to help you make the most of your data.

Test your knowledge on visualizing data

Question 1

Fill in the blank: A data visualization is the _____ representation of information.

  • tabulated

  • attributed

  • graphical

  • contextual

Correct. A data visualization is the graphical representation of information.

Question 2

What would a bar graph be most effective at demonstrating?

  • The change in someone's age over time

  • How two or more people's ages form a relationship

  • How a group of people is broken down by their ages

  • The different ages of various people in a group

Correct. A bar graph would be most effective at demonstrating the difference in age of various people in a group.

Question 3

What are the key benefits of data visualizations? Select all that apply.

  • They can illustrate relationships between data points

  • They can clearly demonstrate patterns and trends

  • They can help stakeholders understand complex data more quickly

Correct. Data visualizations can clearly demonstrate patterns and trends, help stakeholders understand complex data more quickly, and illustrate relationships between data points.

  • They can speed up your analysis