3.5.3.Course challenge - quanganh2001/Google-Data-Analytics-Professional-Certificate-Coursera GitHub Wiki

Prepare for the course challenge by reviewing terms and definitions in the glossary. Then, demonstrate your knowledge of data collection, ethics and privacy, and bias during the quiz. You will also have an opportunity to apply your skill with spreadsheet and SQL functions, as well as filtering and sorting. Finally, secure and organize data with data analytics best practices.

Learning Objectives

  • Explain factors that should be considered when making decisions about data collection
  • Explain the relationship between data ethics and data privacy
  • Discuss characteristics of credible sources of data including reference to untidy data
  • Identify different types of bias including confirmation, interpretation, and observer bias
  • Demonstrate an understanding of how to use SQL functions to extract data from a database
  • Demonstrate an understanding of how to use spreadsheet functionality to import and inspect a given set of data
  • Describe best practices for organizing data

Glossary: Terms and definitions

We’ve covered a lot of terms—some of which you may have already known, and some of which are new. To make it easy to remember what a word means, we created this glossary of terms and definitions.

To use the glossary for this course item, click the link below and select “Use Template.”

Link to glossary: Week 5 Glossary

OR

If you don’t have a Google account, you can download the glossary directly from the attachment below:

Course 3 Week 5 Glossary_DA terms and definitions

Course challenge

Question 1

Scenario 1, questions 1-5

You’ve been working at a data analytics consulting company for the past six months. Your team helps restaurants use their data to better understand customer preferences and identify opportunities to become more profitable.

To do this, your team analyzes customer feedback to improve restaurant performance. You use data to help restaurants make better staffing decisions and drive customer loyalty. Your analysis can even track the number of times a customer requests a new dish or ingredient in order to revise restaurant menus.

Currently, you’re working with a vegetarian sandwich restaurant called Garden. The owner wants to make food deliveries more efficient and profitable. To accomplish this goal, your team will use delivery data to better understand when orders leave Garden, when they get to the customer, and overall customer satisfaction with the orders.

Before project kickoff, you attend a discovery session with the vice president of customer experience at Garden. He shares information to help your team better understand the business and project objectives. As a follow-up, he sends you an email with datasets.

Click below to read the email:

C3 Scenraio 1_Client Email

Course 3 Final Challenge Data Sets - Customer survey data (1)

Course 3 Final Challenge Data Sets - Delivery times_distance (1)

Reviewing the data enables you to describe how you will use it to achieve your client’s goals. First, you notice that all of the data was collected by Garden employees using their own resources. What type of data does this describe?

A. Nominal data

B. First-party data

C. Third-party data

D. Qualitative data

The correct answer is B. First-party data. Explain: This describes first-party data, which is collected by an individual or group using their own resources.

Question 2

Scenario 1 continued

Next, you review the customer satisfaction survey data. To use the template for the customer satisfaction survey data, click the link below and select “Use Template.”

Link to template: Customer Satisfaction Survey data

OR

If you don’t have a Google account, download the CSV file directly from the attachment below.

CustomerSurveyData

You notice that the data in column E is an example of Boolean data. Why did you come to this conclusion?

A. It has only two possible values.

B. It has each subject in multiple rows.

C. It is organized in a certain format, such as rows and columns.

D. It is qualitative data with a set order or scale.

The correct answer is A. It has only two possible values. Explain: Boolean data has only two possible values, such as yes or no.

Question 3

Scenario 1 continued

Now, you review the data on delivery times and the distance of customers from the restaurant.

To use the template for the dataset, click the link below and select “Use Template.”

Link to template: Delivery Times/Distance

OR

If you don’t have a Google account, download the CSV file directly from the attachment below.

DeliveryTimes_DistanceData

The data in column E shows the duration of deliveries from Garden to customers. What type of data is this? Select all that apply.

  • Continuous data
  • Quantitative data
  • Qualitative data
  • Discrete data

Explain: This is an example of discrete data, which is counted and has a limited number of values. It is also quantitative data, which is specific and measures numerical facts.

Question 4

Scenario 1 continued

The next thing you review is the file containing pictures of sandwich deliveries over a period of 30 days. This is an example of structured data. True or False?

A. True

B. False

It is false statement. Explain: This is an example of unstructured data, which is not organized in an easily identifiable manner.

Question 5

Scenario 1 continued

Now that you’re familiar with the data, you want to build trust with the team at Garden.

What actions should you take when working with their data? Select all that apply.

  • Post on social media that you’re working with Garden and would like feedback from any of your contacts who have ordered there before.
  • Keep the data safe by implementing data-security measures, such as password protection and user permissions.
  • Organize the data using effective naming conventions.
  • Share the client’s data with other delivery restaurants to compare performance.

Explain: You can build trust by showing a client that you will organize their data effectively and keep it safe by implementing appropriate data-security measures.

Question 6

Scenario 2, questions 6-10

You’ve completed this program and are interviewing for a junior data scientist position at a company called Sewati Financial Services.

Click below to review the job description:

C3 Course Challenge Junior Data Scientist Job Description .pdf

So far, you’ve successfully completed the first interview with a recruiter. They arrange your second interview with the team at Sewati Financial Services.

Click below to read the email from the human resources director:

Course 3 Scenario 2_Second Interview Email.pdf

You arrive 15 minutes early for your interview. Soon, you are escorted into a conference room, where you meet Kai Harvey, the senior manager of strategy. After welcoming you, he begins the behavioral interview.

Consider and respond to the following question. Select all that apply.

Our data analytics team often surveys clients to get their feedback. If you were on the team, how would you ensure the process does not cause potential bias?

  • Instruct participants to share their name and contact information.
  • Give participants enough time to answer each survey question.
  • Include clients with disabilities in the survey sample.
  • Make sure the wording of the survey question does not encourage a specific response from participants.

Explain: The way questions are written, the amount of time given to answer each question, and the inclusivity of the participants can cause potential bias.

Question 7

Scenario 2 continued

Consider and respond to the following question. Select all that apply.

Our data analytics team often uses external data. Where can you access useful external data?

  • Sewati Financial Services database in the cloud
  • Sewati Financial Services website
  • A public database
  • An open-data website

Explain: Data analysts access external data from open-data websites and public databases.

Question 8

Scenario 2 continued

Consider and respond to the following question. Select all that apply.

Our analysts often work with the same spreadsheet, but for different purposes. How would you use sorting to help in this situation?

  • Sort data to show only the data that meets a specific criteria while hiding the rest
  • Sort data to highlight the header row.
  • Sort the data to arrange data in a meaningful order
  • Sort data to make it easier to understand, analyze and visualize

Explain: Sorting data enables data analysts on the same team to use the same dataset for different purposes.

Question 9

Scenario 2 continued

Next, your interviewer wants to better understand your knowledge of basic SQL commands. He asks: How would you write a query that retrieves only data about people with the last name Hassan from the Clients table in our database?

A. SELECT * WHERE Clients = 'Hassan' B. SELECT Clients WHERE last_name = 'Hassan' C. SELECT * FROM Clients WHERE last_name = 'Hassan' D. SELECT * WHERE last_name = 'Hassan'

The correct syntax is C. SELECT * FROM Clients WHERE last_name = 'Hassan'. Explain: To write a query that retrieves only data about people with the last name Hassan from the Clients table, type SELECT * FROM Clients WHERE last_name=’Hassan’.

Question 10

Scenario 2 continued

For your final question, your interviewer explains that Sewati Financial Services needs its clients’ trust, and this is an important responsibility for the data analytics team.

He asks you to identify which data analytics practice involves preserving a data subject’s information and activity any time a data transaction occurs.

A. Sharing permissions

B. Data privacy

C. Bias

D. Encryption

The correct answer is B. Data privacy. Explain: Data privacy involves preserving a data subject’s information and activity any time a data transaction occurs.

Coming up next...

Congratulations on completing the third course in the Google Data Analytics Certificate!

To make continuing with the program easy, go to the next course by clicking this link: Process Data from Dirty to Clean .

Keep up the great work!