4.1.6.Weekly challenge 1 - quanganh2001/Google-Data-Analytics-Professional-Certificate-Coursera GitHub Wiki

Glossary: Terms and definitions

We’ve covered a lot of terms—some of which you may have already known, and some of which are new. To make it easy to remember what a word means, we created this glossary of terms and definitions.

To use the glossary for this course item, click the link below and select “Use Template.”

Link to glossary: Week 1 Glossary

OR

If you don’t have a Google account, you can download the glossary directly from the attachment below.

Weekly challenge 1

1st grade

Grade received 87.50%

Question 1

Fill in the blank: As a data analyst, you need to verify that your data is _____ to ensure your analysis and conclusions are accurate.

A. manipulated and valid

B. complete and valid

C. private and valid

D. manipulated and replicated

The correct answer is B. complete and valid

Question 2

A company has multiple retail chain stores. Each store’s database is located onsite and used for various purposes. Which of the following processes could compromise data integrity?

A. Data gathering

B. Data cleaning

C. Data transfer

D. Data replication

The correct answer is D. Data replication

Question 3

As a data analyst, you are working for a national pizza restaurant chain. You have a dataset with monthly order totals for each branch over the past year. With only this data, what questions can you answer?

A. Which branch had the most orders in the last month of last year?

B. What was the most popular item on the menu?

C. Which branch will be the most profitable over the next year?

D. Which region had the highest sales over the last two years?

The correct answer is A. Which branch had the most orders in the last month of last year?

Question 4

A data analyst is given a dataset for analysis. To use the template for this dataset, click the link below and select “Use Template.”

Link to template: June 2014 Invoices

OR

If you don’t have a Google account, download the CSV file directly from the attachment below.

The analyst notices a limitation with the data in rows 8 and 9. What is the limitation?

A. Row 9 needs more data.

B. Row 8 is not in the correct format.

C. Row 9 is a duplicate of row 8.

D. Row 8 and row 9 show the wrong currency.

The correct answer is C. Row 9 is a duplicate of row 8.
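
A duplicate like the one in rows 8 and 9 can also be caught programmatically instead of by eye. The following is a minimal sketch in Python, not part of the course material: the column names, vendor names, and amounts are hypothetical and do not come from the June 2014 Invoices template, and the function name `find_duplicate_rows` is made up for illustration.

```python
import csv
import io

# Hypothetical invoice data; NOT the contents of the course template.
SAMPLE_CSV = """vendor,date,amount
Vendor A,5/20/2014,120.00
Vendor B,2/18/2014,75.50
Vendor B,2/18/2014,75.50
Vendor A,2/21/2014,310.25
"""


def find_duplicate_rows(csv_text):
    """Return (row_number, first_seen_row_number) pairs for exact duplicates.

    Row numbers follow spreadsheet numbering, where the header is row 1.
    """
    reader = csv.reader(io.StringIO(csv_text))
    next(reader)  # skip the header row
    first_seen = {}
    duplicates = []
    for row_number, row in enumerate(reader, start=2):
        key = tuple(row)
        if key in first_seen:
            duplicates.append((row_number, first_seen[key]))
        else:
            first_seen[key] = row_number
    return duplicates


print(find_duplicate_rows(SAMPLE_CSV))
```

This only flags rows that match exactly in every column; near-duplicates (for example, the same invoice with different spacing or capitalization) would need the values to be normalized first.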

Question 5

A data analyst is working on a project about the global supply chain. They have a dataset with lots of relevant data from Europe and Asia. However, they decide to generate new data that represents all continents. What type of insufficient data does this scenario describe?

A. Data that keeps updating

B. Data from only one source

C. Data that's geographically limited

D. Data that's outdated

The correct answer is C. Data that’s geographically limited

Question 6

In the data analysis process, how does a sample relate to a population?

A. A sample is an ideal example taken from a population.

B. A sample is a part of a population that is representative of the population.

C. A sample is an average of all the data that represents the population.

D. A sample is a duplicate selection of data that is taken from the population.

The correct answer is B. A sample is a part of a population that is representative of the population.

Question 7

A candy manufacturer finds an even distribution of sales across all age ranges of customers who purchase their products. The manufacturer decides to conduct a survey to learn more about its customer base. Due to age requirements, they can only send the survey to customers who are 21 years or older. This scenario can be described as what?

A. Upsampling bias

B. Sampling bias

C. Down sampling bias

D. Unbiased sampling

The correct answer is B. Sampling bias
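
To see why restricting the survey to customers 21 or older biases the sample, the sketch below compares the average age of a full customer list with the average age of the customers the survey can actually reach. The ages are hypothetical values chosen for illustration, not data from the question.

```python
from statistics import mean

# Hypothetical customer ages, spread across all age ranges.
customer_ages = [8, 12, 16, 20, 24, 30, 38, 46, 55, 67]

# The survey can only be sent to customers who are 21 or older.
surveyed_ages = [age for age in customer_ages if age >= 21]

population_mean = mean(customer_ages)
surveyed_mean = mean(surveyed_ages)

print(f"All customers:      {len(customer_ages)} people, mean age {population_mean:.1f}")
print(f"Surveyed customers: {len(surveyed_ages)} people, mean age {surveyed_mean:.1f}")
```

Because every customer under 21 is excluded, the surveyed group is older on average than the customer base, so its answers cannot be assumed to represent all customers.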

Question 8

What best describes a sample size?

A. A subset of the population between the 25th and 50th percentile

B. A subset that is representative of the population as a whole

C. A subset of the population excluding outliers

D. A random subset of the population

The correct answer is B. A subset that is representative of the population as a whole

2nd grade

Grade received 96.87%

Question 1

Fill in the blank: In order to have a strong and thorough analysis, a data analyst must verify _____.

A. data replication

B. data integrity

C. data manipulation

D. data engineering

The correct answer is B. data integrity

Question 2

A data analyst needs to migrate data from a server located at their company's headquarters to a remote site. This can lead to what type of data integrity issue?

A. Data manipulation

B. Data replication

C. Data transfer

D. Data cleaning

Question 3

As a data analyst, you are working for a national pizza restaurant chain. You have a dataset with monthly order totals for each branch over the past year. With only this data, what questions can you answer?

A. Which branch had the most orders in the last month of last year?

B. Which branch will be the most profitable over the next year?

C. Which region had the highest sales over the last two years?

D. What was the most popular item on the menu?

The correct answer is A. Which branch had the most orders in the last month of last year?

Question 4

A data analyst is given a dataset for analysis. To use the template for this dataset, click the link below and select “Use Template.”

Link to template: June 2014 Invoices

OR

If you don’t have a Google account, download the CSV file directly from the attachment below.

Which of the following has duplicate data?

A. Data for Symteco on 5/20/2014

B. Data for Valando on 2/18/2014

C. Data for Symteco on 2/21/2014

D. Data for Valando on 1/1/2014

The correct answer is B. Data for Valando on 2/18/2014

Question 5

A data analyst is working on a project about the global supply chain. They have a dataset with lots of relevant data from Europe and Asia. However, they decide to generate new data that represents all continents. What type of insufficient data does this scenario describe?

A. Data from only one source

B. Data that keeps updating

C. Data that’s outdated

D. Data that’s geographically limited

The correct answer is D. Data that’s geographically limited

Question 6

A car manufacturer wants to learn more about the brand preferences of electric car owners. There are millions of electric car owners in the world. Who should the company survey?

A. A sample of all electric car owners

B. The entire population of electric car owners

C. A sample of car owners who have owned more than one electric car

D. A sample of car owners who most recently bought an electric car

The correct answer is A. A sample of all electric car owners
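
When the population is too large to survey in full, one common way to pick a sample is simple random sampling, where every member has the same chance of being chosen. The sketch below assumes the owners can be listed by ID; the ID range, the sample size of 1,000, and the function name `draw_simple_random_sample` are all hypothetical.

```python
import random


def draw_simple_random_sample(population, sample_size, seed=None):
    """Pick sample_size distinct members, each with equal probability."""
    rng = random.Random(seed)  # a fixed seed makes the draw repeatable
    return rng.sample(population, sample_size)


# Hypothetical stand-in for a list of one million electric car owner IDs.
owner_ids = list(range(1, 1_000_001))

survey_group = draw_simple_random_sample(owner_ids, 1000, seed=42)
print(len(survey_group))
```

Drawing from all owners, rather than only repeat buyers or recent buyers, is what keeps the sample from favoring one subgroup.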

Question 7

A high school principal is estimating the total number of students that will attend an upcoming event. She assumes that the older students are unlikely to attend and decides to only survey the first-year students. What issue will the principal face when calculating her estimation?

A. The sample should be the older students.

B. The sample exhibits sampling randomness.

C. The sample is too small.

D. The sample exhibits sampling bias.

The correct answer is D. The sample exhibits sampling bias.

Question 8

Which of the following processes helps ensure a close alignment of data and business objectives?

A. Completing data replication

B. Transferring data multiple times

C. Having data update automatically during analysis

D. Maintaining data integrity

The correct answer is D. Maintaining data integrity

3rd grade

Grade received 96.87%

Question 1

Fill in the blank: As a data analyst, you need to verify that your data is _____ to ensure your analysis and conclusions are accurate.

A. manipulated and valid

B. private and valid

C. manipulated and replicated

D. complete and valid

The correct answer is D. complete and valid

Question 2

A financial analyst imports a dataset to their computer from a storage device. As it’s being imported, the connection is interrupted, which compromises the data. Which of the following processes caused the compromise?

A. Data analysis

B. Data manipulation

C. Data transfer

D. Data gathering

The correct answer is C. Data transfer
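
One common way to detect this kind of compromise is to compare a checksum of the source data with a checksum of what arrived. The sketch below is an illustration under assumptions, not a method named in the course: the byte strings stand in for real files, and the helper names are hypothetical.

```python
import hashlib


def sha256_hex(data: bytes) -> str:
    """Return the SHA-256 digest of data as a hexadecimal string."""
    return hashlib.sha256(data).hexdigest()


def transfer_is_intact(source: bytes, received: bytes) -> bool:
    """True when the received bytes hash to the same value as the source."""
    return sha256_hex(source) == sha256_hex(received)


# Hypothetical dataset contents.
original = b"invoice_id,amount\n1001,250.00\n1002,99.95\n"

# Simulate an interrupted connection by keeping only the first 20 bytes.
truncated = original[:20]

print(transfer_is_intact(original, original))   # complete copy
print(transfer_is_intact(original, truncated))  # interrupted copy
```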

Question 3

A data analyst is given a dataset for analysis. It includes data only about the total population of every country in the previous 20 years. Based on the available data, an analyst would have the full picture and be able to determine the reasons behind a certain country's population increase from 2016 to 2017. True or False?

A. True

B. False

The correct answer is B. False

Question 4

A data analyst is given a dataset for analysis. To use the template for this dataset, click the link below and select “Use Template.”

Link to template: June 2014 Invoices

OR

If you don’t have a Google account, download the CSV file directly from the attachment below.

The analyst notices a limitation with the data in rows 8 and 9. What is the limitation?

A. Row 9 needs more data.

B. Row 8 and row 9 show the wrong currency.

C. Row 8 is not in the correct format.

D. Row 9 is a duplicate of row 8.

The correct answer is D. Row 9 is a duplicate of row 8.

Question 5

A data analyst at a software company wants to learn more about industry competitors. Because the software industry has more mergers than any other field, the companies and their products are constantly evolving. The analyst has a dataset from three years ago, and they notice that many of the companies and products in the dataset have changed. What makes the analyst decide that the data is insufficient, so they should generate fresh data instead?

A. It is data that keeps updating.

B. It is data from only one source.

C. It is geographically limited data.

D. It is outdated data.

The correct answer is D. It is outdated data.

Question 6

A company is trying to learn more about their customer base. They would like to conduct a survey to understand why their customers chose their brand. How should the company survey its customers?

A. Conduct a survey of customers who purchased a different brand

B. Conduct a survey of customers that live in high-income areas

C. Conduct a survey with a representative sample of their customer population

D. Conduct a survey with customers who have purchased more than five products

The correct answer is C. Conduct a survey with a representative sample of their customer population

Question 7

A car dealership gathers data about their entire customer population. They decide to conduct a survey to understand why their customers chose their dealership. They send out an email to all customers who have purchased more than two vehicles in the past five years. What does this scenario describe?

A. Geographically limited sampling

B. Sampling bias

C. Unbiased sampling

D. Random sampling

The correct answer is B. Sampling bias

Question 8

A data analyst retrieves a sample of their data that is roughly representative of the population as a whole. They realize that there will be some error in their sample results because they didn’t sample the entire population. What is this error called?

A. Mean squared error

B. Sampling error

C. Population error

D. Margin of error

The correct answer is D. Margin of error
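
The question does not give a formula, but margin of error is often estimated for a survey proportion with the normal approximation, z × sqrt(p(1 − p) / n). The sketch below assumes that formula and a z-score of 1.96 (roughly 95% confidence); the sample proportion and sample sizes are hypothetical.

```python
import math


def margin_of_error(sample_proportion, sample_size, z_score=1.96):
    """Normal-approximation margin of error for a sample proportion.

    z_score=1.96 corresponds to roughly 95% confidence (an assumption here).
    """
    if sample_size <= 0:
        raise ValueError("sample_size must be positive")
    if not 0 <= sample_proportion <= 1:
        raise ValueError("sample_proportion must be between 0 and 1")
    variance = sample_proportion * (1 - sample_proportion) / sample_size
    return z_score * math.sqrt(variance)


# Hypothetical survey: half of the respondents answered "yes".
for n in (100, 400, 1600):
    print(f"n={n}: margin of error = {margin_of_error(0.5, n):.3f}")
```

Under this approximation, quadrupling the sample size halves the margin of error, which is why larger samples give results closer to the population.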

4th grade

Grade received 100%

Question 1

Fill in the blank: Data _____ refers to the accuracy, completeness, consistency, and trustworthiness of data throughout its life cycle.

A. integrity

B. analysis

C. replication

D. sampling

The correct answer is A. integrity. Data integrity refers to the accuracy, completeness, consistency, and trustworthiness of data throughout its life cycle.

Question 2

A healthcare company keeps copies of their data at several locations across the country. The data becomes compromised because each location creates a copy of the original at different times of day. Which of the following processes caused the compromise?

A. Data manipulation

B. Data transfer

C. Data gathering

D. Data replication

The correct answer is D. Data replication

Question 3

A data analyst is given a dataset for analysis. It includes data about the total population of every country in the previous 20 years. Which of the following questions would the analyst need more data to address?

A. Which country had the smallest population in 2017?

B. What was the population of a certain country in 2020?

C. Which country had the greatest population in 2015?

D. What was the reason for the population increase in a certain country?

The correct answer is D. What was the reason for the population increase in a certain country?

Question 4

A data analyst is given a dataset for analysis. To use the template for this dataset, click the link below and select “Use Template.”

Link to template: June 2014 Invoices

OR

If you don’t have a Google account, download the CSV file directly from the attachment below.

Which of the following has duplicate data?

A. Data for Valando on 1/1/2014

B. Data for Symteco on 2/21/2014

C. Data for Valando on 2/18/2014

D. Data for Symteco on 5/20/2014

The correct answer is C. Data for Valando on 2/18/2014

Question 5

A data analyst wants to predict the production output of a factory using a dataset that covers the years 2020 to 2021. In 2022, the factory implemented major labor and facility changes. What limitation of the data means that the analyst needs to get new data?

A. The data is outdated.

B. The data is from only one source.

C. The data is geographically limited.

D. The data keeps updating.

The correct answer is A. The data is outdated.

Question 6

A company is trying to learn more about their customer base. They would like to conduct a survey to understand why their customers chose their brand. How should the company survey its customers?

A. Conduct a survey with a representative sample of their customer population

B. Conduct a survey of customers who purchased a different brand

C. Conduct a survey of customers that live in high-income areas

D. Conduct a survey with customers who have purchased more than five products

The correct answer is A. Conduct a survey with a representative sample of their customer population

Question 7

A restaurant gathers data about a new dish by providing free samples to parties of six or more diners. What does this scenario describe?

A. Sampling bias

B. Unbiased sampling

C. Geographically limited sampling

D. Random sampling

The correct answer is A. Sampling bias

Question 8

What best describes a sample size?

A. A random subset of the population

B. A subset of the population excluding outliers

C. A subset that is representative of the population as a whole

D. A subset of the population between the 25th and 50th percentile

The correct answer is C. A subset that is representative of the population as a whole