1.1.1.Defining Data Science - sj50179/IBM-Data-Science-Professional-Certificate GitHub Wiki

Fundamentals of Data Science

Data science can help organizations understand their environments, analyze existing issues, and reveal previously hidden opportunities.

데이터 과학은 각 쑰직이 쑰직 ν™˜κ²½μ„ μ΄ν•΄ν•˜κ³  ν˜„μ‘΄ν•˜λŠ” 문제λ₯Ό λΆ„μ„ν•˜κ³ , μ˜ˆμ „μ—λŠ” 숨겨져 μžˆμ—ˆλ˜ κΈ°νšŒλ“€μ„ 찾아내도둝 도와쀀닀.

Data scientists use data analysis to add to the knowledge of the organization by investigating data, exploring the best way to use it to provide value to the business.

데이터 κ³Όν•™μžλ“€μ€ 쑰직의 지식을 λŠ˜λ¦¬λŠ” 데 데이터 뢄석을 μ΄μš©ν•œλ‹€. 이λ₯Ό μœ„ν•΄ 데이터λ₯Ό μ‘°μ‚¬ν•˜κ³ , 데이터 뢄석을 μ΄μš©ν•œ μ΅œμƒμ˜ 방법을 νƒμƒ‰ν•˜μ—¬ 사업에 도움을 μ€€λ‹€.

Good data scientists are curious people who ask questions to clarify the business need.

쒋은 데이터 κ³Όν•™μžλ“€μ€ λΉ„μ¦ˆλ‹ˆμŠ€ μš”κ΅¬λ₯Ό νŒŒμ•…ν•˜λ €κ³  μ§ˆλ¬Έν•˜λŠ” ν˜ΈκΈ°μ‹¬ λ§Žμ€ μ‚¬λžŒλ“€μ΄λ‹€.

The next questions are: "what data do we need to solve the problem, and where will that data come from?".

ν•„μš”ν•œ μ§ˆλ¬Έμ€ "문제 해결을 μœ„ν•΄ ν•„μš”ν•œ λ°μ΄ν„°λŠ” 무엇인가?"와 "μ–΄λ””μ„œ 데이터λ₯Ό μˆ˜μ§‘ν•΄μ•Όν•˜λŠ”κ°€?"이닀.

Data scientists can analyze structured and unstructured data from many sources, and depending on the nature of the problem, they can choose to analyze the data in different ways.

데이터 κ³Όν•™μžλ“€μ€ λ§Žμ€ μžλ£Œλ“€λ‘œλΆ€ν„° μˆ˜μ§‘ν•œ μ •ν˜• 데이터와 λΉ„μ •ν˜• 데이터λ₯Ό 뢄석할 수 μžˆλ‹€. λ˜ν•œ 그듀은 문제점의 νŠΉμ„±μ— 따라 λ‹€λ₯Έ 방법을 μ΄μš©ν•΄μ„œ 데이터λ₯Ό 뢄석할 수 있고, μ—¬λŸ¬ λͺ¨λΈμ„ μ΄μš©ν•˜μ—¬ 데이터λ₯Ό νƒμƒ‰ν•˜κ³  νŒ¨ν„΄κ³Ό 특이치λ₯Ό μ°Ύμ•„λ‚Έλ‹€.


Advice for New Data Scientists

My advice to an aspiring data scientist is to be curious, extremely argumentative and judgmental. Curiosity is absolute must. If you're not curious, you would not know what to do with the data. Judgmental because if you do not have preconceived notions about things you wouldn't know where to begin with. Argumentative because if you can argument and if you can plead a case, at least you can start somewhere and then you learn from data and then you modify your assumptions and hypotheses and your data would help you learn. And you may start at the wrong point. You may say that I thought I believed this, but now with data I know this. So, this allows you a learning process. So, curiosity being able to take a position, strong position, and then moving forward with it.

"μž₯μ°¨ 데이터 κ³Όν•™μžκ°€ λ˜λ €λŠ” λΆ„λ“€μ—κ²Œ λ“œλ¦¬κ³  싢은 쑰언은 ν˜ΈκΈ°μ‹¬μ„ κ°–κ³ , 맀우 λ…ΌμŸμ μ΄λ©°, λΉ„νŒμ μ΄μ–΄μ•Ό ν•œλ‹€λŠ” κ²ƒμž…λ‹ˆλ‹€. ν˜ΈκΈ°μ‹¬μ€ λ°˜λ“œμ‹œ ν•„μš”ν•©λ‹ˆλ‹€. ν˜ΈκΈ°μ‹¬μ„ λŠλΌμ§€ μ•ŠλŠ”λ‹€λ©΄, λ°μ΄ν„°λ‘œ 무엇을 ν•΄μ•Όν•  지 λͺ¨λ₯Ό κ²ƒμž…λ‹ˆλ‹€. μ™œ λΉ„νŒμ μ΄μ–΄μ•Ό ν•˜λ‚˜λ©΄, κ·Έ 일이 μ–΄λ–¨ κ²ƒμ΄λΌλŠ” 생각을 ν•˜μ§€ μ•ŠμœΌλ©΄ μ–΄λ””μ„œλΆ€ν„° μ‹œμž‘ν•΄μ•Ό 할지 λͺ¨λ₯΄κΈ° λ•Œλ¬Έμ΄μ£ . 또 λ…ΌμŸμ μ΄μ–΄μ•Ό ν•˜λŠ”λ°, μ—¬λŸ¬λΆ„μ΄ λ…ΌμŸμ„ ν•  수 있고, μ–΄λ–€ κ²½μš°μ— λŒ€ν•΄ λ‹΅λ³€ν•  수 μžˆλ‹€λ©΄ 적어도 μ—¬λŸ¬λΆ„μ€ μ–΄λ””μ„œλΌλ„ μ‹œμž‘ν•΄μ„œ λ°μ΄ν„°λ‘œλΆ€ν„° 배울 수 있기 λ•Œλ¬Έμž…λ‹ˆλ‹€. μ—¬λŸ¬λΆ„μ˜ κ°€μ •κ³Ό 가섀을 μˆ˜μ •ν•  수 있고, λ°μ΄ν„°λŠ” μ—¬λŸ¬λΆ„μ˜ ν•™μŠ΅μ— 도움이 될 κ²λ‹ˆλ‹€. 잘λͺ»λœ μ§€μ μ—μ„œ μ‹œμž‘ν•  μˆ˜λ„ μžˆμŠ΅λ‹ˆλ‹€. μ—¬λŸ¬λΆ„μ€ μ•„λ§ˆλ„ "λ‚˜λŠ” 이럴 것이라고 μƒκ°ν–ˆμ–΄ ν•˜μ§€λ§Œ 이젠 데이터λ₯Ό ν†΅ν•΄μ„œ, 이걸 μ•Œκ²Œ 됐어"라고 λ§ν•˜κ² μ£ . κ·Έλž˜μ„œ μ—¬λŸ¬λΆ„μ€ 이걸 톡해 ν•™μŠ΅ 과정을 μ•Œ 수 μžˆμŠ΅λ‹ˆλ‹€. λ”°λΌμ„œ, ν˜ΈκΈ°μ‹¬μ€ μ—¬λŸ¬λΆ„μ΄ ν™•κ³ ν•œ ν¬μ§€μ…˜μ„ μ·¨ν•˜κ³  κ±°κΈ°μ„œλΆ€ν„° μ•žμœΌλ‘œ λ‚˜μ•„κ°€λ„λ‘ ν•©λ‹ˆλ‹€."

So figure out first what you're interested, and what is your competitive advantage. Your competitive advantage is not necessarily going to be your analytical skills. Your competitive advantage is your understanding of some aspect of life where you exceed beyond others in understanding that. Maybe it's film, maybe it's retail, maybe it's health, maybe it's computers. Once you've figured out where your expertise lies, then you start acquiring analytical skills. What platforms to learn and those platforms, those tools would be specific to the industry that you're interested in. And then once you have got some proficiency in the tools, the next thing would be to apply your skills to real problems, and then tell the rest of the world what you can do with it.

"μš°μ„  μ—¬λŸ¬λΆ„μ΄ 관심 μžˆλŠ” 것이 무엇인지, μ—¬λŸ¬λΆ„μ˜ 경쟁 μš°μœ„λŠ” 무엇인지 μ°Ύμ•„λ³΄μ‹­μ‹œμ˜€. 경쟁 μš°μœ„κ°€ λ°˜λ“œμ‹œ 뢄석 κΈ°μˆ μ΄μ–΄μ•Ό ν•  ν•„μš”λŠ” μ—†μŠ΅λ‹ˆλ‹€. μ—¬λŸ¬λΆ„μ˜ 경쟁 μš°μœ„λŠ” μ–΄λ–€ 양상을 μ΄ν•΄ν•˜λŠ” 데 μžˆμ–΄μ„œ λ‹€λ₯Έ μ‚¬λžŒλ“€μ„ λŠ₯κ°€ν•˜λŠ” κ²ƒμž…λ‹ˆλ‹€. μ΄λŠ” μ˜ν™”μΌ μˆ˜λ„ 있고, μ†Œλ§€μΌ μˆ˜λ„ 있고, 보건일 μˆ˜λ„ 있고, 컴퓨터일 μˆ˜λ„ μžˆμŠ΅λ‹ˆλ‹€. 일단 μ—¬λŸ¬λΆ„μ˜ μ „λ¬Έ 지식이 무엇인지 μ°Ύκ³  λ‚˜μ„œ 뢄석 κΈ°μˆ μ„ μŠ΅λ“ν•˜κΈ° μ‹œμž‘ν•˜λ©΄ λ©λ‹ˆλ‹€. λ°°μ›Œμ•Ό ν•˜λŠ” ν”Œλž«νΌλ“€κ³Ό 도ꡬ듀은 μ—¬λŸ¬λΆ„μ΄ 관심 μžˆλŠ” 산업에 따라 λ‹€λ₯Ό κ²λ‹ˆλ‹€. κ·Έλ ‡κ²Œ μ—¬λŸ¬λΆ„μ΄ κ·Έ 도ꡬ듀에 λŠ₯μˆ™ν•΄μ§€λ©΄, μ—¬λŸ¬λΆ„μ˜ κΈ°μˆ μ„ μ‹€μ œ λ¬Έμ œμ— μ μš©ν•΄ λ³΄μ‹­μ‹œμ˜€. 그리고 μ „ 세계에 μ—¬λŸ¬λΆ„μ΄ 데이터λ₯Ό μ΄μš©ν•΄ 무엇을 ν•  수 μžˆλŠ”μ§€ λ§ν•˜μ‹­μ‹œμ˜€."


Lesson Summary

In this lesson, I have learned:

  • Data science is the study of large quantities of data, which can reveal insights that help organizations make strategic choices. 데이터 과학은 쑰직이 μ „λž΅μ  선택을 ν•˜λŠ”λ° λ„μ›€μ΄λ˜λŠ” 톡찰λ ₯을 보여쀄 수 있게 λŒ€λŸ‰μ˜ 데이터λ₯Ό μ—°κ΅¬ν•˜λŠ” 학문이닀.

  • There are many paths to a career in data science; most, but not all, involve a little math, a little science, and a lot of curiosity about data. 데이터 κ³Όν•™ λΆ„μ•Όμ—λŠ” λ§Žμ€ 길이 μžˆλ‹€; μ „λΆ€κΉŒμ§€λŠ” μ•„λ‹ˆμ§€λ§Œ, λŒ€λΆ€λΆ„μ—, μ•½κ°„μ˜ μˆ˜ν•™, μ•½κ°„μ˜ κ³Όν•™, 그리고 데이터에 λŒ€ν•œ λ§Žμ€ ν˜ΈκΈ°μ‹¬μ„ μˆ˜λ°˜ν•œλ‹€.

  • New data scientists need to be curious, judgemental and argumentative. μƒˆλ‘œμš΄ 데이터 κ³Όν•™μžλ“€μ€ ν˜ΈκΈ°μ‹¬ 많고, νŒλ‹¨λ ₯ 있고, λ…ΌμŸμ μ΄μ–΄μ•Ό ν•œλ‹€.

  • Why data science is considered the sexiest job in the 21st century, paying high salaries for skilled workers. 데이터 과학은 21세기에 κ°€μž₯ μ„Ήμ‹œν•œ μ§μ—…μœΌλ‘œ 여겨지며, μˆ™λ ¨λœ λ…Έλ™μžλ“€μ—κ²Œ 높은 κΈ‰μ—¬λ₯Ό μ§€κΈ‰ν•œλ‹€.


Quiz