Python - jjin-choi/study_note GitHub Wiki

https://webcache.googleusercontent.com/search?q=cache:KvDHc7ueugsJ:https://www.holaxprogramming.com/2017/06/28/python-project-structures/+&cd=9&hl=ko&ct=clnk&gl=kr

μ •λ¦¬ν•˜κΈ°

200824

Data Analysis

Β§ EDA (Exploratory Data Analysis)

  • import data β†’ check data shape β†’ check data type β†’ check NULL β†’ check μ’…μ†λ³€μˆ˜μ˜ 뢄포 β†’ λ…λ¦½λ³€μˆ˜ - λͺ…λͺ©ν˜• λ³€μˆ˜ 뢄포 β†’ λ…λ¦½λ³€μˆ˜ - μˆ˜μΉ˜ν˜• λ³€μˆ˜ 뢄포 β†’ μˆ˜μΉ˜ν˜• λͺ…λͺ©ν˜• λ³€μˆ˜ κ°„ 관계 νŒŒμ•…

    • 쒅속 λ³€μˆ˜ : λ‹€λ₯Έ λ³€μˆ˜λ“€μ˜ 관계λ₯Ό 주둜 μΆ”λ‘ ν•˜κ³  μ΅œμ’…μ μœΌλ‘œ μ˜ˆμΈ‘ν•˜κ³ μž ν•˜λŠ” λ³€μˆ˜
    • λͺ…λͺ©ν˜• λ³€μˆ˜ : μΉ΄ν…Œκ³ λ¦¬ μˆ˜κ°€ λ„ˆλ¬΄ λ§Žκ±°λ‚˜ 쒅속 λ³€μˆ˜μ™€ 관련성이 적어 보일 경우 μ œμ™Έν•˜κ³  뢄석
    • λ‹¨λ³€μˆ˜ 탐색 : seaborn - distplot
  • κ΄€λ ¨ module : numpy / pandas / matplotlib / seaborn

Β§ Feature Engineering

// // Β§ Stratified sampling

Performance metrics

Β§