60. HuggingFaces - yojulab/learn_deeplearning GitHub Wiki

  • ๋จธ์‹ ๋Ÿฌ๋‹ ๊ธฐ์ˆ ์„ ์ œ๊ณตํ•˜๋Š” ์˜คํ”ˆ์†Œ์Šค ๋ฐ ํ”Œ๋žซํผ ์ œ๊ณต์—…์ฒด, ๋ชจ๋ธ์„ ํ›ˆ๋ จํ•˜๊ณ  ํ…Œ์ŠคํŠธํ•˜๋Š”๋ฐ ๊ฑธ๋ฆฌ๋Š” ์‹œ๊ฐ„์ด ์ƒ๋‹นํžˆ ๋‹จ์ฃฝ
  • ์‚ฌ์ „ ํ•™์Šต๋œ ์–ธ์–ด ๋ชจ ๋ธ, ๊ทธ๋ฆฌ๊ณ  image, ์˜ค๋””์˜ค ๋“ฑ ๋‹ค์–‘ํ•œ ๋ชจ๋ธ์„ ์ œ๊ณต
  • offical
  • youtube- HuggingFaces

Dataset

Tokenizers

  • (1) ๋ฌธ์ž ํ† ํฐํ™”(character tokenization)
  • (2)๋‹จ์–ด ํ† ํฐํ™”(word tokenization)
  • (3) ํ•˜์œ„ ๋‹จ์œ„ ํ† ํฐํ™”(subword tokenization)

Gradio

  • Build and share machine learning demos and web applications using the core Gradio Python library.

ex) ์‹ค์Šต ์ฝ”๋“œ(with classfication) : https://dreamfactory100.tistory.com/49

Models

Transformer

Text classification(๊ฐ์„ฑ๋ถ„์„)

Named entity recognition(NER)

์งˆ์˜์‘๋‹ต, ์š”์•ฝ, ๋ฒˆ์—ญ, ํ…์ŠคํŠธ ์ƒ์„ฑ ๋“ฑ

์„ฑ๋Šฅ์ธก์ •์ง€ํ‘œ(Performance Measures) : seqeval Lib

ํ…์ŠคํŠธ ์ƒ์„ฑ(Text Generation) : Greedy Search Decoding, Beam Search Decoding

  • ์–ธ๋”ํ”Œ๋กœ์šฐ๋Š” ์ˆ˜์น˜์  ๋ถˆ์•ˆ์ •์„ฑ, ๋กœ๊ทธ ํ™•๋ฅ  ์‚ฌ์šฉํ•ด ๋ฌธ์ œ ํ•ด๊ฒฐ
  • ๋กœ๊ทธ ํ™•๋ฅ  ์‚ฌ์šฉ vs ๋กœ๊ทธ ํ™•๋ฅ  ๋ฏธ์‚ฌ์šฉ

Pretrained model

  • ๋ชจ๋ธ ํ›ˆ๋ จ ์˜ต์…˜ : ํŠน์„ฑ ์ถ”์ถœ(Feature extraction), ๋ฏธ์„ธ ์กฐ์ •(Fine-tuningใ€‹)

Stable Diffusion