2021 10 18 - KR-HappyFace/meetup-logs GitHub Wiki

  • DPR ๋…ผ๋ฌธ์— ์œ ์˜๋ฏธํ•œ ๋‚ด์šฉ์ด ๋งŽ์€ ๊ฒƒ ๊ฐ™๋‹ค: ์„ฑ์šฑ
  • Entity marker ์ถ”๊ฐ€ํ•˜๊ธฐ: ์ค€ํ™
  • sparse vs dense ์‹คํ—˜: ์žฌ์˜
    • Baseline vs DPR mean one encoder. ์ด๊ฑด ๊นƒํ—™์— ์—…๋กœ๋“œํ•ด๋†“๊ฒ ๋‹ค.
    • ํ‰๊ท  ๋‚ด๋Š” ๋ฐฉ๋ฒ•์ด ์ข‹์€ ๊ฒƒ ๊ฐ™์€๋ฐ, pad token ๋นผ๊ณ  ํ•˜๋Š” ๊ฑด ์–ด๋–ป๊ฒŒ ํ•˜๋Š” ๊ฒŒ ์ข‹์ง€ ์•Š์„๊นŒ. ๋ฌธ์žฅ์ด ๊ธธ์–ด์„œ ์™ ๋งŒํ•˜๋ฉด ๊ฝ‰๊ฝ‰ ์ฑ„์›Œ์„œ ๋“ค์–ด๊ฐˆ ๊ฒƒ ๊ฐ™์€๋ฐ. Padding์ด ๋งŽ์ด ๋“ค์–ด๊ฐ„ ๊ฒƒ๋“ค์€ padding ์ œ์™ธํ•˜๊ณ  ํ‰๊ท  ๋‚ด๋ฉด ์ข‹์„ ๊ฒƒ ๊ฐ™๋‹ค๊ณ  ์ƒ๊ฐํ•ด๋ดค์Šต๋‹ˆ๋‹ค.
  • MRC ์ชฝ์€ Custom Model ๋ฐฉ๋ฒ•๋ก ์ด ์žˆ๋Š” ๊ฒŒ ์•„๋‹ˆ๋ผ Big Bird ๊ฐ™์€ Pretrained ๋œ ๋ชจ๋ธ์ด ์žˆ๋”๋ผ๊ณ ์š”. Long BERT ๊ฐ™์€ ๊ฑด ํ•œ๊ตญ์–ด๋กœ ์žˆ์ง€ ์•Š์„๊นŒ ์ƒ๊ฐํ•˜๊ธด ํ–ˆ๋Š”๋ฐ.
    • Pretrain์€ ๋ชป์‹œํ‚ค๋‚˜? ๊ฐ€๋Šฅ์€ ํ•  ๊ฒƒ ๊ฐ™์€๋ฐ. ๋ง˜ ๊ฐ™์•„์„œ๋Š” KLUE MRC ์“ฐ๊ณ  ์‹ถ๋„ค์š”.
  • [MASK]๋ฅผ ๋žœ๋ค์œผ๋กœ ์”Œ์›Œ๋ณด๋ ค๋Š” ์‹คํ—˜์„ ํ•˜๊ณ  ์žˆ์Œ.
  • Retriever์€ ๊ทธ๋ž˜๋„ ์–ด๋–ป๊ฒŒ ํ•ด๋ณผ ์ˆ˜ ์žˆ๋Š”๋ฐ, Reader์€ ์–ด๋–ป๊ฒŒ ์„ฑ๋Šฅ ํ–ฅ์ƒ์„ ์‹œํ‚ฌ์ง€ ๋ชจ๋ฅด๊ฒ ๋‹ค.
  • Retriever์œผ๋กœ๋ถ€ํ„ฐ ์˜จ passage๋“ค ์ค‘์—์„œ ๋ช‡ ๊ฐœ๋ฅผ ์šฐ๋ฆฌ๊ฐ€ ๋‹ค์‹œ ์‚ฌ์šฉ์„ ํ•  ๊ฒƒ์ธ์ง€, negative passage๋กœ ์‚ฌ์šฉ์„ ํ•  ๊ฒƒ์ธ์ง€ hyperparameter๋กœ ์‚ฌ์šฉํ•  ์ˆ˜ ์žˆ๋‹ค.
  • T5๋Š” ๊ณ ์œ ๋ช…์‚ฌ์—๋‹ค๊ฐ€ Masking์„ ํ•˜๋”๋ผ๊ณ ์š”. ๊ทธ๋ ‡๊ฒŒ ๊ณ ์œ ๋ช…์‚ฌ๋ฅผ Maskingํ•ด์„œ ๊ณ ์œ ๋ช…์‚ฌ๋ฅผ ๋งž์ถ”๊ฒŒ ํ•˜๋ฉด ๊ทธ๋Ÿฐ ๋ฌธ๋งฅ์ด ๊ฐ–๊ณ  ์žˆ๋Š” ์ •๋ณด๋Š” ๋” ๊ฐ–๊ณ  ์˜ค์ง€ ์•Š์„๊นŒ?
  • Special Mission 2 Generation Based MRC์—์„œ ๋ฌธ์ œ๊ฐ€ ์žˆ๋Š” ๊ฒƒ ๊ฐ™๋‹ค.