๐Ÿค– ์ธ๊ณต์ง€๋Šฅ ๊ฐœ๋ก  ์ •๋ฆฌ

์ดํฌ์ œยท2021๋…„ 6์›” 28์ผ
3

AI

๋ชฉ๋ก ๋ณด๊ธฐ
2/5
post-thumbnail

์ธ๊ณต์ง€๋Šฅ ๊ฐœ๋ก 

  • 2016๋…„์— ์•ŒํŒŒ๊ณ  ๋‚˜์˜ค๊ณ  ์ธ๊ณต์ง€๋Šฅ ์ธ๊ธฐ๊ฐ€ ํญ๋ฐœ์ ์œผ๋กœ ์ฆ๊ฐ€ํ–ˆ๋‹ค.

  • ์ธ๊ณต์ง€๋Šฅ => ์—ฌ๋Ÿฌ ๋ถ„์•ผ์— ํญ ๋„“๊ฒŒ ์ ์šฉ๋  ์ˆ˜ ์žˆ๋‹ค.

  • rule-based AI => ๊ทœ์น™์— ์ ํ•ฉํ•˜๊ฒŒ ์ฒ˜๋ฆฌํ•˜๋Š”์ง€๊ฐ€ ๊ฐ€์žฅ ์ค‘์š”ํ•˜๋‹ค.

  • ๋จธ์‹ ๋Ÿฌ๋‹ => ๋ชจ๋ธ์€ ๋งŒ๋“ค์–ด ๋†“๊ณ  ๋ฐ์ดํ„ฐ์˜ ๊ทœ์น™์„ ์ฐพ๋Š” ๊ฒƒ์ด๋‹ค.

    • ๋ฐ์ดํ„ฐ ๊ณผํ•™์ด ์ฃผ๋ชฉ๋ฐ›๊ณ  ์žˆ๋‹ค.

๋”ฅ๋Ÿฌ๋‹

  1. ๋ชจ๋ธ์„ ๋งŒ๋“œ๋Š” ๋‹จ๊ณ„ (๊ป๋ฐ๊ธฐ๋ฅผ ๋งŒ๋“ค์–ด ์ค€๋‹ค.)
  2. ํ•™์Šต(with Large-Scale Dataset)
  3. Interence/Testing (Real-World Execution) => ์ถ”๋ก 

Issue - Overfitting

  • ๋ฐ์ดํ„ฐ ์ž์ฒด๊ฐ€ ๋ถ€์กฑํ•œ ๊ฒฝ์šฐ
    • ๋”ฐ๋ผ์„œ ๋ฐ์ดํ„ฐ ๊ณผํ•™์ด ๋งค์šฐ ์ค‘์š”ํ•˜๋‹ค.

CNN

  • ์ด๋ฏธ์ง€๋ฅผ ์ฒ˜๋ฆฌํ•  ์ˆ˜ ์žˆ๋‹ค.
  • ์‹œ๊ฐ์  ์ •๋ณด ์ฒ˜๋ฆฌ

RNN

  • ์‚ฌ์Šฌ์ฒ˜๋Ÿผ ์—ฐ๊ฒฐ์ด ๋˜์–ด ์‹œ๊ฐ„์— ๋Œ€ํ•œ ์ •๋ณด๋ฅผ ์•Œ ์ˆ˜ ์žˆ๋‹ค.
  • ์‹œ๊ฐ„์ด ๋“ค์–ด๊ฐ„ ์ •๋ณด ์ฒ˜๋ฆฌ

Visual Learning

  • Object Recognition
  • Style Trasfer -> ์ด๋ฏธ์ง€ ์ฒ˜๋ฆฌ ๊ธฐ๋ฐ˜
  • Deblurring and Denoising -> ๋…ธ์ด์ฆˆ ์ œ๊ฑฐ ํ›„ ํ•ด์ƒ๋„ ๋ณต์›
  • Super-Resolution -> ๋น„๋””์˜ค ํ•ด์ƒ๋„๋ฅผ ์˜ฌ๋ฆฐ๋‹ค.
  • Identification

GAN - Generative Adversarial Network

  • ๋ฌด์–ธ๊ฐ€๋ฅผ ์ƒ์„ฑํ•˜๋Š” ๊ธฐ์ˆ ์ด๋‹ค. ์ƒˆ๋กœ์šด ์ฐฝ์ž‘๋ฌผ์„ ๋„์ถœ
  • ์˜ˆ์ˆ , ๋ฏธ์ˆ , ์•ˆ๋ฌด ๋“ฑ์˜ ๋ถ„์•ผ
  • Generators(์ƒ์„ฑ์ž), Discriminator(๊ฒ€์‚ฌ์ž) ์˜ ๊ฒฝ์Ÿ์„ ํ†ตํ•ด ๋Šฅ๋ ฅ์„ ์˜ฌ๋ฆฐ๋‹ค.
  • Discriminator์˜ ์„ฑ๋Šฅ์ด ์ข‹์œผ๋ฉด Generators์˜ ์„ฑ๋Šฅ๋„ ์˜ฌ๋ผ๊ฐ„๋‹ค.

Python/TensorFlow Examples

  • tensorflow ๋˜๋Š” keras

    • tensorflow ๋Š” ์œ ์—ฐ์„ฑ์ด ๋†’๋‹ค. => ๋Œ€์‹  ํ•˜๋‚˜ํ•˜๋‚˜ ๋งŒ๋“ค์–ด์•ผ ํ•œ๋‹ค.

    • keras๋Š” ๋ธ”๋ก์„ ์Œ“๋“ฏ์ด ์ฝ”๋”ฉํ•œ๋‹ค๊ณ  ํ‘œํ˜„์ด ๋œ๋‹ค.

image-20210628100920015

=> node1, node2์˜ ๋ฐ˜ํ™˜๊ฐ’์€ ํ…์„œํ˜•์ด๋‹ค.

=> ๋”ฐ๋ผ์„œ session์„ ์‚ฌ์šฉํ•ด์„œ ์ถœ๋ ฅํ•ด์ฃผ์–ด์•ผ ํ•œ๋‹ค.

image-20210628102022963

=> with์„ ๊ฐ์‹ธ์„œ ๋งŒ๋“ค์–ด ์ค„ ์ˆ˜๋„ ์žˆ๋‹ค.

  • ํšŒ์‚ฌ์—์„œ๋Š” keras๋ฅผ ์‚ฌ์šฉํ•˜๋Š” ๊ฒƒ์ด ๋” ์ข‹๋‹ค.

Placeholder

image-20210628102147925

=> placeholder: ๋ฐฐ์—ด(ํ–‰๋ ฌ)์ด๋ผ๊ณ  ์ƒ๊ฐํ•˜๋ฉด ๋œ๋‹ค.

=> feed_dict: ๊ฐ’์„ ๋„ฃ์–ด์ค€๋‹ค.


Regression and Classifiation

  • regression => ํšŒ๊ท€๋ถ„์„=> ์„ ํ˜•์œผ๋กœ ๊ฒฝํ–ฅ์„ฑ์„ ํŒ๋‹จํ•œ๋‹ค.

Linear Regression

  • model: H(x) = Wx + b (W, b์˜ ๊ฐ’์„ ์ฐพ์•„์•ผ ํ•œ๋‹ค.)

    image-20210628102933720

=> ์˜ค์ฐจ๋“ค์˜ ํ‰๊ท ์„ ๊ตฌํ•ด์„œ Cost ํ•จ์ˆ˜๋ฅผ ๋„์ถœํ•œ๋‹ค. =>๊ฐ€์žฅ ์ ๋„๋ก ํ•ด์•ผ ํ•œ๋‹ค.

image-20210628103116154

=> ์ œ๊ณฑ์ด๊ธฐ ๋•Œ๋ฌธ์— 2์ฐจ์› ํ•จ์ˆ˜๊ฐ€ ๋œ๋‹ค.

=> Gradient Descent Method (๊ฒฝ์‚ฌ ํ•˜๊ฐ•๋ฒ•)

image-20210628103321974

=> ์ปต์ผ ๋•Œ๋งŒ ๊ฒฝ์‚ฌ ํ•˜๊ฐ•๋ฒ•์ด ์ ํ•ฉํ•˜๋‹ค.

image-20210628103412128

image-20210628103513255


Classification

Binary Classification

  • ex) ์ŠคํŒธ์ธ์ง€ ์•„๋‹Œ์ง€ => Spam Detection
  • ex) Facebook Feed => ๋‚˜์—๊ฒŒ ๋ณด์—ฌ์ค„์ง€ ๋ง์ง€
  • ex) Credit Card Fraudulent Transaction Detection
  • Tumor Image Detection In Radioloy => ๋ฐฉ์‚ฌ์„  ํŒ๋‹จ

Basic Idea

  • step1) Linear regression with H(x)= Wx+b
  • step2) Logistic/sigmoid function (sig(t)) based on the result of Step 1.
    • 0๊ณผ 1 ์‚ฌ์ด๋กœ ๊ฐ’์„ ์ œํ•œํ•ด์ค€๋‹ค.

=> ๋ชจ๋ธ์ด ์ง์„ ์ด๋ฉด ์ปต์ฒ˜๋Ÿผ ์ƒ๊ฒผ๊ฒ ์ง€๋งŒ, binary classification model ์€ ์ปต์ฒ˜๋Ÿผ ์ด์˜๊ฒŒ ์ƒ๊ธฐ์ง€ ์•Š์•˜๋‹ค.

image-20210628110747550

=> H(x) ,y ๋‘˜ ๋‹ค 0์ด๋ฉด ์•„๋ฌด๋Ÿฐ ์‚ฌ๊ฑด์ด ์ผ์–ด๋‚˜์ง€ ์•Š์•˜๋‹ค.

=> y๋Š” 1์ด๋ฉด ์‚ฌ๊ฑด์ด ๋ฐœ์ƒ, H(x)๋Š” 0 ์ด๋ฉด ์ž˜๋ชปํŒ๋‹จํ•œ ๊ฒƒ์ด๋‹ค.(ai ํ‹€๋ฆผ) => cost๊ฐ€ ๋ฐœ์ƒ

=> y=0์ด๋ฉด ์‚ฌ๊ฑด ๋ฐœ์ƒ x, ๊ทธ๋Ÿฌ๋‚˜ H(x) ๊ฐ€ 1์ด๋ฉด ๋˜ ์ž˜๋ชปํŒ๋‹จ. => cost ๋ฐœ์ƒ

=> y=1, H(x)=1 ์ด๋ฉด ์ž˜ ํŒ๋‹จํ•œ ๊ฒƒ์ด๋‹ค. => cost๊ฐ€ ๋ฐœ์ƒํ•˜์ง€ ์•Š๋Š”๋‹ค.

image-20210628111123439


SoftMax Classification (Multinomial Classification)

  • ๋ฐ”์ด๋„ˆ๋ฆฌ๋กœ ๋Œ€๋‹ตํ•  ์ˆ˜ ์žˆ๋„๋ก ํ•ด์•ผ ํ•œ๋‹ค.
  • ๋ฐ”์ด๋„ˆ๋ฆฌ ๊ธฐ๋ฐ˜์ด๊ธฐ ๋–„๋ฌธ์— ์—ฌ๋Ÿฌ๋ฒˆ ์งˆ๋ฌธ์„ ํ•˜๋Š” ๊ฒƒ์ด๋‹ค.

image-20210628111433891

=> A์ธ์ง€, B์ธ์ง€, C์ธ์ง€๋ฅผ ํŒ๋‹จํ•œ๋‹ค.

=> ์‹œ๊ทธ๋ชจ์ด๋“œ ๊ฐ’์ด 0.5๋ณด๋‹ค ํฐ ๊ฒŒ 2๊ฐœ๋ผ๋ฉด? => ์‹œ๊ทธ๋ชจ์ด๋“œ ๊ฐ’์ด ํฐ ๊ฒƒ์„ ๊ณ ๋ฅธ๋‹ค. (๊ฐ€์žฅ ์œ ์‚ฌํ•œ ๊ฒƒ์„ ํŒ๋‹จ.)

image-20210628111639914

=> ~์ด๋ƒ ์•„๋‹ˆ๋ƒ : linear combination

image-20210628112024132

=> X๋Š” ์ธํ’‹ ๊ฐ’

=> A,B,C ์ค‘์— ๊ฐ€์žฅ ํฐ ๊ฐ’์„ ์ฐพ๋Š”๋‹ค.

Cost Function : Cross-Entropy

  • ๋กœ๊ทธ๋ฅผ ๊ธฐ๋ฐ˜์œผ๋กœ ํ•œ๋‹ค.

    image-20210628112441903

=> ๊ฐ’์ด ์—‡๊ฐˆ๋ฆฌ๊ฒŒ ๋˜๋ฉด cost๊ฐ€ ๋ฌดํ•œ๋Œ€๋กœ ๋œ๋‹ค.


ANN (Artificial Neural Networks)

  • ์ธ๊ณต์‹ ๊ฒฝ๋ง

  • ๋”ฅ๋Ÿฌ๋‹

  • Nonlinear Functions

    image-20210628112904232

=> ์–ด๋–ค ์‹ ํ˜ธ๋Š” ์ฆํญ, ๊ฐ์†Œ ๋˜๋Š” ๋ฌด์‹œ

=> ์—ญ์น˜๋ฅผ ๋„˜์œผ๋ฉด ์‹ ํ˜ธ๊ฐ€ ์ „๋‹ฌ๋˜์—ˆ๋‹ค๊ณ  ํŒ๋‹จ

=> activation function: ์‹œ๊ทธ๋ชจ์ด๋“œ๋ฅผ ์”Œ์šฐ๊ณ  ํŒ๋‹จ

image-20210628113217520

=> hidden layer๋ฅผ ๋งŒ๋“ค์–ด์ค€๋‹ค. ๋งŽ์•„์งˆ์ˆ˜๋ก ๊ณ„์‚ฐ์ด ์ •๊ตํ•ด์ง„๋‹ค.

=> hidden layer๊ฐ€ ๋งŽ์•„์ง€๋ฉด ๊ธฐ์ค€์ ์ด ๋งŽ์•„์ง„๋‹ค. ๋ณต์žกํ•œ๊ฑฐ๋ฅผ ์ž˜ ๋ถ„๋ฅ˜ํ•  ์ˆ˜ ์žˆ๋‹ค.

=> hidden layer 1์˜ ํ–‰๋ ฌ๊ณผ 2์˜ ํ–‰๋ ฌ์€ ์™„์ „ํžˆ ๋‹ค๋ฅด๋‹ค.

Application to Logic Gate Design

  • ์„ ์œผ๋กœ ๋‚˜๋ˆ„๋Š” ๊ฒƒ์ด ๊ฐ€๋Šฅํ•˜๋‹ค.

image-20210628130251849

  • XOR ์—ฐ์‚ฐ

    • ๋™์ผํ•  ๋•Œ 0์ด๊ณ  ๋‹ค๋ฅด๋ฉด 1์ธ ์—ฐ์‚ฐ์ž
    • ์ง์„ ์œผ๋กœ ๋‚˜๋ˆŒ ์ˆ˜ ์—†๋‹ค.
  • Multilayer Perceptron(MLP)

    image-20210628130651134

=> 100๊ฐœ์˜ layer = 1๊ฐœ์˜ outlayer + 99๊ฐœ์˜ hidden layer

Solving XOR with MLP

image-20210628131007068

=> A, B๋Š” ์„ธ๋กœ๋กœ 2๊ฐœ ๊ฐ’์ด ์žˆ์–ด์•ผ ํ•œ๋‹ค.

image-20210628131518174


image-20210628132507757

์ด ํ•™์Šตํ•˜๋Š” ๋ณ€์ˆ˜๋Š”?

1x3 3x1

32๊ฐœ +9๊ฐœ => 41๊ฐœ


  • ์œ ๋‹›์ด ๋Š˜์–ด๋‚˜๋ฉด ๊ทธ ๋งŒํผ ์ •๊ตํ•˜๊ฒŒ ๋‚˜๋ˆŒ ์ˆ˜ ์žˆ๋‹ค.
  • wide์™€ deep ์ค‘ deep์ด ๋” ์„ฑ๋Šฅ์ด ์ข‹๋‹ค.
    • ๊ธฐ์ค€์ด ๋Š˜์–ด๋‚˜๋Š” ๊ฒƒ์ด powerful ํ•˜๋‹ค.
  • layer์˜ ์ ํ•ฉํ•œ ๊ฐฏ์ˆ˜๋Š” ํ•ด๋ด์•ผ ์•ˆ๋‹ค.

  • ์••์ถ• ํšจ๊ณผ๊ฐ€ ์ผ์–ด๋‚˜์„œ ์›๋ณธ์˜ ์„ฑ์งˆ์ด ๋–จ์–ด์ง€๋Š” ๊ฒฝ์šฐ๊ฐ€ ์žˆ๋‹ค.

  • input size ๋งŒํผ layer๋ฅผ ์Œ“์•„์ค˜์•ผ ์›๋ณธ์˜ ์„ฑ์งˆ์ด ๋ณด์žฅ๋œ๋‹ค.

Gradient Vanishing Problem

  • layer๋ฅผ ๋”ฅํ•˜๊ฒŒ ์Œ“์œผ๋ฉด ์‹œ๊ทธ๋ชจ์ด๋“œ๊ฐ€ ๊ณ„์† ๋ถ™๊ฒŒ ๋œ๋‹ค.

    => ๋”ฐ๋ผ์„œ ๊ฐ’์ด ์‚ฌ๋ผ์ง€๊ฒŒ ๋œ๋‹ค. (The numberous multiplication of this result converges to near zero.)

ReLU (Rectified Linear Unit)

  • layer๋ฅผ deepํ•˜๊ฒŒ ์Œ“์„ ์ˆ˜ ์žˆ๊ฒŒ ๋˜์—ˆ๋‹ค.

์™œ ๋”ฅ๋Ÿฌ๋‹์ด ์ธ๊ธฐ๊ฐ€ ๋งŽ์•„์กŒ์„๊นŒ?

  • ReLU for solving Gradient Vanishing Problem
  • Convolution Layers for Multi-Dimensional Inputs
  • Big-Data
  • GPU
    • ๊ทธ๋ž˜ํ”ฝ์Šค๋ฅผ ๋น ๋ฅด๊ฒŒ ์—ฐ์‚ฐํ•˜๋Š” ๊ฒƒ
    • ์—ฐ์‚ฐ๋งŒ ๋น ๋ฅด๊ฒŒ๋งŒ ํ•˜๋‹ค. ํ–‰๋ ฌ ์—ฐ์‚ฐ๋ฅผ ๋น ๋ฅด๊ฒŒ ํ•  ์ˆ˜ ์žˆ๋‹ค. => ๊ทธ๋ž˜์„œ ๋”ฅ๋Ÿฌ๋‹์— ๋งŽ์ด ์‚ฌ์šฉ๋˜๋Š” ๊ฒƒ์ด๋‹ค.
    • ์บ์‹œ๋„ ์—†๋‹ค.

์•”๋‹ฌ์˜ ๋ฒ•์น™

์›ํ•ซ์ธ์ฝ”๋”ฉ


MNIST Data Set

  • ์‚ฌ๋žŒ์˜ ์† ๊ธ€์”จ์ž„

  • Hand written images and their label

CNN: Convolution

  • Strides (step-size of filter moving) => ๊ฑด๋„ˆ๋›ฐ๋Š” ๊ฒƒ
  • Padding
  • Pooling Layer (Sampling)=> ์ค„์ด๋Š” ๊ฒƒ
    • max pooling: ํฐ ๊ฐ’์„ ๋ฝ‘์•„์ค€๋‹ค.
  • input์„ ํ•„ํ„ฐ๋ฅผ ์”Œ์–ด์„œ softmax์— ๋„ฃ์–ด์ฃผ๋Š” ๊ฒƒ์ด๋‹ค.
  • ์ธํ’‹์„ ๊ทธ๋Œ€๋กœ ๋ฐ›์•„๋“ค์ด๋Š” ๊ฒƒ์ด ์•„๋‹ˆ๊ณ  ํ•„ํ„ฐ๋ฅผ ๊ฑฐ์ณ์„œ ํ•™์Šต์„ ์‹œํ‚ค๋Š” ๊ฒƒ์ด๋‹ค.
    • ํ•„ํ„ฐ๋„ ํ•™์Šต์ด ๋˜๋Š” ๊ฒƒ์ด๋‹ค.
    • ๋”ฐ๋ผ์„œ ๋ถ„๋ฅ˜ํ•˜๊ธฐ๊ฐ€ ๋”์šฑ ์‰ฌ์›Œ์ง„๋‹ค.

RNN (Recurrent Neural Networks)

  • cell : ํ•˜๋‚˜์˜ ์ธ๊ณต์‹ ๊ฒฝ๋ง, ์‹œ๊ฐ„์— ๋Œ€ํ•œ ๊ฐœ๋…์„ ๋„ฃ์–ด์ค€๋‹ค.

  • ๊ตฌ์กฐ ์„ค๊ณ„๊ฐ€ ์ œ์ผ ์ค‘์š”ํ•˜๋‹ค.

    image-20210628162203688


GAN (Generative Adversarial Network)

  • ๊ฐ€์žฅ ํ˜์‹ ์ ์ธ ๊ธฐ์ˆ 

  • ๋ฌด์–ธ๊ฐ€ ์ƒ์„ฑํ•˜๋Š” ๊ธฐ์ˆ ์ด๋‹ค.

  • ๊ฒฝ์Ÿ์„ ํ†ตํ•ด์„œ ํ•™์Šตํ•œ๋‹ค.

  • ๋‹จ์ ์€ ์ดˆ๊ธฐ์— ๋„ˆ๋ฌด ๋Š๋ฆฌ๋‹ค.

image-20210628170416710

=> Discriminator ๋Š” ์ด์ง„ ๋ถ„๋ฅ˜๊ธฐ๋‹ค. (์ง€๋„ํ•™์Šต)

=> Generator model (๋น„์ง€๋„ํ•™์Šต)


Interpolation vs Linear Regression

  • ๋”ฅ๋Ÿฌ๋‹์€ x, y์˜ ๊ด€๊ณ„๋ฅผ ๊ทœ๋ช…ํ•ด์ค€๋‹ค.
  • ๊ฐ•ํ™”ํ•™์Šต

Dimensionally Reduction: PCA and LDA

  • PCA => ์ฐจ์›์€ ์ค„์—ˆ์ง€๋งŒ ๋ชจ์–‘์€ ์ตœ๋Œ€ํ•œ ๋‚จ๊ธฐ๋Š” ๊ฒƒ

  • LDA => ์ฐจ์›์„ ์ค„์ด๊ณ  ๋ถ„๋ฅ˜ํ•˜๊ธฐ ์ ํ•ฉํ•˜๊ฒŒ

Overfitting

  • ํ•™์Šตํ•  ๋•Œ ๋ฐ์ดํ„ฐ๋ฅผ ์ถฉ๋ถ„ํžˆ ์ค˜์•ผํ•œ๋‹ค.
  • ์˜ค์ฐจ๊ฐ€ ์ƒ๊ธธ ์ˆ˜ ์žˆ๋‹ค.

How to overcome?

  1. more training data
  2. Reduce the number of features => autoencoding, dropout
    1. dropout๋ฅผ ํ˜„์—…์—์„œ ๋งŽ์ด ์‚ฌ์šฉํ•œ๋‹ค.
  3. Regularization
profile
์˜ค๋Š˜๋งŒ ์—ด์‹ฌํžˆ ์‚ด๊ณ  ๋ชจ๋“  ๊ฑธ ๋‚จ๊ธฐ๋˜ ํ›„ํšŒ๋Š” ๋‚จ๊ธฐ์ง€ ๋ง์ž

0๊ฐœ์˜ ๋Œ“๊ธ€