DrawingProcess
๋“œํ”„ DrawingProcess
DrawingProcess
์ „์ฒด ๋ฐฉ๋ฌธ์ž
์˜ค๋Š˜
์–ด์ œ
ยซ   2025/05   ยป
์ผ ์›” ํ™” ์ˆ˜ ๋ชฉ ๊ธˆ ํ† 
1 2 3
4 5 6 7 8 9 10
11 12 13 14 15 16 17
18 19 20 21 22 23 24
25 26 27 28 29 30 31
  • ๋ถ„๋ฅ˜ ์ „์ฒด๋ณด๊ธฐ (964)
    • Profile & Branding (22)
      • Career (15)
    • IT Trends (254)
      • Conference, Faire (Experien.. (31)
      • News (187)
      • Youtube (19)
      • TED (8)
      • Web Page (2)
      • IT: Etc... (6)
    • Contents (97)
      • Book (66)
      • Lecture (31)
    • Project Process (94)
      • Ideation (0)
      • Study Report (34)
      • Challenge & Award (22)
      • 1Day1Process (5)
      • Making (5)
      • KRC-FTC (Team TC(5031, 5048.. (10)
      • GCP (GlobalCitizenProject) (15)
    • Study: ComputerScience(CS) (72)
      • CS: Basic (9)
      • CS: Database(SQL) (5)
      • CS: Network (14)
      • CS: OperatingSystem (3)
      • CS: Linux (39)
      • CS: Etc... (2)
    • Study: Software(SW) (95)
      • SW: Language (29)
      • SW: Algorithms (1)
      • SW: DataStructure & DesignP.. (1)
      • SW: Opensource (15)
      • SW: Error Bug Fix (43)
      • SW: Etc... (6)
    • Study: Artificial Intellige.. (149)
      • AI: Research (1)
      • AI: 2D Vision(Det, Seg, Tra.. (35)
      • AI: 3D Vision (70)
      • AI: MultiModal (3)
      • AI: SLAM (0)
      • AI: Light Weight(LW) (3)
      • AI: Data Pipeline (7)
      • AI: Machine Learning(ML) (1)
    • Study: Robotics(Robot) (33)
      • Robot: ROS(Robot Operating .. (9)
      • Robot: Positioning (8)
      • Robot: Planning & Control (7)
    • Study: DeveloperTools(DevTo.. (83)
      • DevTool: Git (12)
      • DevTool: CMake (13)
      • DevTool: NoSQL(Elastic, Mon.. (25)
      • DevTool: Container (17)
      • DevTool: IDE (11)
      • DevTool: CloudComputing (4)
    • ์ธ์ƒ์„ ์‚ด๋ฉด์„œ (64)
      • ๋‚˜์˜ ์ทจ๋ฏธ๋“ค (7)
      • ๋‚˜์˜ ์ƒ๊ฐ๋“ค (42)
      • ์—ฌํ–‰์„ ๋– ๋‚˜์ž~ (10)
      • ๋ถ„๊ธฐ๋ณ„ ํšŒ๊ณ  (5)

๊ฐœ๋ฐœ์ž ๋ช…์–ธ

โ€œ ๋งค์ฃผ ๋ชฉ์š”์ผ๋งˆ๋‹ค ๋‹น์‹ ์ด ํ•ญ์ƒ ํ•˜๋˜๋Œ€๋กœ ์‹ ๋ฐœ๋ˆ์„ ๋ฌถ์œผ๋ฉด ์‹ ๋ฐœ์ด ํญ๋ฐœํ•œ๋‹ค๊ณ  ์ƒ๊ฐํ•ด๋ณด๋ผ.
์ปดํ“จํ„ฐ๋ฅผ ์‚ฌ์šฉํ•  ๋•Œ๋Š” ์ด๋Ÿฐ ์ผ์ด ํ•ญ์ƒ ์ผ์–ด๋‚˜๋Š”๋ฐ๋„ ์•„๋ฌด๋„ ๋ถˆํ‰ํ•  ์ƒ๊ฐ์„ ์•ˆ ํ•œ๋‹ค. โ€

- Jef Raskin

๋งฅ์˜ ์•„๋ฒ„์ง€ - ์• ํ”Œ์ปดํ“จํ„ฐ์˜ ๋งคํ‚จํ† ์‹œ ํ”„๋กœ์ ํŠธ๋ฅผ ์ฃผ๋„

์ธ๊ธฐ ๊ธ€

์ตœ๊ทผ ๊ธ€

์ตœ๊ทผ ๋Œ“๊ธ€

ํ‹ฐ์Šคํ† ๋ฆฌ

hELLO ยท Designed By ์ •์ƒ์šฐ.
DrawingProcess

๋“œํ”„ DrawingProcess

[Gen AI] ์ƒ์„ฑํ˜• ๋ชจ๋ธ๋“ค์˜ ์›๋ฆฌ ๋น„๊ต: VAE, GAN, Flow-based, Diffusion
Study: Artificial Intelligence(AI)/AI: 2D Vision(Det, Seg, Trac)

[Gen AI] ์ƒ์„ฑํ˜• ๋ชจ๋ธ๋“ค์˜ ์›๋ฆฌ ๋น„๊ต: VAE, GAN, Flow-based, Diffusion

2024. 9. 14. 08:16
๋ฐ˜์‘ํ˜•
๐Ÿ’ก ๋ณธ ๋ฌธ์„œ๋Š” '[Gen AI] ์ƒ์„ฑํ˜• ๋ชจ๋ธ๋“ค์˜ ์›๋ฆฌ ๋น„๊ต: VAE, GAN, Flow-based, Diffusion'์— ๋Œ€ํ•ด ์ •๋ฆฌํ•ด๋†“์€ ๊ธ€์ž…๋‹ˆ๋‹ค.
์ƒ์„ฑํ˜• ๋ชจ๋ธ๋“ค ์ค‘ ๋Œ€ํ‘œ์ ์ธ ๋ชจ๋ธ์ธ VAE, GAN, Flow-based, Diffusion์— ๋Œ€ํ•ด ๋น„๊ตํ•˜๊ณ , ๊ฐ ๋ฐฉ๋ฒ•๋ก ์ด Latent variable๋กœ๋ถ€ํ„ฐ ์ƒ์„ฑํ•˜๋Š” ์›๋ฆฌ๋ฅผ ์ •๋ฆฌํ•˜์˜€์œผ๋‹ˆ ์ฐธ๊ณ ํ•˜์‹œ๊ธฐ ๋ฐ”๋ž๋‹ˆ๋‹ค.

1. Prerequisite

1) Markov Chain

Markov ์„ฑ์งˆ์„ ๊ฐ–๋Š” ์ด์‚ฐ ํ™•๋ฅ  ๊ณผ์ •

  • Markov ์„ฑ์งˆ: "ํŠน์ • ์ƒํƒœ์˜ ํ™•๋ฅ (t+1)์€ ์˜ค์ง ํ˜„์žฌ(t)์˜ ์ƒํƒœ์— ์˜์กดํ•œ๋‹ค"
  • ์ด์‚ฐ ํ™•๋ฅ  ๊ณผ์ •: ์ด์‚ฐ์ ์ธ ์‹œ๊ฐ„(0์ดˆ, 1์ดˆ, ..,) ์†์—์„œ์˜ ํ™•๋ฅ ์  ํ˜„์ƒ

$$ P[s_(t+1) | s_(t)] = P[s_(t+1) | s_1, ..., s_(t)] $$

e.g. "๋‚ด์ผ์˜ ๋‚ ์”จ๋Š” ์˜ค๋Š˜์˜ ๋‚ ์”จ๋งŒ ๋ณด๊ณ  ์•Œ ์ˆ˜ ์žˆ๋‹ค."

2) Normalizing Flow

์‹ฌ์ธต์‹ ๊ฒฝ๋ง ๊ธฐ๋ฐ˜ ํ™•๋ฅ ์  ์ƒ์„ฑ ๋ชจํ˜• ์ค‘ ํ•˜๋‚˜. ์ž ์žฌ ๋ณ€์ˆ˜(Z) ๊ธฐ๋ฐ˜ ํ™•๋ฅ ์  ์ƒ์„ฑ ๋ชจํ˜•์œผ๋กœ์„œ, ์ž ์žฌ ๋ณ€์ˆ˜(Z) ํš๋“์— '๋ณ€์ˆ˜ ๋ณ€ํ™˜' ๊ณต์‹์„ ํ™œ์šฉํ•ฉ๋‹ˆ๋‹ค.

2. Probabilistic Generative Model: Latent variable model

1) Overview of Generative Models

  • ๋ฐ˜๋ณต์ ์ธ ๋ณ€ํ™”(iterative transformation)๋ฅผ ํ™œ์šฉํ•œ๋‹ค๋Š” ์ ์—์„œ Flow-based models์™€ ์œ ์‚ฌ
  • ๋ถ„ํฌ์— ๋Œ€ํ•œ ๋ณ€๋ถ„์  ์ถ”๋ก (Variational Inference)์„ ํ†ตํ•œ ํ•™์Šต์„ ์ง„ํ–‰ํ•œ๋‹ค๋Š” ์ ์€ VAE์™€ ์œ ์‚ฌ
  • ์ตœ๊ทผ์—๋Š” Diffusion ๋ชจ๋ธ์˜ ํ•™์Šต์— Adversarial Training์„ ํ™œ์šฉํ•˜๊ธฐ๋„ ํ•จ (Diffusion-GAN, 2022)

2) Generative model: Latent variable model

๊ฒฐ๊ตญ ์ƒ์„ฑ ๋ชจ๋ธ๋กœ๋ถ€ํ„ฐ ์›ํ•˜๋Š” ๊ฒƒ์€ ๋งค์šฐ ๊ฐ„๋‹จํ•œ ๋ถ„ํฌ(Z)๋ฅผ ํŠน์ •ํ•œ ํŒจํ„ด์„ ๊ฐ–๋Š” ๋ถ„ํฌ๋กœ ๋ณ€ํ™˜(Mapping, transformation, sampling)ํ•˜๋Š” ๊ฒƒ์ž…๋‹ˆ๋‹ค. ๊ทธ๋ ‡๊ธฐ์— ๋Œ€๋ถ€๋ถ„์˜ ์ƒ์„ฑ๋ชจ๋ธ์ด ์ฃผ์–ด์ง„ ์ž…๋ ฅ ๋ฐ์ดํ„ฐ๋กœ๋ถ€ํ„ฐ latent variable(Z)์„ ์–ป์–ด๋‚ด๊ณ , ์ด๋ฅผ ๋ณ€ํ™˜ํ•˜๋Š” ์—ญ๋Ÿ‰์„ ํ•™์Šตํ•˜๊ณ ์ž ํ•ฉ๋‹ˆ๋‹ค.

3) Variational Auto Encoder

  • ํ•™์Šต๋œ Decoder network๋ฅผ ํ†ตํ•ด latent variable์„ ํŠน์ •ํ•œ ํŒจํ„ด์˜ ๋ถ„ํฌ๋กœ mapping
  • Encoder๋ฅผ ๋ชจ๋ธ ๊ตฌ์กฐ์— ์ถ”๊ฐ€ํ•ด, Latent variable / Encoder / Decoder๋ฅผ ๋ชจ๋‘ ํ•™์Šต์‹œํ‚ต๋‹ˆ๋‹ค.

3) Generative Adversarial Network (GAN) 

  • ํ•™์Šต๋œ Generator๋ฅผ ํ†ตํ•ด latent variable์„ ํŠน์ •ํ•œ ํŒจํ„ด์˜ ๋ถ„ํฌ๋กœ mapping
  • Discriminator๋ฅผ ๋ชจ๋ธ ๊ตฌ์กฐ์— ์ถ”๊ฐ€ํ•ด, Generator๋ฅผ ํ•™์Šต์‹œํ‚ด

4) Flow-based Model

๊ธฐ๋ณธ์ ์œผ๋กœ ๊ฐ„๋‹จํ•˜๊ณ  tractableํ•œ prior ๋ถ„ํฌ๋ฅผ ๋ณต์žกํ•œ ๋ถ„ํฌ๋กœ ๋ณ€ํ™”์‹œ๊ณ ์ž ํ•ฉ๋‹ˆ๋‹ค. ์ด๋ฅผ ์œ„ํ•ด ํ•™์Šตํ•œ Invertible Function์˜ Inverse mapping์„ ์ด์šฉํ•˜๋ฉฐ, ์ด๋Ÿฌํ•œ function์„ flow๋ผ ํ•˜์—ฌ ์ƒ์„ฑ์— ํ™œ์šฉํ•ฉ๋‹ˆ๋‹ค.

  • ํ•™์Šต๋œ Flow model์˜ Inverse mapping์„ ํ†ตํ•ด latent variable์„ ํŠน์ •ํ•œ ํŒจํ„ด์˜ ๋ถ„ํฌ๋กœ mapping
  • ์ƒ์„ฑ์— ํ™œ์šฉ๋˜๋Š” Inverse mapping์„ ํ•™์Šตํ•˜๊ธฐ ์œ„ํ•ด Invertible Function์„ ํ•™์Šต

5) Diffusion based generative model

Diffusion ๋ชจ๋ธ๋„ ๊ธฐ๋ณธ์ ์œผ๋กœ ๊ฐ„๋‹จํ•˜๊ณ  tractableํ•œ prior ๋ถ„ํฌ๋ฅผ ๋ณต์žกํ•œ ๋ถ„ํฌ๋กœ ๋ณ€ํ™”์‹œ๊ณ ์ž ํ•ฉ๋‹ˆ๋‹ค. 

  • ํ•™์Šต๋œ Diffusion Model์˜ ์กฐ๊ฑด๋ถ€ ํ™•๋ฅ  ๋ถ„ํฌ P(x|z)๋ฅผ ํ†ตํ•ด ํŠน์ •ํ•œ ํŒจํ„ด์˜ ๋ถ„ํฌ ํš๋“
  • ์ƒ์„ฑ์— ํ™œ์šฉ๋˜๋Š” ์กฐ๊ฑด๋ถ€ ํ™•์œจ ๋ถ„ํฌ P(x|z)๋ฅผ ํ•™์Šตํ•˜๊ธฐ ์œ„ํ•ด Diffusion process q(z|x)๋ฅผ ํ™œ์šฉ 

์ฐธ๊ณ 

[Youtube] [Paper Review] Denoising Diffusion Probabilistic Models: https://www.youtube.com/watch?v=_JQSMhqXw-4

๋ฐ˜์‘ํ˜•
์ €์ž‘์žํ‘œ์‹œ ๋น„์˜๋ฆฌ ๋ณ€๊ฒฝ๊ธˆ์ง€

'Study: Artificial Intelligence(AI) > AI: 2D Vision(Det, Seg, Trac)' ์นดํ…Œ๊ณ ๋ฆฌ์˜ ๋‹ค๋ฅธ ๊ธ€

[Gen AI] Generative Adversarial Network(GAN) ์„ค๋ช…: ๊ธฐ์ดˆ  (1) 2024.10.02
[Gen AI] Diffusion Model ์„ค๋ช…: ์‘์šฉ  (0) 2024.09.14
[Gen AI] Diffusion Model ์„ค๋ช…: ๊ธฐ์ดˆ  (0) 2024.09.13
[Gen AI] Stable Diffusion WebUI Docker ํ™˜๊ฒฝ ์„ค์ • ๋ฐ ์‚ฌ์šฉํ•˜๊ธฐ  (0) 2024.08.03
[Gen AI] DreamBooth ์‚ฌ์šฉํ•ด๋ณด๊ธฐ - DreamBooth: Fine Tuning Text-to-Image Diffusion Models for Subject-Driven Generation  (0) 2024.07.28
    'Study: Artificial Intelligence(AI)/AI: 2D Vision(Det, Seg, Trac)' ์นดํ…Œ๊ณ ๋ฆฌ์˜ ๋‹ค๋ฅธ ๊ธ€
    • [Gen AI] Generative Adversarial Network(GAN) ์„ค๋ช…: ๊ธฐ์ดˆ
    • [Gen AI] Diffusion Model ์„ค๋ช…: ์‘์šฉ
    • [Gen AI] Diffusion Model ์„ค๋ช…: ๊ธฐ์ดˆ
    • [Gen AI] Stable Diffusion WebUI Docker ํ™˜๊ฒฝ ์„ค์ • ๋ฐ ์‚ฌ์šฉํ•˜๊ธฐ
    DrawingProcess
    DrawingProcess
    ๊ณผ์ •์„ ๊ทธ๋ฆฌ์ž!

    ํ‹ฐ์Šคํ† ๋ฆฌํˆด๋ฐ”