DrawingProcess
๋“œํ”„ DrawingProcess
DrawingProcess
์ „์ฒด ๋ฐฉ๋ฌธ์ž
์˜ค๋Š˜
์–ด์ œ
ยซ   2026/04   ยป
์ผ ์›” ํ™” ์ˆ˜ ๋ชฉ ๊ธˆ ํ† 
1 2 3 4
5 6 7 8 9 10 11
12 13 14 15 16 17 18
19 20 21 22 23 24 25
26 27 28 29 30
  • ๋ถ„๋ฅ˜ ์ „์ฒด๋ณด๊ธฐ (992)
    • Profile & Branding (26)
      • Career (19)
    • IT Trends (255)
      • Conference, Faire (Experien.. (31)
      • News (187)
      • Youtube (20)
      • TED (8)
      • Web Page (2)
      • IT: Etc... (6)
    • Contents (98)
      • Book (67)
      • Lecture (31)
    • Project Process (95)
      • Ideation (0)
      • Study Report (35)
      • Challenge & Award (22)
      • 1Day1Process (5)
      • Making (5)
      • KRC-FTC (Team TC(5031, 5048.. (10)
      • GCP (GlobalCitizenProject) (15)
    • Study: ComputerScience(CS) (73)
      • CS: Basic (9)
      • CS: Database(SQL) (5)
      • CS: Network (14)
      • CS: OperatingSystem (3)
      • CS: Linux (39)
      • CS: Etc... (3)
    • Study: Software(SW) (95)
      • SW: Language (29)
      • SW: Algorithms (1)
      • SW: DataStructure & DesignP.. (1)
      • SW: Opensource (15)
      • SW: Error Bug Fix (43)
      • SW: Etc... (6)
    • Study: Artificial Intellige.. (159)
      • AI: Research (2)
      • AI: 2D Vision(Det, Seg, Tra.. (40)
      • AI: 3D Vision (74)
      • AI: MultiModal (3)
      • AI: SLAM (0)
      • AI: Light Weight(LW) (3)
      • AI: Data Pipeline (7)
      • AI: Machine Learning(ML) (1)
    • Study: Robotics(Robot) (33)
      • Robot: ROS(Robot Operating .. (9)
      • Robot: Positioning (8)
      • Robot: Planning & Control (7)
    • Study: DeveloperTools(DevTo.. (83)
      • DevTool: Git (12)
      • DevTool: CMake (13)
      • DevTool: NoSQL(Elastic, Mon.. (25)
      • DevTool: Container (17)
      • DevTool: IDE (11)
      • DevTool: CloudComputing (4)
    • ์ธ์ƒ์„ ์‚ด๋ฉด์„œ (74) N
      • ๋‚˜์˜ ์ทจ๋ฏธ๋“ค (12) N
      • ๋‚˜์˜ ์ƒ๊ฐ๋“ค (42)
      • ์—ฌํ–‰์„ ๋– ๋‚˜์ž~ (10)
      • ๋ถ„๊ธฐ๋ณ„ ํšŒ๊ณ  (10) N

๊ฐœ๋ฐœ์ž ๋ช…์–ธ

โ€œ ๋งค์ฃผ ๋ชฉ์š”์ผ๋งˆ๋‹ค ๋‹น์‹ ์ด ํ•ญ์ƒ ํ•˜๋˜๋Œ€๋กœ ์‹ ๋ฐœ๋ˆ์„ ๋ฌถ์œผ๋ฉด ์‹ ๋ฐœ์ด ํญ๋ฐœํ•œ๋‹ค๊ณ  ์ƒ๊ฐํ•ด๋ณด๋ผ.
์ปดํ“จํ„ฐ๋ฅผ ์‚ฌ์šฉํ•  ๋•Œ๋Š” ์ด๋Ÿฐ ์ผ์ด ํ•ญ์ƒ ์ผ์–ด๋‚˜๋Š”๋ฐ๋„ ์•„๋ฌด๋„ ๋ถˆํ‰ํ•  ์ƒ๊ฐ์„ ์•ˆ ํ•œ๋‹ค. โ€

- Jef Raskin

๋งฅ์˜ ์•„๋ฒ„์ง€ - ์• ํ”Œ์ปดํ“จํ„ฐ์˜ ๋งคํ‚จํ† ์‹œ ํ”„๋กœ์ ํŠธ๋ฅผ ์ฃผ๋„

์ธ๊ธฐ ๊ธ€

์ตœ๊ทผ ๊ธ€

์ตœ๊ทผ ๋Œ“๊ธ€

ํ‹ฐ์Šคํ† ๋ฆฌ

hELLO ยท Designed By ์ •์ƒ์šฐ.
DrawingProcess

๋“œํ”„ DrawingProcess

Study: Artificial Intelligence(AI)/AI: 2D Vision(Det, Seg, Trac)

[2D Vision] ์—ฐ์„ธ YAI ๊ธฐ์ดˆ์‹ฌํ™”CV: Generative Models

2025. 8. 19. 23:15
๋ฐ˜์‘ํ˜•
๐Ÿ’ก ๋ณธ ๋ฌธ์„œ๋Š” 'Generative Models'์— ๋Œ€ํ•ด ์ •๋ฆฌํ•ด๋†“์€ ๊ธ€์ž…๋‹ˆ๋‹ค.
๋ณธ ๋ณด๊ณ ์„œ๋Š” VAE(Variational AutoEncoder), GAN(Generative Adversarial Network), Diffusion Model ๋“ฑ ์ตœ๊ทผ ๊ฐ๊ด‘๋ฐ›๊ณ  ์žˆ๋Š” ๋‹ค์–‘ํ•œ ์ƒ์„ฑ ๋ชจ๋ธ์˜ ์›๋ฆฌ, ๊ตฌ์กฐ, ๊ทธ๋ฆฌ๊ณ  ์‘์šฉ ๋ถ„์•ผ๋ฅผ ์‹ฌ๋„ ์žˆ๊ฒŒ ๋‹ค๋ฃจ๊ณ  ์žˆ์Šต๋‹ˆ๋‹ค.

Introduction: ์ƒˆ๋กœ์šด ๋ฐ์ดํ„ฐ์˜ ์ฐฝ์กฐ์ž๋“ค

์ƒ์„ฑ ๋ชจ๋ธ์€ ์ฃผ์–ด์ง„ ๋ฐ์ดํ„ฐ์˜ ๋ถ„ํฌ๋ฅผ ํ•™์Šตํ•˜์—ฌ ์ƒˆ๋กœ์šด ๋ฐ์ดํ„ฐ๋ฅผ ์ฐฝ์กฐํ•˜๋Š” ์ธ๊ณต์ง€๋Šฅ ๋ชจ๋ธ์ž…๋‹ˆ๋‹ค. ๊ธฐ์กด์˜ ํŒ๋ณ„ ๋ชจ๋ธ์ด ๋ฐ์ดํ„ฐ๋ฅผ ๋ถ„๋ฅ˜ํ•˜๊ฑฐ๋‚˜ ์˜ˆ์ธกํ•˜๋Š” ๊ฒƒ์— ์ง‘์ค‘ํ–ˆ๋‹ค๋ฉด, ์ƒ์„ฑ ๋ชจ๋ธ์€ ํ…์ŠคํŠธ, ์ด๋ฏธ์ง€, ์Œ์„ฑ ๋“ฑ ํ˜„์‹ค ์„ธ๊ณ„์™€ ์œ ์‚ฌํ•œ ์ƒˆ๋กœ์šด ์ƒ˜ํ”Œ์„ ๋งŒ๋“ค์–ด๋‚ด๋Š” ๊ฒƒ์„ ๋ชฉํ‘œ๋กœ ํ•ฉ๋‹ˆ๋‹ค. ์ด๋Ÿฌํ•œ ์ƒ์„ฑ ๋ชจ๋ธ์€ ํฌ๊ฒŒ ๋‘ ๊ฐ€์ง€๋กœ ๋ถ„๋ฅ˜ํ•  ์ˆ˜ ์žˆ์Šต๋‹ˆ๋‹ค. ์ฒซ ๋ฒˆ์งธ๋Š” ๋ฐ์ดํ„ฐ์˜ ๋ฐ€๋„ ํ•จ์ˆ˜๋ฅผ ๋ช…์‹œ์ ์œผ๋กœ ํ•™์Šตํ•˜๋Š” Explicit Model๋กœ, VAE๋‚˜ Diffusion Model์ด ์—ฌ๊ธฐ์— ์†ํ•ฉ๋‹ˆ๋‹ค. ๋‘ ๋ฒˆ์งธ๋Š” ๋ฐ์ดํ„ฐ์˜ ๋ถ„ํฌ๋ฅผ ๊ฐ„์ ‘์ ์œผ๋กœ ํ•™์Šตํ•˜๋Š” Implicit Model๋กœ, GAN์ด ๋Œ€ํ‘œ์ ์ธ ์˜ˆ์ž…๋‹ˆ๋‹ค.

Method: Network Design & Principles

1. VAE (Variational AutoEncoder)

VAE๋Š” ํ™•๋ฅ ์  ์ ‘๊ทผ์„ ๋„์ž…ํ•˜์—ฌ Latent Space์„ ์—ฐ์†์ ์ธ ๋ถ„ํฌ๋กœ ๋ชจ๋ธ๋งํ•ฉ๋‹ˆ๋‹ค. ๊ธฐ์กด์˜ AutoEncoder๊ฐ€ ์ž ์žฌ ๊ณต๊ฐ„์˜ ๋ถˆ์—ฐ์†์„ฑ ๋•Œ๋ฌธ์— ์ƒˆ๋กœ์šด ๋ฐ์ดํ„ฐ ์ƒ์„ฑ์— ํ•œ๊ณ„๊ฐ€ ์žˆ์—ˆ๋˜ ๋ฐ˜๋ฉด, VAE๋Š” ์ž…๋ ฅ ๋ฐ์ดํ„ฐ๋ฅผ ํ‰๊ท ๊ณผ ๋ถ„์‚ฐ์„ ๊ฐ–๋Š” ํ™•๋ฅ  ๋ถ„ํฌ๋กœ ํ‘œํ˜„ํ•˜์—ฌ ์ž ์žฌ ๊ณต๊ฐ„ ๋‚ด์—์„œ ์ž์—ฐ์Šค๋Ÿฌ์šด ๋ณด๊ฐ„(interpolation)๊ณผ ์ƒˆ๋กœ์šด ๋ฐ์ดํ„ฐ ์ƒ์„ฑ์„ ๊ฐ€๋Šฅํ•˜๊ฒŒ ํ•ฉ๋‹ˆ๋‹ค. VAE๋Š” ์ธ์ฝ”๋”๋ฅผ ํ†ตํ•ด ๋ฐ์ดํ„ฐ๋ฅผ ์ž ์žฌ ๊ณต๊ฐ„์˜ ๋ถ„ํฌ๋กœ ๋ณ€ํ™˜ํ•˜๊ณ , ๋””์ฝ”๋”๋ฅผ ํ†ตํ•ด ์ด ๋ถ„ํฌ์—์„œ ์ƒ˜ํ”Œ๋ง๋œ ๋ฐ์ดํ„ฐ๋ฅผ ์›๋ณธ๊ณผ ์œ ์‚ฌํ•˜๊ฒŒ ๋ณต์›ํ•˜๋Š” ๋ฐฉ์‹์œผ๋กœ ํ•™์Šตํ•ฉ๋‹ˆ๋‹ค.

์ด๋ฅผ ์œ„ํ•œ Loss Function์€ ์œ„์™€ ๊ฐ™์ด ๊ตฌ์„ฑ๋˜๋Š”๋ฐ, ์ด๋Š” Reconstruction Term๊ณผ KL Divergence Term์œผ๋กœ ๊ตฌ์„ฑ๋ฉ๋‹ˆ๋‹ค. Reconstruction Term์€ ์ž ์žฌ ๋ถ„ํฌ z๋กœ๋ถ€ํ„ฐ ๋ฐ์ดํ„ฐ๋ฅผ ์–ผ๋งˆ๋‚˜ ์ž˜ ๋ณต์›ํ–ˆ๋Š”์ง€๋ฅผ ํ‰๊ฐ€ํ•˜๋ฉฐ, ์›๋ณธ ๋ฐ์ดํ„ฐ x์™€ ๋ณต์›๋œ ๋ฐ์ดํ„ฐ x’ ์‚ฌ์ด์˜ ์ฐจ์ด๋ฅผ ์ธก์ •ํ•ฉ๋‹ˆ๋‹ค. KL Divergence Loss (Regularization Loss)๋Š” ๊ทผ์‚ฌ ๋ถ„ํฌ q(zโˆฃx)์™€ ์ •๊ทœ ๋ถ„ํฌ p(z)์˜ ์œ ์‚ฌ๋„๋ฅผ ํ‰๊ฐ€ํ•˜๋ฉฐ, ์ด๋ฅผ ์ตœ์†Œํ™”ํ•˜๋ฉด ์ž ์žฌ ๊ณต๊ฐ„์ด ์ •๊ทœ ๋ถ„ํฌ์™€ ๊ฐ€๊นŒ์›Œ์ง€๊ฒŒ ๋ฉ๋‹ˆ๋‹ค.

2. GAN (Generative Adversarial Network)

GAN์€ ์ƒ์„ฑ์ž(Generator)์™€ ํŒ๋ณ„์ž(Discriminator)๋ผ๋Š” ๋‘ ๊ฐœ์˜ ์‹ ๊ฒฝ๋ง์ด ์„œ๋กœ ๊ฒฝ์Ÿํ•˜๋ฉฐ ํ•™์Šตํ•˜๋Š” ๋…ํŠนํ•œ ๊ตฌ์กฐ๋ฅผ ๊ฐ€์ง‘๋‹ˆ๋‹ค. ์ƒ์„ฑ์ž๋Š” ์ง„์งœ์™€ ๋น„์Šทํ•œ ๊ฐ€์งœ ๋ฐ์ดํ„ฐ๋ฅผ ๋งŒ๋“ค์–ด ํŒ๋ณ„์ž๋ฅผ ์†์ด๋ ค ํ•˜๊ณ , ํŒ๋ณ„์ž๋Š” ์ง„์งœ ๋ฐ์ดํ„ฐ์™€ ์ƒ์„ฑ์ž๊ฐ€ ๋งŒ๋“  ๊ฐ€์งœ ๋ฐ์ดํ„ฐ๋ฅผ ๊ตฌ๋ณ„ํ•˜๋ ค๊ณ  ํ•ฉ๋‹ˆ๋‹ค. ์ด๋Ÿฌํ•œ ์ ๋Œ€์  ํ•™์Šต(Adversarial Training) ๊ณผ์ •์„ ํ†ตํ•ด ์ƒ์„ฑ์ž๋Š” ์ ์  ๋” ์‹ค์ œ์™€ ๊ตฌ๋ณ„ํ•˜๊ธฐ ์–ด๋ ค์šด ๊ณ ํ’ˆ์งˆ์˜ ๋ฐ์ดํ„ฐ๋ฅผ ๋งŒ๋“ค๊ฒŒ ๋ฉ๋‹ˆ๋‹ค.

์ด๋ฅผ ์œ„ํ•œ Loss Function์€ ์œ„์™€ ๊ฐ™์ด ๊ตฌ์„ฑ๋˜๋Š”๋ฐ, ์—ฌ๊ธฐ์„œ x๋Š” ์‹ค์ œ ๋ฐ์ดํ„ฐ์ด๊ณ , z๋Š” ๊ฐ€์งœ ๋ฐ์ดํ„ฐ๋ฅผ ๋งŒ๋“œ๋Š” ๋ถ„ํฌ์ž…๋‹ˆ๋‹ค. ๊ทธ๋Ÿฌ๋ฏ€๋กœ D(x)๋ฅผ ์ตœ๋Œ€ํ•œ 1์— ๊ฐ€๊น๊ฒŒ ๋งŒ๋“ฆ์œผ๋กœ์จ ์‹ค์ œ ๋ฐ์ดํ„ฐ๋ฅผ 1๋กœ ๋ถ„๋ฅ˜ํ•˜๊ณ , D(G(z))๋ฅผ ์ตœ๋Œ€ํ•œ 0์— ๊ฐ€๊น๊ฒŒ ๋งŒ๋“ฆ์œผ๋กœ์จ ๊ฐ€์งœ ๋ฐ์ดํ„ฐ๋ฅผ 0์œผ๋กœ ๋ถ„๋ฅ˜ํ•˜๋„๋ก discriminator๋ฅผ ํ•™์Šตํ•˜๋Š” ๊ฒƒ์ž…๋‹ˆ๋‹ค.

3. Diffusion Model

Diffusion Model์€ ๋ฐ์ดํ„ฐ์— ์ ์ง„์ ์œผ๋กœ ๋…ธ์ด์ฆˆ๋ฅผ ์ถ”๊ฐ€ํ•˜์—ฌ ์™„์ „ํžˆ ๋ฌด์ž‘์œ„์ ์ธ ์ƒํƒœ๋กœ ๋งŒ๋“œ๋Š” Forward ํ™•์‚ฐ ๊ณผ์ •๊ณผ, ์ด ๋…ธ์ด์ฆˆ ์ƒํƒœ๋กœ๋ถ€ํ„ฐ ์ ์ง„์ ์œผ๋กœ ๋…ธ์ด์ฆˆ๋ฅผ ์ œ๊ฑฐํ•˜์—ฌ ์›๋ณธ ๋ฐ์ดํ„ฐ๋ฅผ ๋ณต์›ํ•˜๋Š” Reverse ๋ณต์› ๊ณผ์ •์„ ํ•™์Šตํ•ฉ๋‹ˆ๋‹ค. ํŠนํžˆ, ์—ญ๋ฐฉํ–ฅ ๊ณผ์ •์„ ํ•™์Šตํ•˜๊ธฐ ์œ„ํ•ด U-Net๊ณผ ๊ฐ™์€ ์‹ ๊ฒฝ๋ง์„ ์‚ฌ์šฉํ•˜๋ฉฐ, ์ด๋Š” ๋…ธ์ด์ฆˆ๊ฐ€ ๋‚€ ์ด๋ฏธ์ง€๋ฅผ ์ž…๋ ฅ์œผ๋กœ ๋ฐ›์•„ ์›๋ณธ ์ด๋ฏธ์ง€์˜ ๋…ธ์ด์ฆˆ๋ฅผ ์˜ˆ์ธกํ•˜๊ณ  ์ œ๊ฑฐํ•˜๋Š” ๋ฐฉ์‹์œผ๋กœ ์ž‘๋™ํ•ฉ๋‹ˆ๋‹ค.

Recap: Generative Model 

Application: ์‘์šฉ ๋ถ„์•ผ

์ƒ์„ฑ ๋ชจ๋ธ์€ ๋‹ค์–‘ํ•œ ๋ถ„์•ผ์— ์ ์šฉ๋˜๊ณ  ์žˆ์Šต๋‹ˆ๋‹ค.

  • Classifier Guidance: Diffusion Model์— ๋ถ„๋ฅ˜๊ธฐ์˜ Gradient๋ฅผ ํ™œ์šฉํ•˜์—ฌ ํŠน์ • ํด๋ž˜์Šค์— ๋งž๋Š” ์ด๋ฏธ์ง€๋ฅผ ์ƒ์„ฑํ•˜๋„๋ก ์œ ๋„ํ•ฉ๋‹ˆ๋‹ค.
  • Super Resolution: ์ €ํ™”์งˆ ์ด๋ฏธ์ง€๋ฅผ ๊ณ ํ™”์งˆ๋กœ ๋ณ€ํ™˜ํ•˜๋Š” ์ดˆํ•ด์ƒ๋„ ๊ธฐ์ˆ ์— Diffusion Model์ด ํ™œ์šฉ๋ฉ๋‹ˆ๋‹ค.
  • Inpainting: ์ด๋ฏธ์ง€์˜ ์ผ๋ถ€๊ฐ€ ๊ฐ€๋ ค์ง„ ๋ถ€๋ถ„์„ ์ž์—ฐ์Šค๋Ÿฝ๊ฒŒ ์ฑ„์›Œ ๋„ฃ๋Š” ๊ธฐ์ˆ ์— ์‚ฌ์šฉ๋ฉ๋‹ˆ๋‹ค.
  • Text Guided Image Generation: GLIDE ๋ชจ๋ธ๊ณผ ๊ฐ™์ด ํ…์ŠคํŠธ ์ž„๋ฒ ๋”ฉ์„ ์กฐ๊ฑด์œผ๋กœ ์‚ฌ์šฉํ•˜์—ฌ ํ…์ŠคํŠธ์— ํ•ด๋‹นํ•˜๋Š” ์ด๋ฏธ์ง€๋ฅผ ์ƒ์„ฑํ•ฉ๋‹ˆ๋‹ค.

 

Discussion

  • p(x)๋ฅผ ์ตœ๋Œ€ํ™” ํ•˜๋Š” ๊ฒƒ์˜ ์˜๋ฏธ์™€ ELBO
    • p(x)๋Š” ๋ฐ์ดํ„ฐ์˜ likelihood์—ฌ์„œ ๋ชจ๋ธ์ด ์ตœ๋Œ€ํ™”ํ•˜์—ฌ ์˜ˆ์ธกํ•˜๊ธธ ๋ฐ”๋ผ๋Š”๋ฐ, p(x)๋ฅผ ์ •ํ™•ํ•˜๊ฒŒ ์•Œ๊ธฐ ์–ด๋ ต๊ธฐ ๋•Œ๋ฌธ์— ELBO๋ฅผ ํ•˜ํ•œ์œผ๋กœ ์„ค์ •ํ•˜๊ณ  ํ•˜ํ•œ์„ ๋†’์—ฌ์„œ ์ตœ์†Œํ•œ ์ด ๊ธฐ์ค€๋ณด๋‹ค๋Š” ๋†’์•„์•ผ ํ•จ์„ ์„ค์ •ํ•œ๋‹ค
  • p()์™€ q()๊ฐ€ ๋‹ค๋ฅธ๋ฐ ์–ด๋–ป๊ฒŒ ๋Œ€์‹  ์‚ฌ์šฉํ•  ์ˆ˜ ์žˆ๋‚˜
    • ๋Œ€์‹  ์‚ฌ์šฉํ•œ๋‹ค๊ธฐ ๋ณด๋‹ค๋Š” encodeํ•˜๋Š” ๊ณผ์ •์„ ์•Œ๋ฉด decodeํ•˜๋Š” ๊ณผ์ •์„ ์•Œ ์ˆ˜ ์žˆ๊ธฐ ๋•Œ๋ฌธ์— ๊ฐ™์€ ๋ถ„ํฌ๋ฅผ ์‚ฌ์šฉํ•ด์„œ encoder๋ฅผ ํ†ตํ•ด decoder๋กœ ๋…ธ์ด์ฆˆ์—์„œ ์ด๋ฏธ์ง€๋ฅผ ๋ณต์›ํ•ด๋‚˜๊ฐ€๋Š” ๊ฒƒ์ด๋‹ค
  • diffusion์—์„œ ์ถ”๋ก ํ•  ๋•Œ๋Š” ์ œ๊ฑฐํ•˜๋Š” ๊ณผ์ •์—์„œ ํ•™์Šตํ•œ ๊ฐ€์ค‘์น˜๋งŒ ์‚ฌ์šฉํ•˜๋Š”๊ฐ€(๋…ธ์ด์ฆˆ๋ฅผ ์ถ”๊ฐ€ํ•˜๋Š” ๊ณผ์ •์˜ ๊ฐ€์ค‘์น˜๋ฅผ ์ œ์™ธํ•˜๊ณ )
    • ๊ฐ€์šฐ์‹œ์•ˆ ๋…ธ์ด์ฆˆ๋ฅผ ์‚ฌ์šฉํ•˜๊ณ  denoisingํ•  ๋•Œ๋„ ๊ฐ€์šฐ์‹œ์•ˆ์„ ์‚ฌ์šฉํ•จ,์ถ”๋ก ํ•  ๋•Œ๋„ ๋žœ๋คํ•˜๊ฒŒ ๊ฐ€์šฐ์‹œ์•ˆ ๋…ธ์ด์ฆˆ์—์„œ ์ƒ์„ฑํ•ด์•ผ ํ•จ
  • ์ด๋ฏธ์ง€ ํฌ๊ธฐ๋ฅผ ๊ทธ๋Œ€๋กœ ์ง„ํ–‰ํ•˜๋Š”์ง€ latent space์—์„œ flatten ๋˜์–ด์„œ ๊ฐ€๋Š”์ง€
    • ์ด๋ฏธ์ง€ ํฌ๊ธฐ๋Š” ๊ทธ๋Œ€๋กœ ๊ฐ€๋Š”๊ฒŒ ๋งž๋‹ค
  • GAN๊นŒ์ง€์˜ ํ๋ฆ„์€ latent space์—์„œ ์ƒˆ๋กœ ์ด๋ฏธ์ง€๋ฅผ ์ƒ์„ฑํ•˜๋Š” ๋А๋‚Œ์œผ๋กœ ํ๋ฆ„์ด ์ง„ํ–‰๋˜์—ˆ๋Š”๋ฐ, Diffusion ๋ชจ๋ธ์—์„œ๋Š” noise๋ฅผ ์ถ”๊ฐ€ํ•˜๊ณ  ์ด๋ฅผ ์˜ˆ์ธกํ•˜์—ฌ ์‚ญ์ œํ•˜๋Š” ๋ฐฉํ–ฅ์œผ๋กœ ์žก์€ ์ด์œ ๊ฐ€ ๊ถ๊ธˆํ•˜๊ณ , noise๊ฐ€ ์žˆ๋Š” ์ƒํƒœ์—์„œ๋„ train์„ ํ•˜์—ฌ ๋ณต์›ํ•  ์ˆ˜๋„ ์žˆ์„ํ…๋ฐ ๊ทธ๋ ‡๊ฒŒ ํ•œ ์ด์œ ?
    • GAN์€ ๋‹จ๊ณ„์ ์ธ ํ•™์Šต์ด ์•„๋‹ˆ๋ผ ํ•œ๋ฒˆ์— ํ•™์Šต์„ ํ•ด๋ฒ„๋ฆฌ์ง€๋งŒ, Diffusion ๋ชจ๋ธ์€ ๋‹จ๊ณ„์ ์œผ๋กœ noise๋ฅผ ์ œ๊ฑฐํ•˜๋ฉด์„œ ํ•™์Šตํ•˜๊ธฐ ๋•Œ๋ฌธ์— ์ด๋ฏธ์ง€์˜ ๊ณ„์ธต์ ์ธ ๊ตฌ์กฐ๋ฅผ ํ•™์Šตํ•  ์ˆ˜ ์žˆ์„ ๊ฒƒ ๊ฐ™๋‹ค: ๋…ธ์ด์ฆˆ ์ •๋„๋ฅผ ์กฐ์ ˆํ•˜๋Š” ๋ฒ ํƒ€ ๊ฐ’์ด ์•„์ฃผ ์ž‘๋‹ค
    • GAN๊ฐ™์€ ๊ฒฝ์šฐ๋Š” ์•„์˜ˆ random ํ•œ noise์—์„œ ์‹œ์ž‘ํ•˜์ง€๋งŒ, Diffusion์€ ์›๋ณธ ์ด๋ฏธ์ง€๋กœ๋ถ€ํ„ฐ ํ˜•์„ฑ๋œ noise๋กœ๋ถ€ํ„ฐ ํ•™์Šต๋˜๊ธฐ ๋•Œ๋ฌธ์— ๋” ์‰ฌ์šด ํ•™์Šต์ด ๊ฐ€๋Šฅํ•˜๋‹ค..? ⇒ ResNet ์ฒ˜๋Ÿผ!
    • ๊ธฐ์ค€์ ์œผ๋กœ ์žก์•„๋‘๊ณ , ๋ณต์›์ด ์šฉ์ดํ•˜๋„๋ก ํ•˜๋Š” ๊ด€์ . ์•„์˜ˆ ๋žœ๋คํ•˜๊ฒŒ ์‹œ์ž‘ํ•ด์„œ ์ด๋ฏธ์ง€๋ฅผ ๋งŒ๋“œ๋Š” ๊ฒƒ๋ณด๋‹ค ์ด๋ฏธ์ง€๊ฐ€ ์ฃผ์–ด์ง€๊ณ  ๋…ธ์ด์ฆˆ๋งŒ ์˜ˆ์ธกํ•˜๋Š” ํŽธ์ด ๋” ๊ฐˆํ”ผ๋ฅผ ์žก๋Š”๋ฐ ์šฉ์ดํ•  ๊ฒƒ ๊ฐ™๋‹ค
๋ฐ˜์‘ํ˜•
์ €์ž‘์žํ‘œ์‹œ ๋น„์˜๋ฆฌ ๋ณ€๊ฒฝ๊ธˆ์ง€ (์ƒˆ์ฐฝ์—ด๋ฆผ)

'Study: Artificial Intelligence(AI) > AI: 2D Vision(Det, Seg, Trac)' ์นดํ…Œ๊ณ ๋ฆฌ์˜ ๋‹ค๋ฅธ ๊ธ€

[2D Vision] ์—ฐ์„ธ YAI ๊ธฐ์ดˆ์‹ฌํ™”CV: Transformer & Vision Transformer  (0) 2025.08.21
[Survey] Semantic 3D Reconstruction ๊ด€๋ จ ๋‚ด์šฉ ์ •๋ฆฌ  (0) 2025.08.13
[2D Vision] ์—ฐ์„ธ YAI ๊ธฐ์ดˆ์‹ฌํ™”CV: YOLO  (6) 2025.08.12
[2D Vision] ์—ฐ์„ธ YAI ๊ธฐ์ดˆ์‹ฌํ™”CV: R-CNN, Faster R-CNN  (4) 2025.08.04
[2D Vision] 2D Point Tracking: co-tracker ์‚ฌ์šฉ๋ฒ•  (0) 2025.04.24
    'Study: Artificial Intelligence(AI)/AI: 2D Vision(Det, Seg, Trac)' ์นดํ…Œ๊ณ ๋ฆฌ์˜ ๋‹ค๋ฅธ ๊ธ€
    • [2D Vision] ์—ฐ์„ธ YAI ๊ธฐ์ดˆ์‹ฌํ™”CV: Transformer & Vision Transformer
    • [Survey] Semantic 3D Reconstruction ๊ด€๋ จ ๋‚ด์šฉ ์ •๋ฆฌ
    • [2D Vision] ์—ฐ์„ธ YAI ๊ธฐ์ดˆ์‹ฌํ™”CV: YOLO
    • [2D Vision] ์—ฐ์„ธ YAI ๊ธฐ์ดˆ์‹ฌํ™”CV: R-CNN, Faster R-CNN
    DrawingProcess
    DrawingProcess
    ๊ณผ์ •์„ ๊ทธ๋ฆฌ์ž!

    ํ‹ฐ์Šคํ† ๋ฆฌํˆด๋ฐ”