GAN Fashion Photo Shoot: Garment to Model Images Using Conditional GANs - Costa M. Colbert, Chief Scientist MAD Street Den Inc. Costa Colbert

Page created by Rhonda Herrera

Uncategorized

English

Like
Share
Embed
Fullscreen
Slides
Download HTML
Download PDF
Abuse

←

→

Page content transcription

If your browser does not render page correctly, please read the page content below

GAN Fashion Photo Shoot: Garment to Model Images Using Conditional GANs - Costa M. Colbert, Chief Scientist MAD Street Den Inc. Costa Colbert

GAN Fashion Photo Shoot:
Garment to Model Images Using
     Conditional GANs.
       Costa M. Colbert, Chief Scientist
              MAD Street Den Inc.

         Costa Colbert

Studies show higher purchase rates when clothing is shown on
human figures.

Live model photography is expensive

• Brands and retailers on average spend $100-500 per
  look. Lower per-look prices do not include hair, makeup,
  and styling
• Shooting capacity is limited
     • 35-40 looks per day with hair & makeup
     • 60-70 looks per day without hair & makeup
• Bulk of the cost includes:
     •   Models’ time (at least $1,200 day rate)
     •   Photographer’s time
     •   Digital tech & post production
     •   Hair, makeup, styling
• Cost does not usually include:
     •   Pulling samples
     •   Transporting samples to photo studio
     •   Photo studio & equipment
     •   Time to cast models & hire photographers & stylists
     •   Time of internal teams involved in a photo shoot process
     •   Reshoots due to items not selling with a current image (3-5% of
         items)

GANs to the rescue...

If quality and control of images is sufficient...

GANs to the rescue...

GANs to the rescue...

Garment images                 Generated images

A few examples…

Garment       Catalog   Generated   Generated
image         photo     image       image

A few examples…

Garment       Catalog   Generated   Generated
image         photo     image       image

A few examples…

Garment       Catalog   Generated   Generated
image         photo     image       image

Varying pose

Generative Adversarial Network

                                                        Training Dataset
                                                        providing real samples x

Samples z from prior distribution e.g. N(0,1)

                                                                                          Real
                                                                                          /
                                                                                          Fake

  z
                                                                          Discriminator

                                                D(x) decides if sample is from x

       Generator                      G(z) approximates a sample from x

Also use L1 reconstruction loss term: abs(G(z)-x)

Conditional Generative Adversarial Network

                                                 Training Dataset providing
                                                 real samples x ~ X

Samples z from prior distribution e.g.,
garments, pose, other labels

                                                                                  Real or
                                                                                  Fake ?

                                                              Discriminator CNN

                                           D(x, garment) decides if sample is from x,
                                           also requiring correct garment
           Generator CNN
       G(z) approximates a sample from x

Conditional GAN - discriminator

                                                 (Patch GAN,
                                                 Isola et al.
 Input is Model                                  2016)
 Image
 concatenated to                                 Real/Fake is
 Garment Image                                   determined by
                                                 observing
                             Discriminator CNN
                                                 patches of
                                                 limited
                    6 CNN layers                 extent.
                    Convolution
                    Instance Normalization
                    Dropout

Hmm.., maybe that global discriminator term wasn’t such a
bad idea..

Conditional GAN - generator

                           Encoder         Decoder

                                     Latent
  Pose                                               Fashion Model Image
                                     Vector
  Garment Image                                      Decoder 6 CNN layers
                                     4x3x512
  Encoder 6 CNN layers                               Deconvolution/Unpool
  Convolution                                        Dropout
  Instance Normalization                             Instance Normalization

  Adam Optimizer                                     GTX1080ti 2-4 GB

Pose
Interpolation

GAN generator - latent vector

                    Encoder         Decoder

                          Latent
                          Vector
                          512x4x3

latent vector - Interpolation

                  Enc         Dec

     X1garment                      X1

                        LV1

                  Enc   LV2   Dec

                                    X2
     X2garment

latent vector - Interpolation

                  Enc         Dec

     X1garment                         X1
                                                              Dec

                        LV1
                                                                             XFi,n
                                    F(LV1,LV2)

                  Enc   LV2   Dec

                                       X2
     X2garment

                                                 Fi,n(x,y) = x + (y-x)*i/n

Latent Variable
Interpolation

Shoes
Neckline
Hemline

Latent Variable
Interpolation

Hemline

Latent Variable
Interpolation

Color
Background

Note sleeves.

latent vector – modify values

                 Enc         Dec

    X1garment                      X1
                                              Dec

                       LV1
                                                    XF,
                                   F(LV1,i)         i
                   Latent
                   Vector
                   512x4x3

Principal Component Analysis (PCA)

PCA is a dimension-reduction tool that can reduce a large set
of variables to a small set that still contains most of the
information in the large set.

PCA transforms a number of (possibly) correlated variables
into a (smaller) number of uncorrelated variables (principal
components).

PCA determines the new dimensions on the basis of
variance.

Principal Component Analysis (PCA)

       Use PCA to go from 512x4x3 (~6k) dimensions to 100.

LV6k         PCA          LV100

                      Choose an entry, scale by +/- 10

LV100      Inv(PCA)      LV6k                            Dec

                                                               XF,
                                                               i

PCA Latent
Variable
Interpolation

Skin color
Model build

PCA Latent
Variable
Interpolation

Shoes

Conclusions and Future Work
    Conditional GAN’s are well-suited for image
    generation in well-defined domains.

    Good enough for the casual observer not to
    notice.

    GAN’s have many “moving parts,” but we are
    getting better at using them.

    More work needed on accessories, choosing
    specific shoes, handbags, etc. Requires more
    thought on implementing conditioning labels.

A big thanks to Preferred Networks

Thank you!!

      l   support@madstreetden.com

You can also read

THE EXPLOITATION OF WOMEN AND GIRLS IN INDIA'S - HOME-BASED GARMENT SECTOR By Siddharth Kara

Use of Foldable Containers in Garment on Hanger Transport - R. A. Slingerland

UN(DER) PAID IN THE PANDEMIC - 2020 REPORT - Business & Human Rights ...

Dress- and Auto-Makers in the "Free Trade" Arena

Toward Accurate and Realistic Outfits Visualization with Attention to Details

Spinning around workers' rights - International companies linked to forced labour in Tamil Nadu spinning mills - The Centre for Research on ...

What next for Asian garment production after COVID-19? - The perspectives of industry stakeholders - ILO

The impact of COVID-19 on Myanmar's garment sector - ILO Liaison Office in Myanmar

Impact of garment and textile trade preferences on livelihoods in Cambodia - Oxfam America

STILL UN(DER) PAID 2021 REPORT - Labour Behind the Label

3D CNN-PCA: A Deep-Learning-Based Parameterization for Complex Geomodels Yimin Liu, Louis J. Durlofsky

Prediction Medicine: Biomarkers, Risk Calculators and Magnetic Resonance Imaging as Risk Stratification Tools in Prostate Cancer Diagnosis - MDPI

Package 'MTS' June 4, 2021 - CRAN

Learning Dota 2 Team Compositions

2020 Professional Development Training Prospectus 0844 375 4301 - Property ...

PCA2003 32 kHz watch circuit with programmable adaptive motor pulse and pulse period Rev. 5 - 1 May 2019 - 32 kHz watch circuit with programmable ...

Protocatechuic acid attenuates cerebral aneurysm formation and progression by inhibiting TNF-alpha/Nrf-2/NF-kB-mediated inflammatory mechanisms in ...

Recognizing faces with PCA and ICA - Bruce A. Draper,a,* Kyungim Baek,b Marian Stewart Bartlett,c and J. Ross Beveridgea

PCA9615 2-channel multipoint Fast-mode Plus differential I2C-bus buffer with hot-swap logic Rev. 2 - 16 September 2021

TOOL KIT Patient Controlled Analgesia (PCA) Guidelines of Care For the Opioid Naïve Patient

A STEP-BY-STEP GUIDE TO THE BLACK-LITTERMAN MODEL

Deep Learning in Asset Pricing - AQR Capital Management

Deep Learning in Asset Pricing

Vector Flight Controller + OSD User Guide - December, 2014 Version 2.0 Software Version 11.48+

PCA9306 Dual bidirectional I2C-bus and SMBus voltage-level translator Rev. 9.1 - 31 August 2021 Product data sheet - NXP

Personal Care Attendant Program - Mass.gov

FÉDÉRATION INTERNATIONALE DE SKI INTERNATIONAL SKI FEDERATION INTERNATIONALER SKIVERBAND - TIMING-BOOKLET Alpine Skiing - FIS Ski

Cloudaerias Types Of Clouds Recognition API - Microsoft

Collecting the ephemeral social digital photograph for the future - Collecting Social Photo by Nordiska museet, Stockholm County Museum, The ...

Accurate Stock Price Forecasting Using Robust and Optimized Deep Learning Models

Recognition and Repetition Counting for LME exercises in Exercise-based CVD Rehabilitation: A Comparative Study using Artificial Intelligence ...

QUANTUM EFFICIENCY ENHANCEMENT OF A GAN-BASED GREEN LIGHT-EMITTING DIODE BY A GRADED INDIUM COMPOSITION P-TYPE INGAN LAYER - MDPI

Molecular Biology 2020 - Research use only - Real Laboratory

Marketing Discipline Guidelines - for RO / SKO Dealerships of Public Sector Marketing Companies - Hindustan Petroleum

Cheer Makeup Free Samples - She Will Fight

Guideline Salmonella Monitoring Pigs - QS Qualität und ...

TARIFFS 2021 - Lapland Ice Driving

The Beauty of Trees Mary Keenan - Gash Gardens & Nursery, Co. Laois - Limerick.ie

SAMPLE PACKAGING AND TRANSPORT - Primoris

Microwave heating of food materials: Role of susceptor - IOPscience

ADVISORY ACTIVE SOIL GAS INVESTIGATIONS - California Environmental Protection Agency Department of Toxic Substances Control Los Angeles Regional ...

Customer Information Handling Laboratory Samples - A001080/00E - MTU

Dynamic tensile properties, deformation, and failure testing of impact loaded coal samples with various water content

Draft Method 1633 Analysis of Per- and Polyfluoroalkyl Substances (PFAS) in Aqueous, Solid, Biosolids, and Tissue Samples by LC-MS/MS - Office of ...

Investigation of Structure of Technology Cycle Time of Hydraulic Manipulators in the Process of Loading Forwarders with Logs - Crojfe

Data Envelopment Analysis in the Presence of Correlated Evaluation Variables

The Effects of Homogenizing and Quenching and Tempering Treatments on Crack Healing - MDPI

RCPAQAP - 2019 Product Catalogue

COBAS SARS-COV-2 QUALITATIVE ASSAY FOR USE ON THE COBAS 6800/8800 SYSTEMS - US FOOD AND DRUG ...

The Pricing of the Illiquidity Factor's Conditional Risk with Time-varying Premium - American Economic ...