
minivision-ai / photo2cartoon

A portrait cartoonization exploration project (photo-to-cartoon translation)


Top Related Projects

[Open Source]. The improved version of AnimeGAN. Landscape photos/videos to anime

Official tensorflow implementation for CVPR2020 paper “Learning to Cartoonize Using White-box Cartoon Representations”

[CVPR 2022] Pastiche Master: Exemplar-Based High-Resolution Portrait Style Transfer

PhotoMaker [CVPR 2024]

Bringing Old Photo Back to Life (CVPR 2020 oral)

Quick Overview

Photo2Cartoon is an AI-powered project that transforms portrait photos into cartoon-style images. It combines face detection and alignment, head segmentation, and a generative adversarial network (GAN) to create high-quality cartoon renderings while preserving the subject's key facial features and expressions.

Pros

  • Produces high-quality cartoon-style images from portrait photos
  • Preserves facial features and expressions effectively
  • Offers both a pre-trained model and the ability to train custom models
  • Can be tried instantly via the authors' WeChat mini-program and online demo

Cons

  • Limited to portrait photos; doesn't work well with full-body images or non-human subjects
  • Requires specific hardware (NVIDIA GPU) for optimal performance
  • May struggle with certain facial features or complex backgrounds
  • Documentation is primarily in Chinese, which may be challenging for non-Chinese speakers

Code Examples

The repository ships as scripts (test.py, train.py) rather than an installable package, so the snippets below are illustrative sketches of its interface; in particular, the Photo2CartoonTrainer wrapper in the third example is hypothetical.

  1. Loading the pre-trained model:

    from photo2cartoon import Photo2Cartoon

    model = Photo2Cartoon()

  2. Converting a photo to a cartoon (in the repo's test.py, inference takes an RGB image array):

    import cv2

    img = cv2.cvtColor(cv2.imread("path/to/input/photo.jpg"), cv2.COLOR_BGR2RGB)
    cartoon = model.inference(img)
    cv2.imwrite("path/to/output/cartoon.jpg", cartoon)

  3. Training a custom model (hypothetical wrapper; the repo itself is driven by python train.py --dataset photo2cartoon):

    from photo2cartoon import Photo2CartoonTrainer

    trainer = Photo2CartoonTrainer(
        photo_dir="path/to/photo/dataset",
        cartoon_dir="path/to/cartoon/dataset",
        epochs=100,
        batch_size=1
    )
    trainer.train()

Getting Started

To get started with Photo2Cartoon:

  1. Clone the repository:

    git clone https://github.com/minivision-ai/photo2cartoon.git
    cd photo2cartoon
    
  2. Install dependencies:

    pip install -r requirements.txt
    
  3. Download the pre-trained models (the README below links Google Drive and Baidu Netdisk mirrors) and place photo2cartoon_weights.pt and model_mobilefacenet.pth under models/, and seg_model_384.pb under utils/.

  4. Run a test conversion:

    python test.py --photo_path ./images/photo_test.jpg --save_path ./images/cartoon_result.png

Competitor Comparisons

[Open Source]. The improved version of AnimeGAN. Landscape photos/videos to anime

Pros of AnimeGANv2

  • Supports multiple anime styles, offering more versatility in output
  • Generally produces higher quality anime-style images with better detail preservation
  • Includes pre-trained models for easier implementation

Cons of AnimeGANv2

  • Requires more computational resources due to its complex architecture
  • Less focused on cartoon-style output, which may not suit all use cases
  • Documentation is less comprehensive, potentially making it harder for beginners

Code Comparison

AnimeGANv2:

from test import AnimeGANv2
model = AnimeGANv2()
output = model.inference('input.jpg')

photo2cartoon:

from photo2cartoon import Photo2Cartoon
p2c = Photo2Cartoon()
cartoon = p2c.inference('input.jpg')

Both snippets are illustrative rather than verbatim, but the repositories expose similarly simple inference flows; AnimeGANv2 provides more options for style selection and fine-tuning, while photo2cartoon has a simpler implementation focused specifically on cartoon-style conversion.

AnimeGANv2 excels in producing high-quality anime-style images with multiple style options, while photo2cartoon offers a more streamlined approach for cartoon-style conversion. The choice between the two depends on the specific requirements of the project, available computational resources, and the desired output style.

Official tensorflow implementation for CVPR2020 paper “Learning to Cartoonize Using White-box Cartoon Representations”

Pros of White-box-Cartoonization

  • More detailed and customizable cartoonization process
  • Provides a white-box approach, offering better interpretability
  • Supports both image and video cartoonization

Cons of White-box-Cartoonization

  • Requires more computational resources
  • Longer processing time for cartoonization
  • More complex setup and usage

Code Comparison

White-box-Cartoonization:

output = cartoonize(input_img, model)
guided_filter = GuidedFilter(r=5, eps=2e-1)
output = guided_filter(output, output)

photo2cartoon:

c2p = Photo2Cartoon()
cartoon_img = c2p.inference(img)

White-box-Cartoonization offers more control over the cartoonization process, allowing for fine-tuning of parameters and applying additional filters. photo2cartoon provides a simpler, more straightforward implementation with fewer customization options.

Both projects aim to transform photos into cartoon-style images, but White-box-Cartoonization offers a more comprehensive approach with additional features and flexibility. However, this comes at the cost of increased complexity and resource requirements. photo2cartoon, on the other hand, provides a more streamlined solution that may be easier to integrate into existing projects but with less control over the output.

[CVPR 2022] Pastiche Master: Exemplar-Based High-Resolution Portrait Style Transfer

Pros of DualStyleGAN

  • Offers more diverse and flexible style transfer options
  • Capable of generating high-quality, high-resolution outputs
  • Provides better preservation of facial details and expressions

Cons of DualStyleGAN

  • More complex implementation and potentially higher computational requirements
  • May require more fine-tuning and parameter adjustments for optimal results
  • Limited to facial style transfer (though Photo2Cartoon is likewise portrait-focused rather than full-body)

Code Comparison

DualStyleGAN:

from models.stylegan2_generator import Generator
from models.dual_generator import DualGenerator

generator = Generator(size, style_dim, n_mlp)
dual_generator = DualGenerator(generator, n_style_layers)

Photo2Cartoon:

from models.UGATIT import UGATIT
from utils.utils import *

model = UGATIT(args)
model.build_model()

DualStyleGAN focuses on a more sophisticated generator architecture, while Photo2Cartoon utilizes a UGATIT-based model. DualStyleGAN's implementation suggests a more flexible approach to style manipulation, potentially offering greater control over the output. Photo2Cartoon's code appears simpler and more straightforward, which may make it easier to use and integrate into existing projects.

PhotoMaker [CVPR 2024]

Pros of PhotoMaker

  • More versatile, capable of generating various styles beyond cartoons
  • Supports custom style inputs for personalized results
  • Offers more advanced features like multi-subject handling

Cons of PhotoMaker

  • Potentially more complex to use due to additional features
  • May require more computational resources
  • Less specialized for cartoon-style outputs compared to Photo2Cartoon

Code Comparison

PhotoMaker:

from photomaker import PhotoMaker

pm = PhotoMaker()
result = pm.generate(
    input_image="path/to/input.jpg",
    style_image="path/to/style.jpg",
    num_outputs=1
)

Photo2Cartoon:

from photo2cartoon import Photo2Cartoon

p2c = Photo2Cartoon()
result = p2c.inference(input_image)  # RGB image array

Both snippets above are schematic rather than verbatim, but both repositories offer Python-based interfaces for image transformation. PhotoMaker provides more flexibility with style inputs and multiple output options, while Photo2Cartoon focuses specifically on cartoon-style conversions with a simpler API. PhotoMaker's approach allows for greater customization but may require more setup and configuration. Photo2Cartoon offers a more straightforward solution for users specifically interested in cartoon-style transformations.

Bringing Old Photo Back to Life (CVPR 2020 oral)

Pros of Bringing-Old-Photos-Back-to-Life

  • Focuses on restoring and enhancing old, damaged photos
  • Utilizes advanced AI techniques for face restoration and colorization
  • Provides a comprehensive solution for multiple photo restoration tasks

Cons of Bringing-Old-Photos-Back-to-Life

  • More complex setup and usage compared to Photo2Cartoon
  • Requires more computational resources due to its comprehensive approach
  • May have a steeper learning curve for users unfamiliar with deep learning frameworks

Code Comparison

Photo2Cartoon:

from photo2cartoon import Photo2Cartoon
p2c = Photo2Cartoon()
cartoon = p2c.inference(img)

Bringing-Old-Photos-Back-to-Life:

from bringing_old_photos_back_to_life import Restoration
restorer = Restoration()
restored_image = restorer.restore(image_path)

Both projects use Python and provide simple inference entry points (the snippets above are schematic; neither ships as an installable package), but Bringing-Old-Photos-Back-to-Life offers a more comprehensive restoration process, while Photo2Cartoon focuses specifically on creating cartoon-style images from photos.


README

Portrait Cartoonization (Photo to Cartoon)

Chinese Version | English Version

This is Minivision Technology's (小视科技) cartoon-portrait exploration project. You can scan the QR code below with WeChat, or search for the "AI卡通秀" (AI Cartoon Show) mini-program, to try the cartoonization effect.

You can also try it online on our AI open platform: https://ai.minivision.cn/#/coreability/cartoon

Technical discussion QQ group: 937627932


Introduction

The goal of portrait cartoon-style rendering is to convert a real photo into a cartoon-style, non-photorealistic image while preserving the identity information and texture details of the original. Our approach is to learn the photo-to-cartoon mapping from a large amount of photo/cartoon data. In general, pix2pix methods based on paired data achieve good translation results, but in this task the input and output contours do not correspond one-to-one (for example, cartoon-style eyes are larger and chins are slimmer), and paired data is difficult and costly to draw. We therefore adopt an unpaired image translation approach.

The classic unpaired image translation method is CycleGAN, but the original CycleGAN's outputs often contain noticeable artifacts and are unstable. The recent U-GAT-IT paper proposed a normalization method, AdaLIN, which automatically adjusts the balance between Instance Norm and Layer Norm; combined with an attention mechanism, it achieves high-quality portrait-to-anime style transfer.
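For reference, the AdaLIN mechanism is compact enough to sketch directly. The following is a simplified PyTorch rendering of the formulation in the U-GAT-IT paper, not this repository's exact code; gamma and beta are assumed to come from an MLP elsewhere in the generator:

import torch
import torch.nn as nn

class AdaLIN(nn.Module):
    def __init__(self, num_features, eps=1e-5):
        super().__init__()
        self.eps = eps
        # rho controls the per-channel mix of Instance Norm and Layer Norm;
        # the original implementation clips it to [0, 1] after each update.
        self.rho = nn.Parameter(torch.full((1, num_features, 1, 1), 0.9))

    def forward(self, x, gamma, beta):
        # Instance Norm statistics: per sample, per channel.
        in_mean = x.mean(dim=(2, 3), keepdim=True)
        in_var = ((x - in_mean) ** 2).mean(dim=(2, 3), keepdim=True)
        x_in = (x - in_mean) / torch.sqrt(in_var + self.eps)
        # Layer Norm statistics: per sample, over channels and space.
        ln_mean = x.mean(dim=(1, 2, 3), keepdim=True)
        ln_var = ((x - ln_mean) ** 2).mean(dim=(1, 2, 3), keepdim=True)
        x_ln = (x - ln_mean) / torch.sqrt(ln_var + self.eps)
        x_hat = self.rho * x_in + (1 - self.rho) * x_ln
        # gamma/beta of shape (B, C) are broadcast over the spatial dims.
        return x_hat * gamma.unsqueeze(2).unsqueeze(3) + beta.unsqueeze(2).unsqueeze(3)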

Unlike the exaggerated anime style, our cartoon style is more realistic: it must have the clean, cute simplicity of a cartoon while retaining a clear sense of identity. To this end we add a Face ID Loss: a pretrained face recognition model extracts ID features from both the photo and the cartoon, and the cosine distance between them constrains the generated cartoon.
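The loss itself fits in a few lines. Below is a minimal sketch; face_encoder is a stand-in for the pretrained recognition model (the repo uses MobileFaceNet weights from InsightFace_Pytorch), and the helper name is illustrative:

import torch.nn.functional as F

def face_id_loss(face_encoder, photo, cartoon):
    # Extract ID embeddings and L2-normalize them.
    emb_photo = F.normalize(face_encoder(photo), dim=1)
    emb_cartoon = F.normalize(face_encoder(cartoon), dim=1)
    # Cosine distance: 0 when the photo and cartoon identities agree.
    return (1 - (emb_photo * emb_cartoon).sum(dim=1)).mean()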

In addition, we propose a normalization method called Soft-AdaLIN (Soft Adaptive Layer-Instance Normalization), which, during de-normalization, fuses the encoder's mean and variance (photo features) with the decoder's mean and variance (cartoon features).
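Reusing the AdaLIN sketch above, the fusion can be pictured as learned per-channel weights that blend content (encoder-derived) statistics into the style (decoder-side) statistics. This is an illustrative simplification, not the repo's exact SoftAdaLIN module:

import torch
import torch.nn as nn

class SoftAdaLIN(nn.Module):
    def __init__(self, num_features):
        super().__init__()
        self.norm = AdaLIN(num_features)  # the AdaLIN sketch above
        # Learned blending weights between content and style statistics.
        self.w_gamma = nn.Parameter(torch.zeros(1, num_features))
        self.w_beta = nn.Parameter(torch.zeros(1, num_features))

    def forward(self, x, content_gamma, content_beta, style_gamma, style_beta):
        # De-normalize with a soft mix of photo (content) and cartoon (style) stats.
        soft_gamma = (1 - self.w_gamma) * style_gamma + self.w_gamma * content_gamma
        soft_beta = (1 - self.w_beta) * style_beta + self.w_beta * content_beta
        return self.norm(x, soft_gamma, soft_beta)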

Architecturally, building on U-GAT-IT, we add two hourglass modules before the encoder and two after the decoder, progressively strengthening the model's feature abstraction and reconstruction abilities.
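For intuition, an hourglass block recursively downsamples and upsamples with a skip connection at each scale. The sketch below is generic (channel counts, depth, and layer choices are illustrative, and spatial dims are assumed divisible by 2**depth), not the repo's exact module:

import torch.nn as nn

class HourGlass(nn.Module):
    def __init__(self, channels, depth=3):
        super().__init__()
        self.down = nn.Conv2d(channels, channels, 3, stride=2, padding=1)
        self.inner = (HourGlass(channels, depth - 1) if depth > 1
                      else nn.Conv2d(channels, channels, 3, padding=1))
        self.up = nn.Upsample(scale_factor=2, mode='bilinear', align_corners=False)
        self.skip = nn.Conv2d(channels, channels, 3, padding=1)

    def forward(self, x):
        # Full-resolution skip branch plus a half-resolution inner branch.
        return self.skip(x) + self.up(self.inner(self.down(x)))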

Because experimental data is relatively scarce, we process it into a fixed pattern to reduce training difficulty: first detect the face and its landmarks, rotate and rectify the image based on the landmarks, crop it to a uniform standard, and then feed the cropped head shot into a portrait segmentation model to remove the background.
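A hedged sketch of that fixed pattern, using the face-alignment library from the dependency list (the repo's real pipeline is data_process.py; the helper name, margin, and output size here are illustrative):

import cv2
import numpy as np
import face_alignment

# LandmarksType._2D is the older name; newer face-alignment releases call it TWO_D.
fa = face_alignment.FaceAlignment(face_alignment.LandmarksType._2D, device='cpu')

def align_and_crop(img_rgb, margin=0.3, size=256):
    faces = fa.get_landmarks(img_rgb)
    if not faces:
        return None  # no face detected
    pts = faces[0]
    # Rotate so the line between the eye centers is horizontal.
    left_eye, right_eye = pts[36:42].mean(0), pts[42:48].mean(0)
    angle = np.degrees(np.arctan2(right_eye[1] - left_eye[1],
                                  right_eye[0] - left_eye[0]))
    h, w = img_rgb.shape[:2]
    M = cv2.getRotationMatrix2D((w / 2, h / 2), angle, 1.0)
    rotated = cv2.warpAffine(img_rgb, M, (w, h))
    pts = cv2.transform(pts[None], M)[0]
    # Expand the landmark bounding box by a fixed ratio and crop.
    x0, y0 = pts.min(0)
    x1, y1 = pts.max(0)
    dx, dy = (x1 - x0) * margin, (y1 - y0) * margin
    crop = rotated[max(int(y0 - dy), 0):min(int(y1 + dy), h),
                   max(int(x0 - dx), 0):min(int(x1 + dx), w)]
    # The crop would then pass through the head segmentation model
    # (seg_model_384.pb) to whiten the background.
    return cv2.resize(crop, (size, size))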

Start

Install dependencies

The project's main dependencies are:

  • python 3.6
  • pytorch 1.4
  • tensorflow-gpu 1.14
  • face-alignment
  • dlib
  • onnxruntime

Clone:

git clone https://github.com/minivision-ai/photo2cartoon.git
cd ./photo2cartoon

Download resources

Google Drive | Baidu Netdisk (extraction code: y2ch)

  1. Pretrained photo-to-cartoon model: photo2cartoon_weights.pt (updated 2020-05-04), placed under the models directory.
  2. Head segmentation model: seg_model_384.pb, placed under the utils directory.
  3. Pretrained face recognition model: model_mobilefacenet.pth, placed under the models directory. (From: InsightFace_Pytorch)
  4. Open-source cartoon dataset: cartoon_data, containing trainB and testB.
  5. ONNX photo-to-cartoon model: photo2cartoon_weights.onnx (Google Drive), placed under the models directory.

Test

Convert a test photo (of a young Asian woman) to cartoon style:

python test.py --photo_path ./images/photo_test.jpg --save_path ./images/cartoon_result.png

Test the ONNX model

python test_onnx.py --photo_path ./images/photo_test.jpg --save_path ./images/cartoon_result.png

Training

1. Data preparation

The training data consists of real photos and cartoon portraits. To reduce training complexity, both kinds of data are preprocessed as follows:

  • Detect the face and its landmarks.
  • Rotate and rectify the face based on the landmarks.
  • Expand the landmark bounding box by a fixed ratio and crop out the face region.
  • Use the portrait segmentation model to whiten out the background.

We have open-sourced 204 processed cartoon images. You will also need about 1,000 portrait photos (to match the cartoon data, use photos of young Asian women where possible, ideally with faces larger than 200x200 pixels). Preprocess them with:

python data_process.py --data_path YourPhotoFolderPath --save_path YourSaveFolderPath

Organize the processed data in the following hierarchy: trainA and testA hold photo head shots, while trainB and testB hold cartoon head shots.

├── dataset
    └── photo2cartoon
        ├── trainA
            ├── xxx.jpg
            ├── yyy.png
            └── ...
        ├── trainB
            ├── zzz.jpg
            ├── www.png
            └── ...
        ├── testA
            ├── aaa.jpg 
            ├── bbb.png
            └── ...
        └── testB
            ├── ccc.jpg 
            ├── ddd.png
            └── ...

2. Training

Train from scratch:

python train.py --dataset photo2cartoon

Resume from pretrained weights:

python train.py --dataset photo2cartoon --pretrained_weights models/photo2cartoon_weights.pt

Multi-GPU training (batch_size=1 on a single GPU is still recommended):

python train.py --dataset photo2cartoon --batch_size 4 --gpu_ids 0 1 2 3

Q&A

Q: Why do the open-source model's results differ from those in the mini-program?

A: The open-source model's training data was collected from the internet. To get more refined results, the mini-program's model was trained on custom cartoon data (200+ images) at a higher input resolution. In addition, the mini-program's face feature extractor uses our in-house recognition model, which performs better than the open-source recognition model used in this project.

Q: How do you select the best model?

A: First train the model for 200k iterations, then use the FID metric to pick the best checkpoint; the model finally selected was the one at 90k iterations.
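One way to script that selection, assuming the third-party pytorch-fid package (not part of this repo; the paths below are illustrative): generate test-set translations with each checkpoint, then compute FID against the cartoon domain and keep the lowest-scoring checkpoint:

python -m pytorch_fid ./dataset/photo2cartoon/testB ./results/checkpoint_90k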

Q: About the face feature extraction model.

A: In our experiments, computing the Face ID Loss with our in-house recognition model trains far better than with the open-source one. If training becomes unstable, try setting the Face ID Loss weight to zero.

Q: Can the portrait segmentation model be used to segment half-body portraits?

A: No. It is a special-purpose model trained for this project; you must crop out the face region before feeding an image in.

Tips

The open-source model was trained on young Asian women and does not cover other demographics well; you can collect data for your target demographic and train accordingly. Our open platform offers a cartoonization service covering all demographics, which you are welcome to try. For custom cartoon styles, contact business development: 18852075216.

References

U-GAT-IT: Unsupervised Generative Attentional Networks with Adaptive Layer-Instance Normalization for Image-to-Image Translation [Paper][Code]

InsightFace_Pytorch