mirror of https://github.com/Zeyi-Lin/HivisionIDPhotos.git synced 2025-09-15 14:58:34 +08:00

一个轻量级的AI证件照制作算法。

Go to file

ZeYi Lin ca95e1e29b docs: face++ error code		2024-09-09 23:43:21 +08:00
.devcontainer	feat: add devcontainer support (#80 )	2024-09-08 22:26:59 +08:00
.github/workflows	fix: requirements	2024-09-08 16:04:07 +08:00
.vscode	opt	2024-09-05 17:05:52 +08:00
assets	docs:update image	2024-09-09 03:17:42 +08:00
demo	docs: mnn model	2024-09-07 18:16:04 +08:00
docs	docs: face++ error code	2024-09-09 23:43:21 +08:00
hivision	docs: reference	2024-09-09 18:40:48 +08:00
scripts	feat: birefnet model (#82 )	2024-09-09 01:10:35 +08:00
test	fix:package	2024-09-05 21:55:43 +08:00
.gitignore	feat: watermark (#83 )	2024-09-09 11:55:30 +08:00
app.py	feat: watermark enhance (#86 )	2024-09-09 18:33:56 +08:00
app.spec	update:some-code	2024-09-05 22:46:06 +08:00
deploy_api.py	refactor: choose handler	2024-09-08 04:06:22 +08:00
docker-compose.yml	docs: docker pull	2024-09-07 03:55:08 +08:00
Dockerfile	style: docker support gradio print	2024-09-07 02:31:49 +08:00
inference.py	feat: birefnet model (#82 )	2024-09-09 01:10:35 +08:00
LICENSE	Create LICENSE	2024-09-04 12:17:55 +08:00
README_EN.md	docs: sync multilingual	2024-09-09 03:07:43 +08:00
README_JP.md	docs: sync multilingual	2024-09-09 03:07:43 +08:00
README_KO.md	docs: readme	2024-09-09 03:12:23 +08:00
README.md	docs: reference	2024-09-09 18:40:48 +08:00
requests_api.py	fix api func name (#87 )	2024-09-09 19:53:49 +08:00
requirements-app.txt	feat: support gradio>=4.43.0	2024-09-06 15:13:34 +08:00
requirements-dev.txt	opt	2024-09-05 17:05:52 +08:00
requirements.txt	fix: requirements	2024-09-08 16:04:07 +08:00

README_EN.md

HivisionIDPhoto

English / 中文 / 日本語 / 한국어

Related Projects：

SwanLab: Used throughout the training of the portrait matting model for analysis and monitoring, as well as collaboration with lab colleagues, significantly improving training efficiency.

Recent Updates
Project Overview
Community
Preparation
Demo Startup
Python Inference
API Service Deployment
Docker Deployment
Contact Us
Contributors

🤩 Recent Updates

Online Experience: 、
2024.09.09: Added new Matting Model BiRefNet-v1-lite | Gradio added advanced parameter settings tab
2024.09.08: Added new Matting Model RMBG-1.4 | ComfyUI Workflow - HivisionIDPhotos-ComfyUI contributed by AIFSH
2024.09.07: Added Face Detection API Option Face++, achieving higher precision in face detection
2024.09.06: Added new matting model modnet_photographic_portrait_matting.onnx
2024.09.05: Updated Restful API Documentation
2024.09.02: Updated Adjust Photo KB Size, DockerHub
2023.12.01: Updated API Deployment (based on fastapi)

Project Overview

🚀 Thank you for your interest in our work. You may also want to check out our other achievements in the field of image processing, feel free to reach out: zeyi.lin@swanhub.co.

HivisionIDPhoto aims to develop a practical and systematic intelligent algorithm for producing ID photos.

It utilizes a comprehensive AI model workflow to recognize various user photo-taking scenarios, perform matting, and generate ID photos.

HivisionIDPhoto can achieve:

Lightweight matting (purely offline, fast inference with CPU only)
Generate standard ID photos and six-inch layout photos based on different size specifications
Support pure offline or edge-cloud inference
Beauty effects (waiting)
Intelligent formal wear change (waiting)

If HivisionIDPhoto helps you, please star this repo or recommend it to your friends to solve the urgent ID photo production problem!

🏠 Community

We have shared some interesting applications and extensions of HivisionIDPhotos built by the community:

HivisionIDPhotos-windows-GUI: Windows client application built by zhaoyun0071
HivisionIDPhotos-ComfyUI: ComfyUI ID photo processing workflow built by AIFSH

🔧 Preparation

Environment installation and dependencies:

Python >= 3.7 (project primarily tested on Python 3.10)
OS: Linux, Windows, MacOS

1. Clone the Project

git clone https://github.com/Zeyi-Lin/HivisionIDPhotos.git
cd  HivisionIDPhotos

2. Install Dependency Environment

It is recommended to create a python3.10 virtual environment using conda, then execute the following commands

pip install -r requirements.txt
pip install -r requirements-app.txt

3. Download Weight Files

Method 1: Script Download

python scripts/download_model.py --models all

Method 2: Direct Download

Store in the project's hivision/creator/weights directory:

modnet_photographic_portrait_matting.onnx (24.7MB): Official weights of MODNet, download
hivision_modnet.onnx (24.7MB): Matting model with better adaptability for pure color background replacement, download
rmbg-1.4.onnx (176.2MB): Open-source matting model from BRIA AI, download and rename to rmbg-1.4.onnx
birefnet-v1-lite.onnx(224MB): Open-source matting model from ZhengPeng7, download and rename to birefnet-v1-lite.onnx

4. Face Detection Model Configuration (Optional)

Extended Face Detection Model	Description	Documentation
MTCNN	Offline face detection model, high-performance CPU inference, default model, lower detection accuracy	Use it directly after cloning this project
Face++	Online face detection API launched by Megvii, higher detection accuracy, official documentation	Usage Documentation

5. GPU Inference Acceleration (Optional)

If you need to use NVIDIA GPU for accelerated inference, ensure that you have installed CUDA and cuDNN, then find the corresponding onnxruntime-gpu version to install according to the documentation, for example:

# CUDA 12.x, cuDNN 8
pip install onnxruntime-gpu==1.18.0

After completing this, calling models like birefnet-v1-lite will utilize GPU acceleration for inference.

🚀 Run Gradio Demo

python app.py

Running the program will generate a local web page where you can perform operations and interact with ID photos.

🚀 Python Inference

Core parameters:

-i: Input image path
-o: Save image path
-t: Inference type, options include idphoto, human_matting, add_background, generate_layout_photos
--matting_model: Selection of portrait matting model weights
--face_detect_model: Selection of face detection model

More parameters can be viewed by running python inference.py --help

1. ID Photo Production

Input 1 photo to obtain 1 standard ID photo and 1 high-definition ID photo in 4-channel transparent png

python inference.py -i demo/images/test.jpg -o ./idphoto.png --height 413 --width 295

2. Portrait Matting

python inference.py -t human_matting -i demo/images/test.jpg -o ./idphoto_matting.png --matting_model hivision_modnet

3. Add Background Color to Transparent Image

Input 1 4-channel transparent png to obtain 1 image with added background color

python inference.py -t add_background -i ./idphoto.png -o ./idphoto_ab.jpg  -c 4f83ce -k 30 -r 1

4. Obtain Six-Inch Layout Photo

Input 1 3-channel photo to obtain 1 six-inch layout photo

python inference.py -t generate_layout_photos -i ./idphoto_ab.jpg -o ./idphoto_layout.jpg  --height 413 --width 295 -k 200

⚡️ Deploy API Service

Start Backend

python deploy_api.py

Request API Service

For detailed request methods, please refer to the API Documentation, which includes the following request examples:

🐳 Docker Deployment

1. Pull or Build Image

Choose one of the following methods

Method 1: Pull the latest image:

docker pull linzeyi/hivision_idphotos

Method 2: Directly build the image from Dockerfile:

After ensuring that the model weight file hivision_modnet.onnx is placed in hivision/creator/weights, execute the following in the project root directory:

docker build -t linzeyi/hivision_idphotos .

Method 3: Build using Docker Compose:

Ensure that the model weight file hivision_modnet.onnx is placed in hivision/creator/weights, then execute the following in the project root directory:

docker compose build

2. Run Services

Start Gradio Demo Service

Run the following command, and you can access it locally at http://127.0.0.1:7860.

docker run -d -p 7860:7860 linzeyi/hivision_idphotos

Start API Backend Service

docker run -d -p 8080:8080 linzeyi/hivision_idphotos python3 deploy_api.py

Start Both Services Simultaneously

docker compose up -d

Environment Variables

This project provides some additional configuration options, which can be set using environment variables:

Environment Variable	Type	Description	Example
FACE_PLUS_API_KEY	Optional	This is your API key obtained from the Face++ console	`7-fZStDJ····`
FACE_PLUS_API_SECRET	Optional	Secret corresponding to the Face++ API key	`VTee824E····`

Example of using environment variables in Docker:

docker run  -d -p 7860:7860 \
    -e FACE_PLUS_API_KEY=7-fZStDJ···· \
    -e FACE_PLUS_API_SECRET=VTee824E···· \
    linzeyi/hivision_idphotos

📖 Cite Projects

MTCNN:

@software{ipazc_mtcnn_2021,
    author = {ipazc},
    title = {{MTCNN}},
    url = {https://github.com/ipazc/mtcnn},
    year = {2021},
    publisher = {GitHub}
}

ModNet:

@software{zhkkke_modnet_2021,
    author = {ZHKKKe},
    title = {{ModNet}},
    url = {https://github.com/ZHKKKe/MODNet},
    year = {2021},
    publisher = {GitHub}
}

💻 Development Tips

1. How to Modify Preset Sizes?

Modify size_list_EN.csv and then run app.py again, where the first column is the size name, the second column is the height, and the third column is the width.

📧 Contact Us

If you have any questions, please email zeyi.lin@swanhub.co

Contributors

Zeyi-Lin、SAKURA-CAT、Feudalman、swpfY、Kaikaikaifang、ShaohonChen、KashiwaByte

README_EN.md

HivisionIDPhoto

Table of Contents

🤩 Recent Updates

Project Overview

🏠 Community

🔧 Preparation

1. Clone the Project

2. Install Dependency Environment

3. Download Weight Files

4. Face Detection Model Configuration (Optional)

5. GPU Inference Acceleration (Optional)

🚀 Run Gradio Demo

🚀 Python Inference

1. ID Photo Production

2. Portrait Matting

3. Add Background Color to Transparent Image

4. Obtain Six-Inch Layout Photo

⚡️ Deploy API Service

Start Backend

Request API Service

🐳 Docker Deployment

1. Pull or Build Image

2. Run Services

Environment Variables

📖 Cite Projects

💻 Development Tips

📧 Contact Us

Contributors

StarHistory

README_EN.md Unescape Escape

HivisionIDPhoto

Table of Contents

🤩 Recent Updates

Project Overview

🏠 Community

🔧 Preparation

1. Clone the Project

2. Install Dependency Environment

3. Download Weight Files

4. Face Detection Model Configuration (Optional)

5. GPU Inference Acceleration (Optional)

🚀 Run Gradio Demo

🚀 Python Inference

1. ID Photo Production

2. Portrait Matting

3. Add Background Color to Transparent Image

4. Obtain Six-Inch Layout Photo

⚡️ Deploy API Service

Start Backend

Request API Service

🐳 Docker Deployment

1. Pull or Build Image

2. Run Services

Environment Variables

📖 Cite Projects

💻 Development Tips

📧 Contact Us

Contributors

StarHistory

README_EN.md