zi2zi - HackMD

# <font class="h2">**字到字 zi2zi**</font> <style> .h1 { background: linear-gradient(135deg,#fff,#73BF00) ; color: #BF0060; display:block; padding: 6px 5px; border-radius: 4px; } .h2 { background: linear-gradient(180deg,#fff 50%,#73BF00) ; color: #467500; display:block; padding: 6px 5px; border-radius: 8px; border-bottom: 3px solid #467500; } </style> ![](https://hackmd.io/_uploads/Hy3Z0hR_h.png =40%x) [TOC] ## zi2zi: Master Chinese Calligraphy with Conditional Adversarial Networks - **WEB** https://kaonashi-tyc.github.io/2017/04/06/zi2zi.html - **官方** ![](https://hackmd.io/_uploads/BJOHl0LP3.png =7%x) https://github.com/kaonashi-tyc/zi2zi - **修正後** ![](https://hackmd.io/_uploads/BJOHl0LP3.png =7%x) https://github.com/chiaoooo/zi2zi_tensorflow ``` 生成環境指令： conda env export --name <env_name> > environment.yml 開啟環境指令： conda env create -f environment.yml ``` **V100 環境建立長這樣：** ![image](https://hackmd.io/_uploads/B1gWazbJPC.png) ## 使用 Anaconda Prompt (Anaconda3) #### Requirements ``` * Python = 3.7 * CUDA * cudnn * Tensorflow = 1.14.0 * Pillow * numpy * scipy = 1.2.1 * imageio = 2.9.0 ``` #### 建議使用虛擬環境！！！電腦要有 <font color="hotpink">**NVIDIA GPU**</font>！！！ ``` conda create --name zi2zi python=3.7 conda activate zi2zi conda install tensorflow-gpu==1.14.0 git clone https://github.com/Circle472/zi2zi_tensorflow.git cd zi2zi_tensorflow ``` #### 測試 tensorflow-gpu ``` >python >>>import tensorflow as tf >>>tf.test.is_gpu_available() ``` --- <br> ### 前置選項1：製作 charset，自己指定你想生成的字將要 train 的字放入 train.txt ![image](https://hackmd.io/_uploads/B1t44SLxA.png) 將要 val 的字放入 val.txt ![image](https://hackmd.io/_uploads/SJH8VrUxR.png) * 做 train 的 json 檔 ``` python m1_json_train.py.py ``` * 做 val 的 json 檔 ``` python m2_json_val.py ``` * 合併兩個 json 檔 ``` python m3_merge_json.py.py ``` **執行完會得到 cjk.json 就代表成功！** 要自己替換掉原本的 charset/cjk.json ### 前置選項2：製作 charset，自己指定你想生成的字將要 train 的字放入 train.txt ![image](https://hackmd.io/_uploads/B1t44SLxA.png) * 做 train 的 json 檔 ``` python m1_json_train.py.py ``` * 自動生成 big5-train （取差集）　val 的 json 檔 ``` python bigfive_val.py ``` * 做 val 的 json 檔 ``` python m2_json_val.py ``` * 合併兩個 json 檔 ``` python m3_merge_json.py.py ``` **執行完會得到 cjk.json 就代表成功！** 要自己替換掉原本的 charset/cjk.json --- <br> <h2 style="color:green;">程式執行</h2> ### 建立環境使用下面指令可以直接生成環境！！！！！ ``` conda env create -f environment.yml ``` 建立 sample 資料夾 ``` mkdir image_train mkdir image_val ``` *--srcfont: 來源字體路徑位置 --dstfont: 目標字體路徑位置 --charset: 要讀取的字集 e.g. CN、CNT、JP、KR、<font color=red>TWTrain</font>、<font color=red>TWVal</font> --samplecount:取幾張圖訓練（數字） --sampledir:圖片存放位置（對應 package.py 的 --dir） --label: 類別編號，在<font color=red>同模型訓練多字體</font>時需更換，ex: 2、3... --shuffle: 是否重新排序字集中文字的排序 e.g. 0: false, 1: true* 這裡設定<font color=hotpink>**來源字體為源樣黑體，目標字體為 CircleFont，訓練字數 1000 字**</font>。 ``` python font2img.py --src_font=font/GenYoGothicTW-EL-01.ttf --dst_font=font/CircleFont.ttf --charset=TWTrain --sample_count=1000 --sample_dir=image_train --label=1 --filter=1 --shuffle=1 python font2img.py --src_font=font/GenYoGothicTW-EL-01.ttf --dst_font=font/CircleFont.ttf --charset=TWVal --sample_count=670 --sample_dir=image_val --label=1 --filter=1 --shuffle=0 ``` ## 建立訓練、驗證資料 object **得到 train.obj 和 val.obj 在 save_dir 資料夾** 得到 train.obj save_dir 預設 `experiment/data` ``` python package.py --dir=image_train --save_dir=experiment/data --split_ratio=0.1 ``` 得到 val.obj 會在最後驗證步驟 infer.py 用到（這裡 --save_dir 與 infer.py 的 --source_obj 相同） ``` python package.py --dir=image_val --save_dir=experiment/data/val --split_ratio=1 ``` ## TRAIN *--experimentdir: 訓練要存的資料夾（已存在），會在內建立 checkpoint、log、sample 資料夾 --experimentid: 模型編號（數字） --batchsize: 設定 1 epoch ? batch（數字）* ``` python train.py --experiment_dir=experiment --experiment_id=1 --batch_size=16 --lr=0.001 --epoch=1000 --sample_steps=50 --schedule=20 --L1_penalty=100 --Lconst_penalty=15 ``` ## 推論結果 INFER *--modeldir: 訓練後的 checkpoint 資料夾 --batchsize: 圖片中的文字列數 --experimentids: 對應 font2img 的 --label 數字（預設 1 代表要推論出 label=1 的驗證資料集）* ``` python infer.py --model_dir=experiment/checkpoint/experiment_1_batch_16 --batch_size=1 --source_obj=experiment/data/val/val.obj --embedding_ids=1 --save_dir=experiment/infer_1 ``` <p style="color:green;font-weight:bold;">如果要推論沒訓練過的字（沒看過的字）:</p> 把46-56行改成下面這樣 ``` def draw_example(ch, src_font, dst_font, canvas_size, x_offset, y_offset, filter_hashes): dst_img = draw_single_char(ch, dst_font, canvas_size, x_offset, y_offset) # check the filter example in the hashes or not dst_hash = hash(dst_img.tobytes()) if dst_hash in filter_hashes: src_img = draw_single_char(ch, src_font, canvas_size, x_offset, y_offset) example_img = Image.new("RGB", (canvas_size * 2, canvas_size), (255, 255, 255)) example_img.paste(src_img, (canvas_size, 0)) return example_img src_img = draw_single_char(ch, src_font, canvas_size, x_offset, y_offset) example_img = Image.new("RGB", (canvas_size * 2, canvas_size), (255, 255, 255)) example_img.paste(dst_img, (0, 0)) example_img.paste(src_img, (canvas_size, 0)) return example_img ``` 並重新執行　 * python font2img.py --src_font=font/GenYoGothicTW-EL-01.ttf --dst_font=font/CircleFont.ttf --charset=TWVal --sample_count=670 --sample_dir=image_val --label=1 --filter=1 --shuffle=0 * python package.py --dir=image_val --save_dir=experiment/data/val --split_ratio=1 :::warning [點我進入打包字體教學！](https://hackmd.io/@h93YMTP_SrK5XODkOdtuKg/Sk20ATBMp) :::

Syntax	Example	Reference
# Header	Header	基本排版
- Unordered List	Unordered List
1. Ordered List	Ordered List
- [ ] Todo List	Todo List
> Blockquote	Blockquote
Bold font	Bold font
Italics font	Italics font
~~Strikethrough~~	~~Strikethrough~~
19^th^	19^th
H~2~O	H₂O
++Inserted text++	Inserted text
==Marked text==	Marked text
[link text](https:// "title")	Link
![image alt](https:// "title")	Image
`Code`	`Code`	在筆記中貼入程式碼
```javascript var i = 0; ```	`var i = 0;`
:smile:		Emoji list
{%youtube youtube_id %}	Externals
$L^aT_eX$	L^aT_eX
:::info This is a alert area. :::	This is a alert area.