10/1進度紀錄 AIGC

# 10/1進度紀錄 AIGC ## 陳孟蓉 ### 跑 miragenews pipeline: 1. load dataset from hugging face 2. encdoe dataset 3. train model 4. compile encode py檔 5. test - 他的多模態偵測器是直接基於MiRAGe-Img和MiRAGe-Txt的預測進行推斷(用兩個模型) dataset:https://huggingface.co/datasets/anson-huang/mirage-news/viewer/default/test1_nyt_mj?views%5B%5D=test1_nyt_mj&views%5B%5D=train {"text": "A man pretends to be injured in a staged truck bomb attack for publicity in Lakki Marwat, Pakistan on Friday.", "label": 1} {"text": "Former French Prime Minister Dominique de Villepin manipulated the media with false information after the verdict was issued in Paris on Thursday.", "label": 1} result: ![image](https://hackmd.io/_uploads/BJcAMzqhlx.png) 1. true 0 = real（真新聞）、1 = fake（假新聞）。 2. prob prob: 0.0116 → 模型認為這筆資料是假新聞的機率約 1.16% 3. pred * 模型的最終預測標籤（0 或 1），依據 threshold 判斷。 * threshold 預設是 0.5，prob >= threshold → pred=1(假) ## 廖奕皓 ### 重現FreqNet論文：利用ForenSynths資料集(progan/{categories}/(0_real, 1_fake))，訓練模型，進行accuracy測試。目前已完成訓練(85 epochs)。測試完 progan本身提供的test資料集(無訓練過): ![螢幕擷取畫面 2025-10-01 134110](https://hackmd.io/_uploads/By7w-Bq3xe.png) 結果: 不好，感覺模型沒收斂或者有地方沒用好測試中 train資料集中的 **種類car**(有訓練過): 目的: 看是不是模型沒收斂還是別的問題 ![image](https://hackmd.io/_uploads/HJfKtH93xx.png) 問題: 訓練完好像因為error沒有存到checkpoint裡面...........................，只能之後重跑 = =