當 IRB 核准不再是免死金牌：研究倫理寫作的進階論證

從「宣告合規」升級到「論證正當性」——同意範圍對齊、AI 揭露、重製與隱私的權衡，以及審稿人真正在看的倫理推理

進階 · 約 14 分鐘 ·#研究倫理#學術寫作#知情同意#AI揭露#可重製性#資料隱私

當「我們有 IRB 核准」不再是免死金牌：倫理敘事的進階寫法

你已經知道作者署名（authorship）、利益揭露（conflict of interest, COI）與資料誠信（data integrity）是論文倫理的三根支柱。但請想像一個情境：你的稿件附上了 IRB 核准文號、寫了標準的「informed consent was obtained」一句，方法段也照規範交代了。結果審稿人卻回了一句讓你冒冷汗的話：

"The authors state that consent was obtained, but given the secondary use of behavioral log data and the vulnerable population involved, the manuscript does not explain how consent scope was matched to the actual analyses performed."

問題不在於你「有沒有做倫理程序」，而在於你「有沒有把倫理推理（ethical reasoning）寫成可被檢驗的論述」。進階的倫理寫作，重點從宣告合規（declaring compliance）轉向論證正當性（arguing justification）。這篇文章假設你已掌握入門的三支柱，我們直接進入更難、也更常被退稿卡住的層次：同意範圍與資料用途的對齊、演算法與 AI 的倫理揭露、可重製性與倫理的張力，以及如何把這些寫成審稿人信服的文字。

研究倫理與寫作進階概念示意圖

從「合規語言」到「論證語言」：倫理段落的修辭升級

入門篇教你寫出合規句（compliance statement）：誰核准、誰同意、編號多少。進階寫作要求你在此之上，補上為什麼這個程序對應到你實際做的分析。審稿人——尤其教育科技、學習分析這類處理人類行為數據的領域——已經不滿足於「有做」，他們要看「做得對不對」。

語言上的差別非常具體。比較下面兩句：

不佳（只宣告，留下漏洞）：

"Informed consent was obtained from all participants. The study was approved by the IRB (No. 202507EM058)."

改良（宣告＋對齊範圍＋指出限制）：

"Written informed consent was obtained from all participants prior to data collection (IRB No. 202507EM058). The consent form explicitly covered the secondary analysis of interaction logs for research purposes; physiological data, which fell outside the original consent scope, were excluded from the present analyses. Participants retained the right to withdraw their data at any point, and three participants who exercised this right were removed before modeling."

差別在哪？第二句把「同意」這個動作拆解成可檢驗的維度：範圍（scope）、排除規則（exclusion）、可撤回性（revocability）、撤回後的實際處置。這正是進階倫理寫作的核心動作——把一個籠統的倫理宣稱，分解成審稿人可以逐項核對的小命題。

一個實用的自我檢查：每當你寫下一句倫理宣稱，問自己「審稿人能不能反問『那……呢？』」。如果一句「consent was obtained」會引出「那二次使用呢？那敏感資料呢？那未成年者呢？」，就代表這句話的論證密度（argumentative density）不足。

同意範圍與分析用途的對齊：被低估的退稿地雷

學習分析、教育數據探勘領域有一個結構性風險：資料常常是為了某個目的收集、卻被用於另一個分析。學生在課堂上使用 AI 助教，當下同意的是「改善教學體驗」；但你後來想拿這批對話軌跡（dialogue logs）做認知層次（Bloom's taxonomy）的研究分析。這兩件事的同意基礎並不自動相等。

進階寫作要處理的，是同意—用途對齊（consent–use alignment）的論證。這裡有三種常見情境，每種的寫法不同：

情境一：原始同意已涵蓋研究用途。 直接點明範圍，並引述同意書語言。

"The platform's consent framework, agreed to at account creation, explicitly permits the use of de-identified learning records for educational research. The analyses reported here fall squarely within this scope."

情境二：原始同意未涵蓋，但資料已充分去識別化、走的是 IRB 豁免（waiver of consent）路徑。 必須說明豁免的正當理由，而非含糊帶過。

不佳：

"Consent was waived because the data were anonymous."

改良：

"Because the analysis used only fully de-identified behavioral logs with no re-identification pathway, the IRB granted a waiver of informed consent under the minimal-risk provision. No demographic attributes that could enable re-identification (e.g., rare program–cohort combinations) were retained in the analytic dataset."

注意改良版做了兩件事：一是說明豁免的法律／倫理依據（minimal-risk provision），二是主動回應「匿名真的匿名嗎」這個審稿人必問的疑慮——明確交代了準識別碼（quasi-identifiers）的處理。光說「anonymous」是不夠的，因為審稿人知道高維行為資料的去識別化往往是假象。

情境三：補做同意或採取 opt-out。 誠實寫出時間線，不要假裝同意是前瞻收集的。

"As the log data predated the research question, participants were contacted post hoc and given a 30-day window to opt out of the secondary analysis; data from non-respondents were retained under the IRB-approved opt-out protocol, while data from those who opted out were permanently excluded."

這裡的進階觀念是：倫理敘事必須與資料時間線（data provenance timeline）一致。審稿人若發現你的「prospective consent」其實是事後補的，信任會崩潰。寫作上寧可暴露複雜性，也不要為了好看而抹平時間線。

AI 與演算法的倫理揭露：新支柱

入門三支柱成形於 AI 大規模介入研究流程之前。今天，一篇處理教育數據的論文若用了大型語言模型（LLM）做標註、評分、生成或分析，沒有揭露 = 倫理瑕疵。這是一根正在快速成形的新支柱，也是當前審稿最敏感的地帶之一。

要揭露什麼？至少四個維度：

使用範圍：LLM 用在哪個環節？資料清洗、標註、評分、文字生成、還是寫作潤飾？
可重製性風險：模型版本、時間點、參數（如 temperature、reasoning effort）——因為 LLM 輸出不確定，同一 prompt 在不同版本可能給出不同結果。
人類監督（human oversight）：模型輸出是否經人工校驗？標註的人機一致性（agreement）如何？
偏誤（bias）考量：模型是否可能對某類學生、某類語言表達系統性偏誤？

不佳（隱匿或輕描淡寫）：

"Responses were scored using AI."

改良（可檢驗的揭露）：

"Open-ended responses were scored by GPT-4o (version dated 2024-08, temperature = 0) using a fixed rubric prompt (Appendix B). To assess reliability, two human raters independently scored a stratified 15% subsample; agreement between the model and the human consensus was κ = 0.81. Discrepancies were concentrated in responses mixing Chinese and English, suggesting a language-related limitation that we discuss in §5.3."

這段示範了進階揭露的完整度：模型身分、版本、參數、prompt 可取得性、信度驗證、以及主動指認偏誤的方向。最後一句尤其關鍵——主動承認模型在中英混雜文本上的弱點，反而增加而非減損可信度。審稿人最怕的是作者假裝 AI 沒有盲點。

另外注意「寫作潤飾」這個灰色地帶。多數期刊現在要求揭露生成式 AI 在寫作上的協助。寫法上要區分「AI 協助修飾語言」與「AI 生成研究內容（idea, analysis, interpretation）」——前者通常可接受並揭露，後者觸及作者署名的本質（AI 不能當作者，因為無法為內容負責、無法簽署 authorship criteria）。

改良（寫作協助揭露）：

"Generative AI (a large language model) was used solely to improve the grammatical fluency of author-written drafts. All research design, analysis, interpretation, and claims are the authors' own; the authors take full responsibility for the entire content."

可重製性與倫理的張力：當「公開資料」撞上「保護受試者」

進階研究者會遇到一個真實的兩難：開放科學（open science）要求你公開資料與程式碼以利重製（reproducibility），但研究倫理要求你保護受試者隱私。這兩個善的目標會直接衝突，而論文必須論證你如何權衡（trade-off），而不是假裝衝突不存在。

低段位的處理是二選一：要嘛不公開（被批不透明），要嘛全公開（洩漏隱私）。進階的處理是設計分層揭露（tiered disclosure）並把理由寫清楚。

看一個例子

假設你研究的是學生與 AI 助教的對話如何影響認知投入。原始對話含大量可識別個人的內容（學號、口頭提及的姓名、特定課程細節）。你想支持重製，又不能直接丟出原始逐字稿。一段成熟的「Data Availability Statement」可以這樣寫：

不佳（含糊推託）：

"Data are available upon reasonable request."

這句話現在被許多期刊與後設研究（meta-research）視為近乎無意義——研究顯示這類承諾的實際兌現率極低。

改良（分層、具體、可操作）：

"We adopt a tiered data-sharing approach to balance reproducibility against participant privacy. (1) The fully de-identified, aggregated feature matrix and all analysis code are openly available at [repository DOI], sufficient to reproduce every figure and statistical test. (2) The raw dialogue transcripts contain personal identifiers and cannot be openly released; qualified researchers may apply for access under a data-use agreement reviewed by the host institution's IRB, with re-identification expressly prohibited. (3) A synthetic, structurally faithful sample of transcripts is provided to illustrate the data format without exposing real participants."

這個寫法的進階之處：它不把「重製」與「隱私」當成單一開關，而是沿著資料的敏感度梯度（sensitivity gradient）拆成三層——完全開放的衍生資料、受控存取的原始資料、以及合成樣本作為格式示範。每一層都附上正當理由。審稿人看到的不是推託，而是經過設計的權衡。

動手試試

拿你正在寫（或讀過）的一篇論文，找出它的「Data Availability Statement」與倫理段落，逐句做這個練習：在每句旁邊標註它回答了哪個維度（範圍／排除／可撤回／去識別化方法／AI 揭露／重製層級）。空白最多的維度，就是審稿人最可能下手的地方。把那幾格補起來，你的倫理論述密度就會明顯提升。

不止合規，還要對齊「敏感資料敘事」原則

回到寫作本身。當論文需要在正文（而非只在方法段）討論敏感資料——例如生理數據、性別認同、社經背景——倫理寫作會延伸到敘事設計（narrative design）。一個被低估的進階技巧是：先說「為什麼收集」，再說「如何保護」，最後給受試者「控制權」，這個順序本身就是一種倫理表態。

不佳（把人當資料點）：

"We collected students' gender, socioeconomic status, and stress levels to predict performance."

改良（價值先行、保護明確、保留控制權）：

"To examine whether learning support is equitably distributed, we invited participants to optionally share demographic and physiological indicators. These attributes were used only for aggregate, group-level analysis and never for individual assessment; participants could decline any item and could withdraw shared data at any time through their account settings."

差別不只在禮貌。第二句把收集行為重新框定為邀請（invitation）而非抽取（extraction），明確區分「群體分析」與「個人評估」，並指出可撤回的具體途徑。這種寫法在處理弱勢族群（vulnerable populations）或敏感維度時，往往是審稿人判斷研究團隊是否「真的把倫理當回事」的依據。

重點回顧

從宣告轉向論證：入門寫「有做倫理程序」，進階寫「為什麼這個程序對應到實際分析」。每句倫理宣稱都應能承受審稿人的「那……呢？」追問。
同意—用途對齊是退稿地雷：尤其二次使用（secondary use）的資料，必須論證同意範圍涵蓋你實際做的分析，並讓倫理敘事與資料時間線一致。
AI 揭露是新支柱：使用 LLM 做標註、評分或分析時，揭露模型版本、參數、人類監督與信度，並主動指認偏誤方向；寫作協助與研究貢獻要分清。
重製與隱私要明寫權衡：用分層揭露（tiered disclosure）取代「upon reasonable request」或全有全無，沿資料敏感度梯度設計開放層級。
敏感資料敘事有順序：價值先行、保護明確、保留撤回控制權，把收集框定為邀請而非抽取。

深入探討（研究所視角）

如果你要把倫理寫作推到研究所與審稿層次，有三個更深的議題值得鑽研。

其一，從程序倫理（procedural ethics）到日常倫理（ethics in practice）的落差。 Guillemin 與 Gillam（2004）區分了「拿到 IRB 核准」這類程序倫理，與研究現場不斷浮現的「倫理重要時刻（ethically important moments）」——例如某位學生在對話中透露了心理困擾，你的資料蒐集系統該如何反應？高水準的論文會在 Limitations 或 Discussion 中誠實處理這些非程序性的判斷，而非假裝 IRB 核准窮盡了所有倫理問題。把這種反身性（reflexivity）寫進論文，是成熟研究者的標誌。

其二，去識別化的理論極限。 進階研究者應理解，對高維行為資料而言，「匿名」是一個程度問題而非二元狀態。差分隱私（differential privacy）提供了一個可量化的框架：透過注入校準過的雜訊，給出隱私損失的數學上界（以 ε 參數刻畫），讓「我們保護了隱私」從修辭變成可驗證的宣稱。即使你的論文未實作差分隱私，理解其精神也會讓你的去識別化論述更精準——例如改用「我們將 re-identification 風險降到 k-anonymity ≥ 某值」這類可檢驗語言，取代空泛的「data were anonymized」。

其三，倫理作為認識論問題（epistemological dimension）。 在學習分析這類領域，倫理與效度（validity）是綁在一起的。若你的同意基礎有偏（例如只有自願揭露生理數據的學生進入樣本），這不只是倫理問題，更是選擇偏誤（selection bias）這個方法學問題。進階寫作能把這兩個層面交織起來論證：「我們的 opt-in 設計保障了自主性，但也可能引入自我選擇偏誤，我們在 §5 以敏感度分析評估其影響」。當你能讓審稿人看見你同時用倫理與效度兩副眼鏡審視自己的設計，你的論文就從「合規」躍升到「可信」。這正是進階倫理寫作最終要達到的境界——倫理不是論文末尾的免責聲明，而是貫穿整個研究論證的一條主線。

← 上一篇

把研究倫理寫進論文：作者署名、利益揭露與資料誠信

--

5

32.3%

140.05

82.02%

62,201

AI 回覆桌面通知

聊天訊息通知

聲音通知

更多設定